Show HN: Quant Picker – which GGUF file fits your model and machine

Source: vettedconsumer.com · Published: 2026-06-13T11:34Z · Topic: artificial-intelligence

Quant Picker is a new tool that calculates which GGUF quantization level fits a given model and machine, balancing file size, quality, and context budget. It recommends the highest-precision quant that still leaves room for at least 8k tokens of context, following the community consensus summarized in the site's quantization guide.

How to read the table

Every GGUF model ships in multiple quantization levels — same model, different precision, different file size. The trade is simple: more bits = better quality = bigger file = less room left for context. This tool does the arithmetic for your exact machine: file size per quant, then whatever memory remains becomes your context budget (the KV cache eats it per token).
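
The arithmetic described above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual code: the function names, the 1 GB runtime overhead, the 4.85 bits-per-weight figure, and the 131072 bytes-per-token KV cost are all assumptions chosen for the example.

```python
def file_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate GGUF file size in GB (10^9 bytes) from bits per weight."""
    return params_billion * bits_per_weight / 8


def context_budget_tokens(memory_gb: float, file_gb: float,
                          kv_bytes_per_token: int,
                          overhead_gb: float = 1.0) -> int:
    """Tokens of KV cache that fit in the memory the weights leave free.

    overhead_gb is an assumed allowance for runtime buffers.
    """
    free_bytes = (memory_gb - file_gb - overhead_gb) * 1e9
    return max(0, int(free_bytes // kv_bytes_per_token))


# Hypothetical machine and model: 8B parameters at ~4.85 bits per weight,
# 12 GB of memory, and an assumed KV cost of 131072 bytes per token.
size = file_size_gb(8, 4.85)
budget = context_budget_tokens(12, size, 131072)
print(f"{size:.2f} GB file, about {budget} tokens of context")
```

Raising the bits per weight grows `size` and shrinks `budget`, which is the trade the table makes visible.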

The recommendation logic is the community consensus from our quantization guide: take the highest-precision quant that still leaves at least 8k tokens of context. Q6 and Q5 are near-lossless, Q4_K_M is the sweet spot, and below Q3 quality falls off fast. If you're forced down there, you usually want a smaller model instead: a bigger model at Q4 beats a smaller one at Q8, but a Q2 of anything beats very little.
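
One way to express that rule in code is a walk down the quant list from most to fewest bits, stopping at the first level that leaves enough context. The sketch below is hypothetical: `pick_quant`, the bits-per-weight table, and the 1 GB overhead are illustrative assumptions, and the tool's real values may differ.

```python
# Approximate bits per weight for each quant level, ordered from most to
# fewest bits. Illustrative assumptions, not values taken from the tool.
QUANTS = [
    ("Q8_0", 8.5),
    ("Q6_K", 6.56),
    ("Q5_K_M", 5.7),
    ("Q4_K_M", 4.85),
    ("Q3_K_M", 3.9),
    ("Q2_K", 3.0),
]
MIN_CONTEXT = 8192  # the "at least 8k tokens" threshold


def pick_quant(params_billion, memory_gb, kv_bytes_per_token, overhead_gb=1.0):
    """Return (name, context_tokens) for the highest-precision quant that
    leaves at least MIN_CONTEXT tokens, or None if nothing fits."""
    for name, bpw in QUANTS:
        file_gb = params_billion * bpw / 8
        free_bytes = (memory_gb - file_gb - overhead_gb) * 1e9
        tokens = int(free_bytes // kv_bytes_per_token) if free_bytes > 0 else 0
        if tokens >= MIN_CONTEXT:
            return name, tokens
    return None


# Hypothetical 8B model on an 8 GB machine, 131072 KV bytes per token.
print(pick_quant(8, 8, 131072))
```

A result of `None` corresponds to the case where even the lowest quant does not fit, which is the point at which a smaller model is the better choice.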

Honest limits

File sizes are computed from bits-per-weight, not scraped from Hugging Face — real files vary a little by quantizer version (K-quants vs I-quants, imatrix variants). The KV-cache math assumes a GQA-typical architecture; exotic models differ. And max context here is what fits — models also have their own context limits, and quality at extreme context is its own story. Treat the numbers as a reliable guide, not a contract.
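
To make the GQA assumption concrete, here is a rough sketch of the standard per-token KV-cache cost for a transformer: one key vector and one value vector per KV head per layer. The function name and the example shape (32 layers, 8 KV heads, head dimension 128, 16-bit cache) are assumptions for illustration, not figures from the tool.

```python
def kv_bytes_per_token(n_layers: int, n_kv_heads: int, head_dim: int,
                       bytes_per_value: int = 2) -> int:
    """KV-cache bytes per token: a key and a value vector per KV head per layer."""
    return 2 * n_layers * n_kv_heads * head_dim * bytes_per_value


# Assumed example shape with grouped-query attention: 8 KV heads.
gqa = kv_bytes_per_token(32, 8, 128)
# Same shape with one KV head per attention head (32 heads, no GQA).
mha = kv_bytes_per_token(32, 32, 128)
print(gqa, mha, mha // gqa)  # prints 131072 524288 4
```

In this example the non-GQA layout costs four times as much memory per token, which is why a model that departs from the GQA-typical shape can end up with a very different context budget than the table suggests.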

The tool family

Shopping for hardware instead? "Can I run it?" finds hardware that fits a model. Wondering if you should buy hardware at all? The cost calculator compares buying vs renting vs the API.

Read original on vettedconsumer.com → vettedconsumer.com/quant-picker/
