Local LLM Hardware Guides: Which Machine Runs Which Model

Local LLM Hardware Guides: Which Machine Runs Which Model

Every popular local LLM, and exactly which hardware runs it. Each guide is a full fit matrix: which GPUs and machines hold the model, the best GGUF quant, the file size, and theoretical plus owner-measured tokens per second. The numbers are computed from the same engine as our Can I Run It? calculator, so they match the tool exactly.

Not sure where to start? If you know your hardware, the fastest path is the calculator (it tests any model against your exact machine). These pages are the reverse: pick the model, see every machine that runs it.

The guides, by how much memory they need

Runs on almost any GPU (12 GB and up)

Needs about 16 GB

Needs about 24 GB

Needs 32 GB or more

Needs a 64 GB+ unified-memory box

Frontier scale: a 256 GB+ box, a multi-GPU server, or the cloud

Or use the tools directly

Get the Vetted Consumer newsletter

Reviews, buying advice, and field notes. Delivered monthly.

Almost there, check your inbox and click the confirmation link. ✓

Something went wrong, please try again, or email [email protected].