starter models
what local ai
What local AI can I run on a 16 GB laptop?
Typical 16 GB laptops where the real question is whether compact reasoning and coding models stay realistic locally. This page uses the maintained catalog and a calibrated hardware band to answer the common hardware-search version of the question without pretending that a public shared-device cluster already exists.
benchmark first
Use the benchmark before you trust the reference band.
This band is strong enough for serious 13B-class work, but 34B-class pulls still need more memory and steadier graphics headroom. The benchmark turns this reference guide into a machine-specific answer before you spend time downloading models that are too large for the actual browser-visible hardware.
runtime paths
Pick the runtime after you confirm the size band
Runtime choice comes second here. Use the benchmark to confirm the model size band, then use the runtime pages for the cleanest first pull inside Ollama, LM Studio, or llama.cpp.
Runtime page
Best local models for Ollama
Search intent: ollama best model
Best for the quickest path from benchmark result to a real local run.
Runtime guide + catalog coverage
Runtime page
Best local models for llama.cpp
Search intent: llama.cpp best model
Best for people who care about low-level control, serving flags, and GGUF tuning.
Runtime guide + catalog coverage
Runtime page
Best local models for LM Studio
Search intent: lm studio best model
Best for people who want a graphical model browser and easy GGUF pulls.
Runtime guide + catalog coverage
if you want more
Next-step models beyond this starting band
This band is already near the top of the current catalog, so the benchmark result matters more than another static stretch guess.
why this page is careful
Reference band, not fake proof
- Best for: Typical 16 GB laptops where the real question is whether compact reasoning and coding models stay realistic locally.
- Tradeoff: This band is strong enough for serious 13B-class work, but 34B-class pulls still need more memory and steadier graphics headroom.
- Calibration note: Solid for local chat and coding assistants when quantization is aggressive.
- Public-proof boundary: Specific device pages stay gated until shared benchmark evidence is strong enough to index safely.
evidence sources