Leaderboard
Real benchmarks from real hardware. Compare local LLM performance across different setups. Run the command below to contribute yours.
$ npm install -g metrillm@latest
$ metrillm
Requires Node 20+ and Ollama or LM Studio running.
Or run without installing: npx metrillm@latest
Trending
Last 7 days
- 1 gemma4:31b 1
- 1 MacBook Pro 1
- 1 Apple M4 Max 1
- Benchmarks: 244 total runs
- Models: 173 unique models
- Families: 42 model families
- Hardware: 17 unique CPUs
244 results
| # | CPU | Model | Size | RAM | Runtime | tok/s | TTFT | HW Fit | Quality | Global | Flags | Verdict |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Apple M4 Pro | gpt-oss-20b | — | 64 GB | LM-STUDIO GGUF | 118.2 | 2.2 s | 94 | 95 | 95 | THINK | Excellent |
| 2 | Intel Core™ Ultra 9 285K | gpt-oss:20b | 20.9B/MXFP4 | 47 GB | OLLAMA GGUF | 239.9 | 616 ms | 98 | 94 | 95 | THINK | Excellent |
| 3 | Apple M4 | openai/gpt-oss-20b | 20B/MXFP4 | 24 GB | LM-STUDIO MLX | 29.3 | 1.8 s | 99 | 93 | 95 | THINK | Excellent |
| 4 | Apple M4 | openai/gpt-oss-20b | 20B/MXFP4 | 32 GB | LM-STUDIO MLX | 40.7 | 1.3 s | 100 | 92 | 94 | THINK | Excellent |
| 5 | Apple M4 | qwen/qwen3-30b-a3b-2507 | 30B/4bit | 32 GB | LM-STUDIO MLX | 44.8 | 405 ms | 100 | 91 | 94 | | Excellent |
| 6 | Apple M4 Pro | nemotron-3-nano | — | 64 GB | LM-STUDIO GGUF | 93.3 | 257 ms | 100 | 90 | 93 | THINK | Excellent |
| 7 | AMD RYZEN AI MAX+ 395 | gpt-oss:20b | 20.9B/MXFP4 | 125 GB | OLLAMA GGUF | 47.1 | 1.8 s | 85 | 96 | 93 | ECO THINK | Excellent |
| 8 | Intel Core™ Ultra 9 285K | qwen3-coder:30b | 30.5B/Q4_K_M | 47 GB | OLLAMA GGUF | 207.9 | 513 ms | 96 | 87 | 90 | | Excellent |
| 9 | Apple M4 | unsloth/gemma-4-26b-a4b-it | 26B/Q3_K_M | 24 GB | LM-STUDIO GGUF | 24.9 | 817 ms | 85 | 92 | 90 | | Excellent |
| 10 | Apple M1 Max | qwen3:14b | 14.8B/Q4_K_M | 32 GB | OLLAMA GGUF | 24.0 | 300 ms | 97 | 87 | 90 | | Excellent |
| 11 | AMD RYZEN AI MAX+ 395 | nemotron-3-nano:latest | 31.6B/Q4_K_M | 125 GB | OLLAMA GGUF | 65.2 | 295 ms | 100 | 84 | 89 | | Excellent |
| 12 | AMD RYZEN AI MAX+ 395 | gemma4:26b | 25.8B/Q4_K_M | 125 GB | OLLAMA GGUF | 48.3 | 346 ms | 95 | 87 | 89 | | Excellent |
| 13 | Apple M4 Pro | qwen/qwen3-8b | 8B/4bit | 64 GB | LM-STUDIO MLX | 33.4 | 257 ms | 95 | 87 | 89 | THINK | Excellent |
| 14 | AMD RYZEN AI MAX+ 395 | lfm2:24b | 23.8B/Q4_K_M | 125 GB | OLLAMA GGUF | 95.4 | 202 ms | 100 | 83 | 88 | | Excellent |
| 15 | AMD RYZEN AI MAX+ 395 | lfm2:24b | 23.8B/Q4_K_M | 125 GB | OLLAMA GGUF | 85.3 | 123 ms | 100 | 83 | 88 | | Excellent |
| 16 | Apple M4 | gpt-oss-orchestrator:latest | 20.9B/MXFP4 | 24 GB | OLLAMA GGUF | 24.6 | 4.8 s | 80 | 91 | 88 | THINK | Excellent |
| 17 | AMD RYZEN AI MAX+ 395 | qwen3.5:35b-a3b | 36.0B/Q4_K_M | 125 GB | OLLAMA GGUF | 42.0 | 243 ms | 90 | 87 | 88 | | Excellent |
| 18 | Apple M4 | gpt-oss:20b | 20.9B/MXFP4 | 24 GB | OLLAMA GGUF | 24.3 | 4.5 s | 83 | 88 | 87 | THINK | Excellent |
| 19 | Apple M4 | gpt-oss-safeguard-20b-mlx | 20B/MXFP4 | 24 GB | LM-STUDIO MLX | 41.4 | 3.6 s | 82 | 89 | 87 | THINK | Excellent |
| 20 | AMD RYZEN AI MAX+ 395 | nemotron-3-nano:latest | 31.6B/Q4_K_M | 125 GB | OLLAMA GGUF | 59.9 | 199 ms | 100 | 82 | 87 | | Excellent |
...
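The TTFT column mixes seconds and milliseconds, so comparing rows by eye is easy to get wrong. The sketch below shows one way to normalize those cells before sorting. It is not part of metrillm: the helper name `ttft_to_ms` and the `rows` sample are hypothetical, and the sample values are copied from three rows of the table.

```python
import re


def ttft_to_ms(text: str) -> float:
    """Convert a TTFT cell such as '2.2 s' or '616 ms' to milliseconds."""
    match = re.fullmatch(r"\s*([0-9]+(?:\.[0-9]+)?)\s*(ms|s)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized TTFT value: {text!r}")
    value, unit = float(match.group(1)), match.group(2)
    # Seconds are scaled up; millisecond values pass through unchanged.
    return value * 1000.0 if unit == "s" else value


# (CPU, model, TTFT) cells copied from rows 1, 2 and 15 of the table.
rows = [
    ("Apple M4 Pro", "gpt-oss-20b", "2.2 s"),
    ("Intel Core™ Ultra 9 285K", "gpt-oss:20b", "616 ms"),
    ("AMD RYZEN AI MAX+ 395", "lfm2:24b", "123 ms"),
]

# Pick the row with the lowest time to first token.
fastest = min(rows, key=lambda row: ttft_to_ms(row[2]))
print(fastest[0], fastest[1], ttft_to_ms(fastest[2]), "ms")
```

Parsing the unit explicitly, rather than stripping non-digits, keeps a value like `4.8 s` from being read as faster than `123 ms`.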