Apple M4
MacBook Air
32 GB RAM · 90 models tested
Avg Score
65 /100
Avg Speed
47.3 tok/s
Models Tested
90
| Model | Size | Score | tok/s | TTFT | Verdict |
|---|---|---|---|---|---|
| qwen2.5:7b | 7.6B Q4_K_M | 85 | 22.2 | 285 ms | Excellent |
| gemma3:4b | 4.3B Q4_K_M | 83 | 34.6 | 303 ms | Excellent |
| qwen2.5:14b | 14.8B Q4_K_M | 82 | 11.2 | 539 ms | Excellent |
| qwen3.5:4b | 4.7B Q4_K_M | 81 | 16.6 | 369 ms | Excellent |
| qwen2.5-coder:7b | 7.6B Q4_K_M | 81 | 22.1 | 285 ms | Excellent |
| gemma2:2b | 2.6B Q4_0 | 80 | 55.1 | 152 ms | Excellent |
| yi:6b | 6B Q4_0 | 80 | 27.6 | 210 ms | Excellent |
| gemma3:12b | 12.2B Q4_K_M | 80 | 12.7 | 560 ms | Excellent |
| qwen2.5:3b | 3.1B Q4_K_M | 80 | 47.2 | 168 ms | Excellent |
| gemma2:9b | 9.2B Q4_0 | 78 | 17.5 | 347 ms | Good |
| llama3.2:3b | 3.2B Q4_K_M | 78 | 44.1 | 178 ms | Good |
| llama3.1:8b | 8.0B Q4_K_M | 77 | 20.3 | 315 ms | Good |
| qwen3:0.6b | 751.63M Q4_K_M | 77 | 145.2 | 1.4s | Good |
| qwen3.5:2b | 2.3B Q8_0 | 77 | 30.4 | 215 ms | Good |
| granite3.1-dense:8b | 8.2B Q4_K_M | 76 | 18.8 | 308 ms | Good |
| mistral:latest | 7.2B Q4_K_M | 76 | 22.5 | 257 ms | Good |
| qwen3:0.6b | 751.63M Q4_K_M | 75 | 143.9 | 1.4s | Good |
| phi4:14b | 14.7B Q4_K_M | 75 | 11.0 | 532 ms | Good |
| phi4-mini:latest | 3.8B Q4_K_M | 75 | 36.2 | 190 ms | Good |
| mistral:7b | 7.2B Q4_K_M | 75 | 22.0 | 261 ms | Good |
| llama3:latest | 8.0B Q4_0 | 74 | 22.3 | 295 ms | Good |
| qwen2.5:7b | 7.6B Q4_K_M | 74 | 20.5 | 295 ms | Good |
| codegemma:7b | 9B Q4_0 | 74 | 18.7 | 320 ms | Good |
| qwen2.5:1.5b | 1.5B Q4_K_M | 74 | 84.1 | 119 ms | Good |
| mistral-nemo:12b | 12.2B Q4_0 | 73 | 14.6 | 371 ms | Good |
| deepseek-v2:16b | 15.7B Q4_0 | 73 | 56.6 | 239 ms | Good |
| gemma2:2b | 2.6B Q4_0 | 72 | 53.8 | 170 ms | Good |
| gemma3:4b | 4.3B Q4_K_M | 72 | 35.9 | 303 ms | Good |
| gemma3:1b | 999.89M Q4_K_M | 72 | 39.5 | 333 ms | Good |
| gemma3:1b | 999.89M Q4_K_M | 72 | 39.4 | 362 ms | Good |
| granite3.1-dense:2b | 2.5B Q4_K_M | 72 | 54.6 | 121 ms | Good |
| nous-hermes2:latest | 11B Q4_0 | 71 | 15.9 | 345 ms | Good |
| neural-chat:7b | 7B Q4_0 | 71 | 22.5 | 242 ms | Good |
| qwen2.5:3b | 3.1B Q4_K_M | 71 | 38.5 | 204 ms | Good |
| aya:8b | 8.0B F16 | 70 | 20.0 | 300 ms | Good |
| qwen2.5-coder:7b | 7.6B Q4_K_M | 70 | 21.5 | 290 ms | Good |
| qwen3:1.7b | 2.0B Q4_K_M | 70 | 37.9 | 3.1s | Good |
| qwen3.5:0.8b | 873.44M Q8_0 | 70 | 48.2 | 195 ms | Good |
| llama3.2:3b | 3.2B Q4_K_M | 69 | 27.8 | 285 ms | Good |
| llama3.1:8b | 8.0B Q4_K_M | 69 | 20.2 | 481 ms | Good |
| gemma2:9b | 9.2B Q4_0 | 69 | 16.8 | 372 ms | Good |
| smollm2:1.7b | 1.7B Q8_0 | 69 | 51.8 | 84 ms | Good |
| phi3:3.8b | 3.8B Q4_0 | 69 | 42.0 | 142 ms | Good |
| llama3:latest | 8.0B Q4_0 | 68 | 21.4 | 297 ms | Good |
| qwen2.5:1.5b | 1.5B Q4_K_M | 68 | 81.5 | 119 ms | Good |
| codellama:7b | 7B Q4_0 | 68 | 23.6 | 297 ms | Good |
| starling-lm:7b | 7B Q4_0 | 68 | 23.7 | 234 ms | Good |
| granite3.1-dense:2b | 2.5B Q4_K_M | 67 | 53.4 | 122 ms | Good |
| gemma3:12b | 12.2B Q4_K_M | 67 | 12.3 | 656 ms | Good |
| qwen3:0.6b | 751.63M Q4_K_M | 66 | 144.3 | 81 ms | Good |
| dolphin-phi:2.7b | 3B Q4_0 | 66 | 56.1 | 108 ms | Good |
| solar:10.7b | 11B Q4_0 | 66 | 15.8 | 353 ms | Good |
| phi3:14b | 14.0B Q4_0 | 66 | 12.7 | 434 ms | Good |
| phi:2.7b | 3B Q4_0 | 65 | 56.6 | 110 ms | Good |
| phi4:14b | 14.7B Q4_K_M | 65 | 11.1 | 515 ms | Good |
| vicuna:7b | 7B Q4_0 | 65 | 23.4 | 242 ms | Good |
| codegemma:7b | 9B Q4_0 | 65 | 19.2 | 316 ms | Good |
| wizardlm2:7b | 7B Q4_0 | 65 | 23.2 | 264 ms | Good |
| gemma3:1b | 999.89M Q4_K_M | 65 | 67.5 | 666 ms | Good |
| llama3.2:1b | 1.2B Q8_0 | 63 | 69.6 | 169 ms | Good |
| deepseek-coder:6.7b | 7B Q4_0 | 63 | 24.1 | 251 ms | Good |
| llama3.2:1b | 1.2B Q8_0 | 63 | 28.2 | 199 ms | Good |
| smollm2:1.7b | 1.7B Q8_0 | 63 | 50.3 | 81 ms | Good |
| qwen2.5:0.5b | 494.03M Q4_K_M | 63 | 165.0 | 81 ms | Good |
| vicuna:13b | 13B Q4_0 | 62 | 13.4 | 438 ms | Good |
| mistral:7b | 7.2B Q4_K_M | 61 | 13.6 | 517 ms | Good |
| gemma3:1b | 999.89M Q4_K_M | 61 | 87.9 | 2.3s | Good |
| llama2:7b | 7B Q4_0 | 61 | 25.1 | 228 ms | Good |
| orca-mini:7b | 7B Q4_0 | 61 | 24.4 | 219 ms | Good |
| llama2:13b | 13B Q4_0 | 60 | 13.5 | 415 ms | Good |
| qwen2.5:0.5b | 494.03M Q4_K_M | 59 | 162.7 | 81 ms | Marginal |
| smollm2:360m | 361.82M F16 | 59 | 117.9 | 43 ms | Marginal |
| smollm2:360m | 361.82M F16 | 58 | 115.1 | 39 ms | Marginal |
| orca-mini:3b | 3B Q4_0 | 54 | 30.6 | 126 ms | Marginal |
| stablelm2:1.6b | 2B Q4_0 | 53 | 93.8 | 94 ms | Marginal |
| tinyllama:1.1b | 1B Q4_0 | 51 | 121.0 | 82 ms | Marginal |
| tinyllama:1.1b | 1B Q4_0 | 50 | 129.6 | 58 ms | Marginal |
| qwen3:8b | 8.2B Q4_K_M | 50 | 18.7 | 12.1s | Marginal |
| smollm2:135m | 134.52M F16 | 49 | 241.0 | 39 ms | Marginal |
| smollm2:135m | 134.52M F16 | 48 | 245.0 | 31 ms | Marginal |
| starcoder2:3b | 3B Q4_0 | 48 | 49.4 | 155 ms | Marginal |
| qwen3:4b | 4.0B Q4_K_M | 48 | 11.3 | 9.2s | Marginal |
| stablelm2:1.6b | 2B Q4_0 | 48 | 62.8 | 126 ms | Marginal |
| falcon:7b | 7B Q4_0 | 47 | 24.3 | 256 ms | Marginal |
| starcoder2:7b | 7B Q4_0 | 44 | 23.8 | 251 ms | Marginal |
| qwen3:8b | 8.2B Q4_K_M | 42 | 18.2 | 12.4s | Marginal |
| phi4-mini | 39 | 21.8 | 9.6s | Not Rec. | |
| phi4-mini | 38 | 21.7 | 11.9s | Not Rec. | |
| deepseek-r1:14b | 14.8B Q4_K_M | 26 | 11.4 | 30.0s | Not Rec. |
| deepseek-r1:14b | 14.8B Q4_K_M | 25 | 10.8 | 30.0s | Not Rec. |