Apple M4

Mac mini

24 GB RAM · 143 models tested

Avg Score: 66/100
Avg Speed: 42.6 tok/s
Models Tested: 143

Model	Size (params, quant)	Score (/100)	tok/s	TTFT	Verdict
openai/gpt-oss-20b	20B MXFP4	95	29.3	1.8s	Excellent
openai/gpt-oss-20b	20B MXFP4	94	40.7	1.3s	Excellent
qwen/qwen3-30b-a3b-2507	30B 4bit	94	44.8	405 ms	Excellent
unsloth/gemma-4-26b-a4b-it	26B Q3_K_M	90	24.9	817 ms	Excellent
gpt-oss-orchestrator:latest	20.9B MXFP4	88	24.6	4.8s	Excellent
gpt-oss:20b	20.9B MXFP4	87	24.3	4.5s	Excellent
gpt-oss-safeguard-20b-mlx	20B MXFP4	87	41.4	3.6s	Excellent
gpt-oss:20b	20.9B MXFP4	87	26.4	6.7s	Excellent
gemma4:26b	25.8B Q4_K_M	85	24.5	1.5s	Excellent
qwen2.5:7b	7.6B Q4_K_M	84	22.2	285 ms	Excellent
unsloth/gemma-4-26b-a4b-it	26B Q3_K_M	84	16.5	2.2s	Excellent
qwen2.5-7b-instruct	7B 4bit	84	24.4	464 ms	Excellent
nvidia/nemotron-3-nano	30B 4bit	84	47.9	465 ms	Excellent
qwen2.5:14b	14.8B Q4_K_M	83	11.2	539 ms	Excellent
gpt-oss:20b	20.9B MXFP4	83	17.1	10.2s	Excellent
qwen3:14b	14.8B Q4_K_M	83	11.1	500 ms	Excellent
openai/gpt-oss-20b	20B MXFP4	82	31.8	8.4s	Excellent
qwen2.5-7b-instruct	7B 4bit	82	22.0	477 ms	Excellent
qwen3:14b	14.8B Q4_K_M	82	10.3	513 ms	Excellent
qwen3.5-9b	9B 4bit	81	20.4	3.0s	Excellent
qwen3.5:9b	9.7B Q4_K_M	80	11.8	523 ms	Excellent
gemma3:12b	12.2B Q4_K_M	80	12.7	560 ms	Excellent
mlx-community/gemma-3-4b-it-qat-4bit		80	43.2	428 ms	Excellent
gemma3:4b	4.3B Q4_K_M	80	34.6	303 ms	Excellent
qwen3.5:4b	4.7B Q4_K_M	80	16.6	369 ms	Excellent
qwen2.5-coder-7b-instruct-mlx	7B 4bit	79	22.5	529 ms	Good
qwen2.5-coder:7b	7.6B Q4_K_M	79	22.1	285 ms	Good
yi:6b	6B Q4_0	77	27.6	210 ms	Good
qwen2.5:3b	3.1B Q4_K_M	77	47.2	168 ms	Good
gemma2:2b	2.6B Q4_0	77	55.1	152 ms	Good
deepseek-r1:7b	7.6B Q4_K_M	77	22.2	285 ms	Good
internlm2:7b	7.7B Q4_0	77	22.7	250 ms	Good
openai/gpt-oss-20b	20B MXFP4	77	30.5	30.0s	Not Rec.
cogito:8b	8.0B Q4_K_M	77	21.0	312 ms	Good
gemma2:9b	9.2B Q4_0	76	17.5	347 ms	Good
ministral-3:8b	8.9B Q4_K_M	76	19.5	328 ms	Good
glm4:9b	9.4B Q4_0	76	19.0	290 ms	Good
google/gemma-3-4b	4B 4bit	76	39.4	468 ms	Good
qwen3.5:27b	27.8B Q4_K_M	75	4.4	1.1s	Good
exaone-3.5-2.4b-instruct-mlx	2.4B 8bit	75	37.0	288 ms	Good
phi4:14b	14.7B Q4_K_M	75	11.0	532 ms	Good
qwen3.5-27b	27B 4bit	75	6.6	4.4s	Good
llama3.1:8b	8.0B Q4_K_M	75	20.3	315 ms	Good
granite3.1-dense:8b	8.2B Q4_K_M	74	18.8	308 ms	Good
minimax-m2.7:cloud		74	0.0	11.4s	Not Rec.
liquid/lfm2-24b-a2b	24B 4bit	74	51.8	503 ms	Good
mlx-community/Llama-3.2-3B-Instruct-4bit		74	50.6	257 ms	Good
llama3.2:3b	3.2B Q4_K_M	74	44.1	178 ms	Good
hermes3:8b	8.0B Q4_0	74	22.3	296 ms	Good
dolphin3:8b	8.0B Q4_K_M	74	21.0	314 ms	Good
liquid/lfm2-24b-a2b	24B 4bit	74	58.9	481 ms	Good
mistralai/magistral-small-2509	24B 4bit	74	7.9	1.4s	Good
qwen3.5:2b	2.3B Q8_0	73	30.4	215 ms	Good
qwen2.5-coder-3b-instruct-mlx	3B 4bit	73	52.3	285 ms	Good
mlx-community/Yi-1.5-6B-Chat-4bit		73	29.1	451 ms	Good
qwen3:0.6b	751.63M Q4_K_M	73	145.2	1.4s	Good
mistral:latest	7.2B Q4_K_M	73	22.5	257 ms	Good
mlx-community/gemma-3-1b-it-8bit		72	86.3	253 ms	Good
mistral:7b	7.2B Q4_K_M	72	22.0	261 ms	Good
qwen3.5-9b	9B Q4_K_S	72	14.4	824 ms	Not Rec.
aya-expanse:8b	8.0B Q4_K_M	72	19.7	350 ms	Good
codegemma:7b	9B Q4_0	71	18.7	320 ms	Good
mistral-nemo:12b	12.2B Q4_0	71	14.6	371 ms	Good
ministral-3:3b	3.8B Q4_K_M	71	42.2	189 ms	Good
llama3:latest	8.0B Q4_0	71	22.3	295 ms	Good
qwen3.5-4b-mlx	4B 4bit	71	23.1	3.3s	Good
google/gemma-3-27b	27B 4bit	71	5.9	1.8s	Not Rec.
phi4-mini:latest	3.8B Q4_K_M	71	36.2	190 ms	Good
qwen2.5-coder-1.5b-instruct-mlx	1.5B 8bit	70	58.4	234 ms	Good
qwen2.5:1.5b	1.5B Q4_K_M	70	84.1	119 ms	Good
mistral:7b	7.2B Q4_K_M	70	22.5	252 ms	Good
qwen3.5-2b-mlx	2B 8bit	70	32.9	3.1s	Good
yi-coder:9b	8.8B Q4_0	70	19.4	287 ms	Good
deepseek-v2:16b	15.7B Q4_0	69	56.6	239 ms	Good
qwen2.5-1.5b-instruct	1.5B 8bit	69	58.1	211 ms	Good
mistralai/devstral-small-2-2512	24B 4bit	69	5.7	5.0s	Good
lfm2.5-1.2b-instruct-mlx	1.2B 8bit	68	77.3	177 ms	Good
granite3.1-dense:2b	2.5B Q4_K_M	68	54.6	121 ms	Good
nous-hermes2:latest	11B Q4_0	68	15.9	345 ms	Good
gemma3:1b	999.89M Q4_K_M	68	39.4	362 ms	Good
gemma3:1b	999.89M Q4_K_M	68	39.5	333 ms	Good
cogito:3b	3.6B Q4_K_M	67	45.6	177 ms	Good
neural-chat:7b	7B Q4_0	67	22.5	242 ms	Good
aya:8b	8.0B F16	66	20.0	300 ms	Good
granite-3.3-2b-instruct	2B bf16	66	18.3	541 ms	Good
qwen3.5:27b	27.8B Q4_K_M	66	4.0	1.7s	Good
deepseek-r1:1.5b	1.8B Q4_K_M	66	85.4	115 ms	Good
qwen3.5:27b	27.8B Q4_K_M	66	3.8	1.6s	Good
smollm2-1.7b-instruct	1.7B bf16	65	28.5	393 ms	Good
starling-lm:7b	7B Q4_0	65	23.8	232 ms	Good
qwen3.5:0.8b	873.44M Q8_0	65	48.2	195 ms	Good
smollm2:1.7b	1.7B Q8_0	64	51.8	84 ms	Good
phi3:3.8b	3.8B Q4_0	64	42.0	142 ms	Good
solar:10.7b	11B Q4_0	63	15.8	353 ms	Good
starling-lm:7b	7B Q4_0	63	23.7	234 ms	Good
phi3:14b	14.0B Q4_0	63	12.7	434 ms	Good
codellama:7b	7B Q4_0	63	23.6	297 ms	Good
microsoft/phi-4-mini-reasoning	3.8B 4bit	62	40.3	289 ms	Good
dolphin-phi:2.7b	3B Q4_0	61	56.1	108 ms	Good
qwen3:0.6b	751.63M Q4_K_M	61	144.3	81 ms	Good
qwen3.5-0.8b-mlx	0.8B 8bit	60	108.0	2.6s	Good
text-embedding-nomic-embed-text-v1.5	Q4_K_M	59	45.3	2.4s	Marginal
vicuna:7b	7B Q4_0	59	23.4	242 ms	Marginal
vicuna:13b	13B Q4_0	59	13.4	438 ms	Marginal
qwen3.5-9b-mlx	9B 4bit	59	14.3	13.4s	Marginal
wizardlm2:7b	7B Q4_0	59	23.2	264 ms	Marginal
phi:2.7b	3B Q4_0	59	56.6	110 ms	Marginal
qwen3.5-9b-mlx	9B 4bit	58	16.7	14.1s	Marginal
qwen2.5:0.5b	494.03M Q4_K_M	57	165.0	81 ms	Marginal
mlx-community/Llama-3.2-1B-Instruct-4bit		57	126.4	139 ms	Marginal
deepseek-coder:6.7b	7B Q4_0	57	24.1	251 ms	Marginal
llama2:13b	13B Q4_0	56	13.5	415 ms	Marginal
orca-mini:7b	7B Q4_0	55	24.4	219 ms	Marginal
phi-3.5-mini-instruct	4bit	55	43.4	359 ms	Marginal
qwen3.5-0.8b-mlx	0.8B 4bit	55	68.2	3.3s	Marginal
llama2:7b	7B Q4_0	55	25.1	228 ms	Marginal
falcon3-1b-instruct	1B 3bit	54	121.8	176 ms	Marginal
mlx-community/Nanbeige4.1-3B-8bit		53	25.4	266 ms	Marginal
qwen2.5-0.5b-instruct-mlx	0.5B 4bit	52	257.5	99 ms	Marginal
smollm2:360m	361.82M F16	52	117.9	43 ms	Marginal
smollm2-360m-instruct	360M bf16	52	121.1	106 ms	Marginal
mlx-community/stablelm-2-zephyr-1_6b-4bit		50	93.6	166 ms	Marginal
qwen3.5:4b	4.7B Q4_K_M	48	18.9	13.0s	Marginal
orca-mini:3b	3B Q4_0	47	30.6	126 ms	Marginal
qwen3:4b	4.0B Q4_K_M	47	11.3	9.2s	Marginal
qwen3:8b	8.2B Q4_K_M	47	18.7	12.1s	Marginal
stablelm2:1.6b	2B Q4_0	45	93.8	94 ms	Marginal
qwen2.5-math-1.5b-instruct	1.5B 4bit	45	91.7	205 ms	Marginal
mlx-community/quantized-gemma-2b-it		45	44.4	336 ms	Marginal
tinyllama:1.1b	1B Q4_0	43	121.0	82 ms	Marginal
tinyllama		43	133.4	61 ms	Marginal
gemma-3-270m-it-qat-mlx	270M 4bit	43	344.2	173 ms	Marginal
smollm2:135m	134.52M F16	41	241.0	39 ms	Marginal
starcoder2:3b	3B Q4_0	39	49.4	155 ms	Not Rec.
phi-3-mini-128k-instruct	4bit	38	44.0	385 ms	Not Rec.
deepseek-r1-distill-qwen-14b-mlx	14B 5bit	36	10.3	20.7s	Not Rec.
starcoder2:7b	7B Q4_0	35	23.8	251 ms	Not Rec.
qwen3.5-27b-claude-4.6-opus-distilled-mlx	27B 4bit	34	6.1	22.9s	Not Rec.
qwen3.5:latest	9.7B Q4_K_M	34	13.3	19.8s	Not Rec.
gemma4:26b	25.8B Q4_K_M	33	20.0	30.0s	Not Rec.
lmstudio-community/Phi-4-reasoning-plus-MLX-4bit		33	11.5	670 ms	Not Rec.
deepseek-r1:14b	14.8B Q4_K_M	22	11.4	30.0s	Not Rec.
qwen3.5:27b	27.8B Q4_K_M	4	3.5	52.0s	Not Rec.