glm-4.7-flash:latest

OLLAMA GGUF

glm4moelite · 29.9B · Q4_K_M

Excellent

Mar 27, 2026 · AMD RYZEN AI MAX+ 395

unsloth/qwen3.5-9b@q4_k_m

LM-STUDIO GGUF

qwen35 · 9B · Q4_K_M

Good

Jun 29, 2026 · Intel Gen Intel® Core™ i9-11900H

Global Score: 83 vs 73
Hardware Fit: 100 vs 52
Quality Score: 76 vs 82
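
The page does not say how the Global Score is derived from the other two scores. As a rough sketch, a 30% Hardware Fit / 70% Quality weighting happens to reproduce both global scores shown here. The weights and the `global_score` helper are assumptions inferred from these two data points, not a documented MetriLLM formula.

```python
# Assumed weights (out of 10), inferred from the two score pairs on this page.
# They are NOT taken from MetriLLM documentation.
HW_WEIGHT = 3
QUALITY_WEIGHT = 7


def global_score(hardware_fit: int, quality: int) -> int:
    """Hypothetical helper: weighted average of the two scores, rounded."""
    return round((HW_WEIGHT * hardware_fit + QUALITY_WEIGHT * quality) / 10)


glm = global_score(hardware_fit=100, quality=76)   # glm-4.7-flash:latest
qwen = global_score(hardware_fit=52, quality=82)   # unsloth/qwen3.5-9b@q4_k_m
print(glm, qwen)
```

Two data points cannot confirm a formula, so treat the match as a plausibility check only.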

Hardware

Spec | glm-4.7-flash:latest | unsloth/qwen3.5-9b…
Machine | AZW GTR Pro | Docker Container
CPU | AMD RYZEN AI MAX+ 395 | Intel Gen Intel® Core™ i9-11900H
Cores | 32 | 16
RAM | 125 GB | 15 GB
GPU | Radeon 8060S | TigerLake-H GT1 [UHD Graphics]
OS | Ubuntu 24.04.4 LTS | Ubuntu 26.04 LTS
Arch | x64 | x64
Power Mode | performance | performance

Performance

Metric | glm-4.7-flash:latest | unsloth/qwen3.5-9b…
Tokens/sec | 58.0 | 6.5
First chunk | 268 ms | 13 ms
TTFT | 268 ms | 4.1 s
Load time | 7.8 s | 10.8 s
Memory usage | 37.7 GB | 2.2 GB
Memory % | 30% | 14%

HW Fit Score Breakdown

glm-4.7-flash:latest

Speed: 50/50
TTFT: 20/20
Memory: 30/30

unsloth/qwen3.5-9b@q4_k_m

Speed: 11/50
TTFT: 11/20
Memory: 30/30
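
The three sub-scores above add up to the Hardware Fit values reported at the top of the page (100 and 52). A minimal check, assuming the Hardware Fit score is a plain sum of its parts; the dictionary layout is only for illustration:

```python
# Sub-scores copied from the breakdown above (maxima: speed 50, TTFT 20, memory 30).
hw_fit = {
    "glm-4.7-flash:latest": {"speed": 50, "ttft": 20, "memory": 30},
    "unsloth/qwen3.5-9b@q4_k_m": {"speed": 11, "ttft": 11, "memory": 30},
}

# Assumption: Hardware Fit is the unweighted sum of the three sub-scores.
hw_totals = {model: sum(parts.values()) for model, parts in hw_fit.items()}

for model, total in hw_totals.items():
    print(f"{model}: {total}/100")
```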

Quality

glm-4.7-flash:latest

Reasoning: 13/20 (Adequate)
Coding: 17/20 (Strong)
Instruction Following: 11/20 (Adequate)
Structured Output: 15/15 (Strong)
Math: 10/15 (Adequate)
Multilingual: 10/10 (Strong)

unsloth/qwen3.5-9b@q4_k_m

Reasoning: 15/20 (Strong)
Coding: 17/20 (Strong)
Instruction Following: 16/20 (Strong)
Structured Output: 15/15 (Strong)
Math: 9/15 (Adequate)
Multilingual: 10/10 (Strong)
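
The six category scores add up to the Quality Score values shown at the top of the page (76 and 82), and the category maxima total 100. A short sketch of that arithmetic, assuming the Quality Score is an unweighted sum; the variable names are illustrative:

```python
# Category maxima as listed on the page.
max_points = {
    "reasoning": 20, "coding": 20, "instruction": 20,
    "structured": 15, "math": 15, "multilingual": 10,
}

# Per-model category scores copied from the Quality section.
quality = {
    "glm-4.7-flash:latest": {
        "reasoning": 13, "coding": 17, "instruction": 11,
        "structured": 15, "math": 10, "multilingual": 10,
    },
    "unsloth/qwen3.5-9b@q4_k_m": {
        "reasoning": 15, "coding": 17, "instruction": 16,
        "structured": 15, "math": 9, "multilingual": 10,
    },
}

# Assumption: Quality Score is the plain sum of the category scores.
quality_totals = {model: sum(scores.values()) for model, scores in quality.items()}
max_total = sum(max_points.values())

for model, total in quality_totals.items():
    print(f"{model}: {total}/{max_total}")
```

The 6-point gap comes from Reasoning (+2) and Instruction Following (+5) in favour of qwen3.5-9b, offset by Math (-1).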

Run yours and compare

$ npm install -g metrillm@latest
$ metrillm

Requires Node 20+ and Ollama or LM Studio running

Or run without installing: npx metrillm@latest