qwen3:1.7b

qwen3 · 2.0B · Q4_K_M

THINKING MODEL ECO MODE

AZW GTR Pro (AMD RYZEN AI MAX+ 395)

125 GB · Ubuntu 24.04.4 LTS

Tested on March 5, 2026
Top 39% Compare
Global Score
72 /100
Good
Hardware Fit
92/100
Quality
63/100

Get this model

Hardware

Machine
AZW GTR Pro
CPU
AMD RYZEN AI MAX+ 395
Cores
32 threads (16 cores)
Frequency
3 GHz
RAM
125 GB
GPU
AMD Radeon 8060S
OS
Ubuntu 24.04.4 LTS
Arch
x64
Power Mode
low-power

Performance

Tokens/sec
120.7
Standard deviation
±0.1
First chunk latency
93 ms
Time to first token
1.5 s
Load time
2.5 s
Memory usage
5.8 GB (5%)
Total tokens
1416
Thinking tokens (est.)
~942

Score breakdown

Speed
50/50
Time to first token
12/20
Memory
30/30

Quality

Reasoning
13/20
Coding
6/20
Instruction following
13/20
Structured output
14/15
Math
7/15
Multilingual
10/10

Category levels

Reasoning: Adequate Coding: Weak Instruction Following: Adequate Structured Output: Strong Math: Weak Multilingual: Strong

Metadata

Spec version
0.2.1
Runtime
Ollama 0.17.4
Model format
GGUF
Hardware profile
HIGH-END
Result hash
c7caca48b7927d200e97739b86a82b9d75bd0591080b9f6e5ef47379bd37160f

Interpretation

Hardware fit: 92/100. Overall suitability: GOOD (Global 72/100). Category profile: Reasoning: Adequate, Coding: Weak, Instruction Following: Adequate, Structured Output: Strong, Math: Weak, Multilingual: Strong.

Warnings

  • System was in low-power mode during this benchmark.
  • CPU appears throttled (2.2 GHz current vs 3.0 GHz nominal, 73%).

Bench Environment

Thermal: nominal CPU load: avg 4% (peak 4%)

Run yours now

$ npm install -g metrillm@latest
$ metrillm

Requires Node 20+ and Ollama or LM Studio running

Or run without installing: npx metrillm@latest