qwen3-vl:30b

qwen3vlmoe · 31.1B · Q4_K_M

THINKING MODEL

ASUS (Intel Core™ Ultra 9 285K)

47 GB · Microsoft Windows 11 Pro 10.0.26200

Tested on March 5, 2026 · Submitted by abayaz61
Top 18% Compare
Global Score
80 /100
Excellent
Hardware Fit
98/100
Quality
72/100

Get this model

Hardware

Machine
ASUS
CPU
Intel Core™ Ultra 9 285K
Cores
24 total (24 perf)
Frequency
3.7 GHz
RAM
47 GB DDR5
GPU
NVIDIA GeForce RTX 5090, Intel(R) Graphics
OS
Microsoft Windows 11 Pro 10.0.26200
Arch
x64
Power Mode
balanced

Performance

Tokens/sec
208.9
Standard deviation
±2.5
First chunk latency
102 ms
Time to first token
684 ms
Load time
0.1 s
Memory usage
22.0 GB (47%)
Total tokens
1290
Thinking tokens (est.)
~746

Score breakdown

Speed
50/50
Time to first token
20/20
Memory
28/30

Quality

Reasoning
14/20
Coding
6/20
Instruction following
17/20
Structured output
12/15
Math
13/15
Multilingual
10/10

Category levels

Reasoning: Adequate Coding: Weak Instruction Following: Strong Structured Output: Strong Math: Strong Multilingual: Strong

Metadata

Spec version
0.2.1
Runtime
Ollama 0.17.6
Model format
GGUF
Hardware profile
HIGH-END
Result hash
27e28cb6e53bd01b2b0eb2f37e3d856279ab3c092f3b85ca77671facf99a4d55

Interpretation

Hardware fit: 98/100. Overall suitability: EXCELLENT (Global 80/100). Category profile: Reasoning: Adequate, Coding: Weak, Instruction Following: Strong, Structured Output: Strong, Math: Strong, Multilingual: Strong.

Bench Environment

Thermal: nominal CPU load: avg 12% (peak 17%)

Run yours now

$ npm install -g metrillm@latest
$ metrillm

Requires Node 20+ and Ollama or LM Studio running

Or run without installing: npx metrillm@latest