qwen3:4b
qwen3 · 4.0B · Q4_K_M
THINKING MODEL
NVIDIA NVIDIA Jetson Orin Nano Engineering Reference Developer Kit Super (Cortex-A78AE)
7 GB · Ubuntu 22.04.5 LTS
Tested on June 26, 2026
Global Score
29 /100
Not Rec.
Hardware Fit
86/100
Quality
4/100
Get this model
Hardware
- Machine
- NVIDIA NVIDIA Jetson Orin Nano Engineering Reference Developer Kit Super
- CPU
- Cortex-A78AE
- Cores
- 6 total (6 perf)
- Frequency
- 1.04 GHz
- RAM
- 7 GB
- GPU
- Integrated / Unknown
- OS
- Ubuntu 22.04.5 LTS
- Arch
- arm64
- Power Mode
- balanced
Performance
- Tokens/sec
- 17.9
- Standard deviation
- ±0.0
- First chunk latency
- 1.0 s
- Time to first token
- 6.4 s
- Load time
- 9.5 s
- Memory usage
- 3.0 GB (40%)
- Total tokens
- 1320
- Thinking tokens (est.)
- ~722
Score breakdown
Speed
50/50
Time to first token
8/20
Memory
28/30
Quality
Reasoning
4/20
Coding
0/20
Instruction following
0/20
Structured output
0/15
Math
0/15
Multilingual
0/10
Category levels
Reasoning: Poor Coding: Poor Instruction Following: Poor Structured Output: Poor Math: Poor Multilingual: Poor
Metadata
- Spec version
- 0.2.1
- Runtime
- Ollama 0.30.10
- Model format
- GGUF
- Hardware profile
- ENTRY
- Result hash
- 635465fefd18022bafc32e948378c9281a47c7b845bdd0f0f749160218f94889
Interpretation
Hardware fit: 86/100. Overall suitability: NOT RECOMMENDED (Global 29/100). Category profile: Reasoning: Poor, Coding: Poor, Instruction Following: Poor, Structured Output: Poor, Math: Poor, Multilingual: Poor. Warning: model produced very low accuracy on quality tasks — results may be unusable despite good hardware performance.
Warnings
- Model produced very low accuracy on quality tasks — results may be unusable despite good hardware performance.
Bench Environment
Thermal: nominal CPU load: avg 4% (peak 11%)
Run yours now
$
npm install -g metrillm@latest$
metrillmRequires Node 20+ and Ollama or LM Studio running
Or run without installing: npx metrillm@latest