qwen3.6-35b-a3b-claude-4.7-opus-reasoning-distilled-apex

qwen35moe · 35B

ASUS (AMD Ryzen 7 7800X3D 8-Core Processor)

31 GB · cachyos rolling

Tested on July 5, 2026
Top 16%
Global Score: 84/100 (Excellent)
Hardware Fit: 79/100
Quality: 86/100

Hardware

Machine: ASUS
CPU: AMD Ryzen 7 7800X3D 8-Core Processor
Cores: 16 threads (8 cores)
Frequency: 3.56 GHz
RAM: 31 GB
GPU: NVIDIA GeForce RTX 4090, Raphael
OS: cachyos rolling
Arch: x64
Power Mode: performance

Performance

Tokens/sec: 147.6
Standard deviation: ±3.4
First chunk latency: 3 ms
Time to first token: 164 ms
Load time: N/A
Memory usage: 25.1 GB (82%)
Total tokens: 1353
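
As a rough sanity check, the throughput and token count above imply a generation time of about nine seconds. The sketch below assumes that "Total tokens" counts generated tokens and that 147.6 tokens/sec is the mean rate over the whole run; the page states neither, so treat the result as an estimate. The variable names are illustrative.

```python
# Figures copied from the Performance section.
tokens_per_second = 147.6
total_tokens = 1353
time_to_first_token_s = 0.164  # 164 ms

# Assumption: all tokens were generated at the mean rate reported above.
generation_seconds = total_tokens / tokens_per_second

# Assumption: time to first token is not already included in the rate.
end_to_end_seconds = time_to_first_token_s + generation_seconds

print(f"Estimated generation time: {generation_seconds:.2f} s")
print(f"Estimated end-to-end time: {end_to_end_seconds:.2f} s")
```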

Score breakdown

Speed: 50/50
Time to first token: 20/20
Memory: 9/30
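
The three components in this breakdown add up to the Hardware Fit score of 79/100 reported for this run. A minimal sketch of that arithmetic, assuming Hardware Fit is a plain sum of the components (the page does not state the formula; the dictionary and variable names are illustrative):

```python
# (points earned, points available), copied from the score breakdown.
# Treating Hardware Fit as their plain sum is an assumption that matches
# the reported 79/100.
hardware_components = {
    "speed": (50, 50),
    "time_to_first_token": (20, 20),
    "memory": (9, 30),
}

hardware_fit = sum(earned for earned, _ in hardware_components.values())
hardware_max = sum(available for _, available in hardware_components.values())

print(f"Hardware Fit: {hardware_fit}/{hardware_max}")
```

Memory is the only component that loses points here (9 of 30), which is consistent with the reported 82% memory usage.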

Quality

Reasoning: 18/20
Coding: 17/20
Instruction following: 14/20
Structured output: 14/15
Math: 13/15
Multilingual: 10/10
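
The six category scores add up to the overall Quality score of 86/100. The sketch below reproduces that sum and finds the weakest category by fraction of available points. It assumes Quality is a plain sum of the categories, which matches the reported figure but is not stated on the page; the names are illustrative.

```python
# (points earned, points available), copied from the Quality section.
quality_categories = {
    "reasoning": (18, 20),
    "coding": (17, 20),
    "instruction_following": (14, 20),
    "structured_output": (14, 15),
    "math": (13, 15),
    "multilingual": (10, 10),
}

quality_score = sum(earned for earned, _ in quality_categories.values())
quality_max = sum(available for _, available in quality_categories.values())

# Weakest category: lowest fraction of its available points.
weakest = min(
    quality_categories,
    key=lambda name: quality_categories[name][0] / quality_categories[name][1],
)

print(f"Quality: {quality_score}/{quality_max}")
print(f"Weakest category: {weakest}")
```

Instruction following comes out lowest at 14/20, which agrees with it being the only category rated "Adequate" rather than "Strong".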

Category levels

Reasoning: Strong, Coding: Strong, Instruction Following: Adequate, Structured Output: Strong, Math: Strong, Multilingual: Strong

Metadata

Spec version: 0.2.1
Runtime: LM Studio
Model format: GGUF
Hardware profile: BALANCED
Result hash: dbd2fadd60ae01e4170207674fdc82d0f86424e1b9bf0872ea91e6f1ba1010e9

Interpretation

Hardware fit: 79/100. Overall suitability: EXCELLENT (Global 84/100). Category profile: Reasoning: Strong, Coding: Strong, Instruction Following: Adequate, Structured Output: Strong, Math: Strong, Multilingual: Strong.

Warnings

  • Model memory footprint is estimated via LM Studio CLI rather than measured from a fresh load.

Bench Environment

Thermal: nominal
Swap delta: +0.0 GB
CPU load: avg 7% (peak 9%)

Run yours now

$ npm install -g metrillm@latest
$ metrillm

Requires Node 20+ and a running Ollama or LM Studio instance.

Or run without installing: npx metrillm@latest