qwen2.5-7b-instruct

qwen2 · 7B · 4bit

MacBook Air (Apple M4)

32 GB · macOS 26.3

Tested on March 3, 2026 · Submitted by Topaz750
Top 8% Compare
Global Score
84 /100
Excellent
Hardware Fit
97/100
Quality
78/100

Get this model

Hardware

Machine
MacBook Air
CPU
Apple M4
Cores
10 total (4 perf + 6 eff)
Frequency
2.4 GHz
RAM
32 GB LPDDR5
GPU
Apple M4
OS
macOS 26.3
Arch
arm64
Power Mode
balanced

Performance

Tokens/sec
24.4
Standard deviation
±0.5
First chunk latency
464 ms
Time to first token
464 ms
Load time
N/A
Memory usage
4.0 GB (13%)
Total tokens
997

Score breakdown

Speed
47/50
Time to first token
20/20
Memory
30/30

Quality

Reasoning
14/20
Coding
17/20
Instruction following
15/20
Structured output
15/15
Math
8/15
Multilingual
9/10

Category levels

Reasoning: Adequate Coding: Strong Instruction Following: Strong Structured Output: Strong Math: Adequate Multilingual: Strong

Metadata

Spec version
0.2.0
Runtime
LM Studio 0.4.6+1
Model format
MLX
Hardware profile
BALANCED
Result hash
261702777a73b4710eda1a46d54ed44338649b9fbfd724e2380d3c0e845038b0

Interpretation

Hardware fit: 97/100. Overall suitability: EXCELLENT (Global 84/100). Category profile: Reasoning: Adequate, Coding: Strong, Instruction Following: Strong, Structured Output: Strong, Math: Adequate, Multilingual: Strong.

Bench Environment

Power: AC CPU load: avg 9% (peak 11%)

Run yours now

$ npm install -g metrillm@latest
$ metrillm

Requires Node 20+ and Ollama or LM Studio running

Or run without installing: npx metrillm@latest