Qwen2.5-Coder-3B-Instruct
Performance
Tokens: 8k
PP tok/s: 91.6
TG tok/s: 10.8
TTFT (ms): 89431
Peak mem (GB): 2.5
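As a rough sanity check on these figures, the sketch below estimates time to first token from the prompt-processing rate. It assumes that "8k" means the 8,192-token context listed on this page, that PP tok/s is the prompt-processing throughput, and that TTFT is dominated by prompt processing; none of this is stated by the benchmark itself, and the function name `estimate_ttft_ms` is hypothetical.

```python
def estimate_ttft_ms(prompt_tokens: int, pp_tok_per_s: float) -> float:
    """Estimate time to first token in milliseconds.

    Assumption: TTFT is roughly the time needed to process the whole
    prompt at the reported prompt-processing (PP) rate.
    """
    return prompt_tokens / pp_tok_per_s * 1000.0


# Values taken from this page; 8192 is an assumed reading of "8k".
prompt_tokens = 8192
pp_tok_per_s = 91.6
reported_ttft_ms = 89431

estimate = estimate_ttft_ms(prompt_tokens, pp_tok_per_s)
print(f"estimated TTFT: {estimate:.0f} ms, reported: {reported_ttft_ms} ms")
```

Under these assumptions the estimate lands within rounding distance of the reported TTFT, which suggests the two numbers come from the same prompt-processing pass.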
Hardware
Chip: M1 (7c)
Memory: 16 GB
GPU Cores: 7
Software
oMLX: v0.3.4
macOS: 26.3.1
Context: 8,192