← Back to community benchmarks
Qwen2.5-Coder-3B-Instruct
Performance
32k
tokens
57.2
PP tok/s
10.3
TG tok/s
572960
TTFT (ms)
3.3
Peak mem (GB)
Hardware
Chip
M1 (7c)
Memory
16 GB
GPU Cores
7
Software
oMLX
v0.3.4
macOS
macOS 26.3.1
Context
32,768