← Back to community benchmarks
Qwen2.5-Coder-3B-Instruct
Performance
16k
tokens
73.9
PP tok/s
8.1
TG tok/s
221828
TTFT (ms)
2.7
Peak mem (GB)
Hardware
Chip
M1 (7c)
Memory
16 GB
GPU Cores
7
Software
oMLX
v0.3.4
macOS
macOS 26.3.1
Context
16,384