Qwen3.5-9B8bit-mtp

M4 Pro (16c) · 24 GB · 8bit · 2026-05-12
Performance
Tokens 1k
PP tok/s 334.3
TG tok/s 21.3
TTFT (ms) 3063
Peak mem (GB) 10.4
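The figures above appear to be related by simple arithmetic. The sketch below assumes that TTFT is dominated by prompt processing, that "1k" means the 1,024-token context listed under Software, and that PP and TG stand for prompt processing and token generation. The function names are illustrative and not part of oMLX.

```python
def estimate_ttft_ms(prompt_tokens: int, pp_tok_per_s: float) -> float:
    """Estimate time to first token, assuming it is prompt length / PP rate."""
    return prompt_tokens / pp_tok_per_s * 1000.0


def per_token_latency_ms(tg_tok_per_s: float) -> float:
    """Convert a generation rate (tokens/s) into milliseconds per token."""
    return 1000.0 / tg_tok_per_s


if __name__ == "__main__":
    # Values taken from the benchmark card above.
    ttft = estimate_ttft_ms(prompt_tokens=1024, pp_tok_per_s=334.3)
    latency = per_token_latency_ms(tg_tok_per_s=21.3)
    print(f"Estimated TTFT: {ttft:.0f} ms")
    print(f"Per-token generation latency: {latency:.1f} ms")
```

Under these assumptions the estimate comes out within a millisecond of the reported 3063 ms TTFT, which suggests the reported value is almost entirely prompt-processing time.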
Hardware
Chip M4 Pro (16c)
Memory 24 GB
GPU Cores 16
Software
oMLX v0.3.9.dev1
macOS 26.4.1
Context 1,024