Qwen3.5-9B-oQ4-mtp

M2 Max (30c) · 32 GB · 4bit · 2026-05-20
Performance
Tokens 32k
PP tok/s 243.8
TG tok/s 31.4
TTFT (ms) 134400
Peak mem (GB) 9.4
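
As a rough sanity check on the Performance figures, the sketch below compares the reported TTFT against the time implied by the prompt-processing (PP) rate. It assumes that TTFT is dominated by prefill, so that TTFT is approximately prompt tokens divided by PP tok/s; that relationship is an assumption made here, not something the benchmark page states, and the variable names are illustrative.

```python
# Figures copied from the Performance and Software blocks of this result.
prompt_tokens = 32_768       # context length in tokens
pp_tok_per_s = 243.8         # reported prompt-processing throughput
reported_ttft_ms = 134_400   # reported time to first token

# Assumption: TTFT is roughly prompt_tokens / prefill throughput.
estimated_ttft_ms = prompt_tokens / pp_tok_per_s * 1000

# Relative gap between the estimate and the reported value.
rel_gap = abs(estimated_ttft_ms - reported_ttft_ms) / reported_ttft_ms

print(f"estimated TTFT: {estimated_ttft_ms:.0f} ms")
print(f"relative gap:   {rel_gap:.4%}")
```

Under that assumption the two figures agree closely, which suggests the PP rate and TTFT were derived from the same prefill measurement.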
Hardware
Chip M2 Max (30c)
Memory 32 GB
GPU Cores 30
Software
oMLX v0.3.9rc1
macOS 26.3.1
Context 32,768