← Back to community benchmarks

Qwen3.5-122B-A10B-5bit

M3 Max (40c) · 128 GB · 5bit · 2026-04-04
Performance
1k
tokens
427.4
PP tok/s
41.4
TG tok/s
2396
TTFT (ms)
80.5
Peak mem (GB)
Hardware
Chip M3 Max (40c)
Memory 128 GB
GPU Cores 40
Software
oMLX v0.3.2
macOS macOS 26.4
Context 1,024