← Back to community benchmarks

Qwen3.5-122B-A10B

M3 Ultra (60c) · 96 GB · 4bit · 2026-04-06
Performance
32k
tokens
606.1
PP tok/s
31.1
TG tok/s
54060
TTFT (ms)
71.2
Peak mem (GB)
Hardware
Chip M3 Ultra (60c)
Memory 96 GB
GPU Cores 60
Software
oMLX v0.3.0
macOS macOS 26.2
Context 32,768