← Back to community benchmarks

Qwen3.5-122B-A10B6.5bit

M3 Ultra (60c) · 256 GB · 6bit · 2026-03-23
Performance
32k
tokens
597.7
PP tok/s
38.9
TG tok/s
54827
TTFT (ms)
99.7
Peak mem (GB)
Hardware
Chip M3 Ultra (60c)
Memory 256 GB
GPU Cores 60
Software
oMLX v0.2.20.dev2
macOS macOS 26.3.1
Context 32,768