← Back to community benchmarks

Qwen3.5-122B-A10B6.5bit

M3 Ultra (60c) · 256 GB · 6bit · 2026-03-23
Performance
1k
tokens
597.3
PP tok/s
47.0
TG tok/s
1715
TTFT (ms)
94.0
Peak mem (GB)
Hardware
Chip M3 Ultra (60c)
Memory 256 GB
GPU Cores 60
Software
oMLX v0.2.20.dev2
macOS macOS 26.3.1
Context 1,024