← Back to community benchmarks

Qwen3.5-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled

M2 Max (38c) · 32 GB · 4bit · 2026-04-12
Performance
4k
tokens
762.0
PP tok/s
80.9
TG tok/s
5375
TTFT (ms)
20.0
Peak mem (GB)
Hardware
Chip M2 Max (38c)
Memory 32 GB
GPU Cores 38
Software
oMLX v0.3.5.dev1
macOS macOS 26.4
Context 4,096