← Back to community benchmarks

Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled-oQ8e

M2 Ultra (60c) · 192 GB · 8bit · 2026-05-03
Performance
4k
tokens
1,848
PP tok/s
72.9
TG tok/s
2217
TTFT (ms)
36.2
Peak mem (GB)
Hardware
Chip M2 Ultra (60c)
Memory 192 GB
GPU Cores 60
Software
oMLX v0.3.8
macOS macOS 26.4.1
Context 4,096
Performance by Context Length
Context PP tok/s TG tok/s Peak Mem
4k 1,848 72.9 36.2 GB current
8k 2,007 74.6 36.5 GB view
16k 1,944 68.1 37.3 GB view
32k 1,701 65.7 38.5 GB view