← Back to community benchmarks

gemma-4-26B-A4B-it-heretic-oQ8

M2 Max (38c) · 96 GB · 8bit · 2026-04-28
Performance
1k
tokens
411.3
PP tok/s
42.3
TG tok/s
2490
TTFT (ms)
25.8
Peak mem (GB)
Hardware
Chip M2 Max (38c)
Memory 96 GB
GPU Cores 38
Software
oMLX v0.3.8.dev3
macOS macOS 15.7.3
Context 1,024