← Back to community benchmarks

gemma-4-26B-A4B-it-oQ8

M3 Ultra (60c) · 256 GB · 8bit · 2026-04-03
Performance
64k
tokens
1,351
PP tok/s
25.0
TG tok/s
48497
TTFT (ms)
31.2
Peak mem (GB)
Hardware
Chip M3 Ultra (60c)
Memory 256 GB
GPU Cores 60
Software
oMLX v0.3.0
macOS macOS 26.3.1
Context 65,536