gemma-4-26B-A4B-it-oQ3

M4 Pro (16c) · 24 GB · 3bit · 2026-05-05
Performance
Tokens 64k
PP tok/s 418.1
TG tok/s 31.0
TTFT (ms) 156743
Peak mem (GB) 14.8
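The PP tok/s and TTFT figures appear to describe the same prefill pass. A minimal sketch of that arithmetic follows, assuming PP tok/s is simply prompt tokens divided by time to first token and that the prompt filled the full 65,536-token context; the page does not state how oMLX computes either figure. The names prefill_rate and generation_seconds are hypothetical helpers, not part of oMLX.

```python
# Sanity check on the reported numbers.
# ASSUMPTION: PP tok/s = prompt tokens / TTFT, with a 65,536-token prompt.
# The page does not document how these metrics are derived.

PROMPT_TOKENS = 65_536   # reported context size
TTFT_MS = 156_743        # reported time to first token, in milliseconds
TG_TOK_S = 31.0          # reported generation speed, in tokens per second


def prefill_rate(prompt_tokens: int, ttft_ms: float) -> float:
    """Prompt-processing speed in tokens per second implied by the TTFT."""
    return prompt_tokens / (ttft_ms / 1000.0)


def generation_seconds(n_tokens: int, tg_tok_s: float) -> float:
    """Time to generate n_tokens at a steady generation speed."""
    return n_tokens / tg_tok_s


implied_pp = prefill_rate(PROMPT_TOKENS, TTFT_MS)
print(f"Implied PP tok/s: {implied_pp:.1f}")  # matches the reported 418.1

# Hypothetical 1,000-token reply at the reported generation speed.
print(f"1,000 tokens take about {generation_seconds(1000, TG_TOK_S):.1f} s")
```

Under that assumption the implied rate agrees with the reported 418.1 tok/s, which suggests the roughly 157-second TTFT is almost entirely prompt processing at this context length.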
Hardware
Chip M4 Pro (16c)
Memory 24 GB
GPU Cores 16
Software
oMLX v0.3.8
macOS 26.3
Context 65,536