gemma-4-26B-A4B-it

M5 Pro (16c) · 48 GB · 8bit · 2026-05-14
Performance
Context 8k tokens
PP tok/s 1,710
TG tok/s 48.6
TTFT (ms) 4,791
Peak mem (GB) 26.6
Hardware
Chip M5 Pro (16c)
Memory 48 GB
GPU Cores 16
Software
oMLX v0.3.9.dev2
macOS 26.3.2
Context 8,192
Performance by Context Length
Context   PP tok/s   TG tok/s   Peak Mem
4k        1,358      50.4       26.2 GB
8k        1,710      48.6       26.6 GB   (current)
16k       1,622      45.4       27.1 GB
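
The TTFT reported for the 8k run looks consistent with the prompt length divided by the PP throughput. This reading assumes PP means prompt processing (prefill) and that prefill dominates time to first token. The page does not state either point. A minimal sketch of that arithmetic, where `estimate_ttft_ms` is a hypothetical helper and not part of oMLX:

```python
def estimate_ttft_ms(prompt_tokens: int, pp_tok_per_s: float) -> float:
    """Estimate time to first token in milliseconds.

    Assumption: TTFT is dominated by prefill, so it is roughly
    prompt length divided by prompt-processing throughput.
    """
    return prompt_tokens / pp_tok_per_s * 1000.0


# Values from the 8k row: Context 8,192 tokens, PP 1,710 tok/s.
estimate = estimate_ttft_ms(8192, 1710)
print(f"estimated TTFT: {estimate:.0f} ms (reported: 4791 ms)")
```

The estimate is within a millisecond of the reported 4791 ms. The page gives no TTFT for the 4k and 16k rows, so the same check cannot be run for them.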