gemma-4-E4B-it-heretic-oQ5

M4 Pro (20c) · 24 GB · 5bit · 2026-05-23
Performance
Tokens 64k
PP tok/s 963.7
TG tok/s 29.2
TTFT (ms) 68008
Peak mem (GB) 7.7
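The reported figures can be cross-checked against each other. The sketch below assumes that "64k" means the 65,536-token context listed under Software and that time to first token is dominated by prompt processing (prefill); the function name `estimate_prefill_ms` is hypothetical and not part of oMLX.

```python
def estimate_prefill_ms(prompt_tokens: int, pp_tok_per_s: float) -> float:
    """Approximate prefill time in milliseconds.

    Assumption: TTFT is roughly prompt_tokens / prompt-processing speed,
    ignoring model load time and the first decode step.
    """
    if pp_tok_per_s <= 0:
        raise ValueError("pp_tok_per_s must be positive")
    return prompt_tokens / pp_tok_per_s * 1000.0


# Figures taken from this benchmark entry.
PROMPT_TOKENS = 65_536      # assumed meaning of "64k"
PP_TOK_PER_S = 963.7        # reported PP tok/s
REPORTED_TTFT_MS = 68_008   # reported TTFT (ms)

estimated_ms = estimate_prefill_ms(PROMPT_TOKENS, PP_TOK_PER_S)
gap_ms = abs(estimated_ms - REPORTED_TTFT_MS)
print(f"estimated prefill: {estimated_ms:.0f} ms, gap vs reported TTFT: {gap_ms:.0f} ms")
```

Under these assumptions the estimate comes out at about 68.0 s, within a few milliseconds of the reported TTFT, which suggests the PP tok/s and TTFT values describe the same 64k-token prefill.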
Hardware
Chip M4 Pro (20c)
Memory 24 GB
GPU Cores 20
Software
oMLX v0.3.9
macOS 26.5
Context 65,536