gemma-4-E4B-TurboQuant

M4 Pro (16c) · 24 GB · 8bit · 2026-05-27
Performance
Tokens 64k
PP tok/s 762.5
TG tok/s 25.8
TTFT (ms) 85951
Peak mem (GB) 10.5
Hardware
Chip M4 Pro (16c)
Memory 24 GB
GPU Cores 16
Software
oMLX v0.3.12
macOS 26.4.1
Context 65,536