← Back to community benchmarks

gemma-4-E4B-TurboQuant

M4 Pro (16c) · 24 GB · 4bit · 2026-05-27
Performance
64k
tokens
756.2
PP tok/s
27.6
TG tok/s
86666
TTFT (ms)
7.1
Peak mem (GB)
Hardware
Chip M4 Pro (16c)
Memory 24 GB
GPU Cores 16
Software
oMLX v0.3.12
macOS macOS 26.4.1
Context 65,536