gemma-4-E4B-RotorQuant
Performance
Tokens: 64k
PP tok/s: 581.5
TG tok/s: 8.6
TTFT (ms): 112705
Peak mem (GB): 10.9

Hardware
Chip: M5 (10c)
Memory: 32 GB
GPU Cores: 10

Software
oMLX: v0.3.10
macOS: 26.5
Context: 65,536
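
A quick consistency check on the reported numbers, as a rough sketch: if we assume TTFT is dominated by prompt processing of the full context (an assumption, not something the card states), then Context divided by PP tok/s should land close to the reported TTFT. The variable names below are illustrative only; the values are copied from the card.

```python
# Values copied from the benchmark card.
context_tokens = 65_536      # Context
pp_tok_per_s = 581.5         # PP tok/s
ttft_ms_reported = 112_705   # TTFT (ms)

# Assumption: time to first token ~= time to process the whole prompt.
ttft_ms_estimated = context_tokens / pp_tok_per_s * 1000

# Gap between the reported TTFT and the prompt-processing estimate.
diff_ms = ttft_ms_reported - ttft_ms_estimated

print(f"estimated TTFT: {ttft_ms_estimated:.1f} ms")
print(f"reported minus estimated: {diff_ms:.1f} ms")
```

Under that assumption the two figures agree to within a few milliseconds, which suggests the TTFT here is almost entirely prompt-processing time at a 65,536-token context.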