← Back to community benchmarks

gemma-4-31b-it

M2 Max (30c) · 64 GB · 6bit · 2026-04-12
Performance
1k
tokens
74.2
PP tok/s
11.0
TG tok/s
13795
TTFT (ms)
24.7
Peak mem (GB)
Hardware
Chip M2 Max (30c)
Memory 64 GB
GPU Cores 30
Software
oMLX v0.3.5.dev1
macOS macOS 26.4
Context 1,024