← Back to community benchmarks

gemma-4-31B-it

M2 Max (38c) · 64 GB · 8bit · 2026-05-15
Performance
64k
tokens
80.4
PP tok/s
7.3
TG tok/s
815568
TTFT (ms)
41.8
Peak mem (GB)
Hardware
Chip M2 Max (38c)
Memory 64 GB
GPU Cores 38
Software
oMLX v0.3.9.dev2
macOS macOS 26.5
Context 65,536