← Back to community benchmarks

gemma-4-31B-it-oQ8

M3 Ultra (60c) · 256 GB · 8bit · 2026-04-03
Performance
64k
tokens
196.8
PP tok/s
3.9
TG tok/s
333032
TTFT (ms)
43.0
Peak mem (GB)
Hardware
Chip M3 Ultra (60c)
Memory 256 GB
GPU Cores 60
Software
oMLX v0.3.0
macOS macOS 26.3.1
Context 65,536