← Back to community benchmarks

gemma-4-31b-it-mxfp8

M3 Ultra (60c) · 256 GB · 8bit · 2026-05-18
Performance
16k
tokens
237.9
PP tok/s
16.5
TG tok/s
68861
TTFT (ms)
33.8
Peak mem (GB)
Hardware
Chip M3 Ultra (60c)
Memory 256 GB
GPU Cores 60
Software
oMLX v0.3.8
macOS macOS 26.3.1
Context 16,384