← Back to community benchmarks

gemma-4-26b-a4b-it-UD

M3 Max (40c) · 64 GB · 4bit · 2026-04-29
Performance
64k
tokens
801.2
PP tok/s
18.3
TG tok/s
81795
TTFT (ms)
18.1
Peak mem (GB)
Hardware
Chip M3 Max (40c)
Memory 64 GB
GPU Cores 40
Software
oMLX v0.3.6
macOS macOS 15.7.4
Context 65,536