← Back to community benchmarks

GLM-5.1-MXFP4-Q8

M3 Ultra (80c) · 512 GB · 8bit · 2026-04-13
Performance
4k
tokens
180.9
PP tok/s
12.1
TG tok/s
22641
TTFT (ms)
381.8
Peak mem (GB)
Hardware
Chip M3 Ultra (80c)
Memory 512 GB
GPU Cores 80
Software
oMLX v0.3.4
macOS macOS 26.3.1
Context 4,096