← Back to community benchmarks

GLM-4.7-Flash

M5 Pro (20c) · 64 GB · 8bit · 2026-03-21
Performance
32k
tokens
374.8
PP tok/s
6.0
TG tok/s
87420
TTFT (ms)
41.3
Peak mem (GB)
Hardware
Chip M5 Pro (20c)
Memory 64 GB
GPU Cores 20
Software
oMLX v0.2.20.dev1
macOS macOS 26.3.1
Context 32,768