← Back to community benchmarks

GLM-4-9B-0414

M2 (10c) · 16 GB · 4bit · 2026-03-10
Performance
1k
tokens
155.6
PP tok/s
17.3
TG tok/s
6580
TTFT (ms)
5.8
Peak mem (GB)
Hardware
Chip M2 (10c)
Memory 16 GB
GPU Cores 10
Software
oMLX v0.2.6
macOS macOS 26.2
Context 1,024