Performance
4k
tokens
345.5
PP tok/s
44.0
TG tok/s
11857
TTFT (ms)
26.2
Peak mem (GB)
Hardware
Chip
M4 Pro (20c)
Memory
64 GB
GPU Cores
20
Software
oMLX
v0.3.8
macOS
macOS 26.3
Context
4,096