← Back to community benchmarks

Qwen3-8B

M4 (10c) · 16 GB · 4bit · 2026-03-19
Performance
4k
tokens
205.1
PP tok/s
10.7
TG tok/s
19967
TTFT (ms)
9.8
Peak mem (GB)
Hardware
Chip M4 (10c)
Memory 16 GB
GPU Cores 10
Software
oMLX v0.2.19
macOS macOS 26.1
Context 4,096