← Back to community benchmarks

Qwen3.5-0.8B

M4 (10c) · 16 GB · 8bit · 2026-03-11
Performance
16k
tokens
1,890
PP tok/s
81.2
TG tok/s
8668
TTFT (ms)
2.4
Peak mem (GB)
Hardware
Chip M4 (10c)
Memory 16 GB
GPU Cores 10
Software
oMLX v0.2.7
macOS macOS 26.3.1
Context 16,384