← Back to community benchmarks

Qwen3.5-0.8B

M1 (5c) · 8 GB · 8bit · 2026-03-19
Performance
1k
tokens
1,042
PP tok/s
58.1
TG tok/s
983
TTFT (ms)
1.6
Peak mem (GB)
Hardware
Chip M1 (5c)
Memory 8 GB
GPU Cores 5
Software
oMLX v0.2.18
macOS macOS 26.4
Context 1,024
Performance by Context Length
Context PP tok/s TG tok/s Peak Mem
1k 1,042 58.1 1.6 GB current
4k 974.4 48.4 2.3 GB view
Batching Results
Batch Size TG tok/s Speedup
58.1 1.00×
48.2 0.83×
119.4 2.06×