← Back to community benchmarks

Qwen3-0.6B

M2 (8c) · 16 GB · 4bit · 2026-03-12

Performance

1k

tokens

389.9

PP tok/s

37.4

TG tok/s

2627

TTFT (ms)

1.0

Peak mem (GB)

Hardware

Chip M2 (8c)

Memory 16 GB

GPU Cores 8

Software

oMLX v0.2.9

macOS macOS 26.3.1

Context 1,024

Batching Results

Batch Size	TG tok/s	Speedup
1×	37.4	1.00×
2×	46.2	1.24×