qwen-model

M1 (8c) · 16 GB · 6bit · 2026-03-16

Performance

1k tokens

PP tok/s 74.6

TG tok/s 8.1

TTFT (ms) 13721

Peak mem (GB) 7.8
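
The headline numbers can be cross-checked against each other. The sketch below assumes that time to first token (TTFT) is dominated by prompt processing (PP), so that PP tok/s is roughly the prompt length divided by TTFT; the page does not state how oMLX derives these values, and the function name `pp_rate_from_ttft` is hypothetical.

```python
def pp_rate_from_ttft(prompt_tokens: int, ttft_ms: float) -> float:
    """Estimate prompt-processing throughput (tok/s) from TTFT.

    Assumption: TTFT is almost entirely prompt processing time.
    """
    return prompt_tokens / (ttft_ms / 1000.0)


# Values reported on this page for the 1k run (context of 1,024 tokens).
estimated_pp = pp_rate_from_ttft(prompt_tokens=1024, ttft_ms=13721)
print(f"Estimated PP: {estimated_pp:.1f} tok/s")
```

Under that assumption the estimate agrees with the reported 74.6 PP tok/s, which suggests the two figures come from the same measurement.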

Hardware

Chip M1 (8c)

Memory 16 GB

GPU Cores 8

Software

oMLX v0.2.13

macOS 26.2

Context 1,024

Performance by Context Length

Context	PP tok/s	TG tok/s	Peak Mem
1k	74.6	8.1	7.8 GB
4k	74.4	7.3	8.0 GB

Batching Results

Batch Size	TG tok/s	Speedup
1×	8.1	1.00×
2×	14.1	1.74×
4×	12.0	1.48×
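
The Speedup column appears to be the batched TG rate divided by the single-request rate. The sketch below reproduces it and also derives a per-request rate, assuming (this is not stated on the page) that TG tok/s at batch sizes above 1 is the aggregate throughput across all concurrent requests. The names `tg_by_batch`, `speedups`, and `per_request_rate` are illustrative only.

```python
# TG tok/s by batch size, copied from the Batching Results table.
tg_by_batch = {1: 8.1, 2: 14.1, 4: 12.0}


def speedups(tg: dict[int, float]) -> dict[int, float]:
    """Speedup of each batch size relative to batch size 1."""
    baseline = tg[1]
    return {batch: round(rate / baseline, 2) for batch, rate in tg.items()}


def per_request_rate(tg: dict[int, float]) -> dict[int, float]:
    """Per-request tok/s, ASSUMING the table reports aggregate throughput."""
    return {batch: rate / batch for batch, rate in tg.items()}


for batch, speedup in speedups(tg_by_batch).items():
    each = per_request_rate(tg_by_batch)[batch]
    print(f"{batch}x: speedup {speedup:.2f}x, about {each:.2f} tok/s per request")
```

Read this way, aggregate throughput peaks at a batch size of 2 on this machine and falls back at 4, while each individual request gets slower as the batch grows.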