← Back to community benchmarks

Qwen2.5-7B-Instruct

M1 (8c) · 16 GB · 4bit · 2026-03-16

Performance

1k

tokens

132.3

PP tok/s

13.0

TG tok/s

7741

TTFT (ms)

4.5

Peak mem (GB)

Hardware

Chip M1 (8c)

Memory 16 GB

GPU Cores 8

Software

oMLX v0.2.10

macOS macOS 26.3.1

Context 1,024

Batching Results

Batch Size	TG tok/s	Speedup
1×	13.0	1.00×
2×	24.9	1.92×