← Back to community benchmarks

Qwen2.5-VL-3B-Instruct

M4 (10c) · 16 GB · unknown · 2026-03-14
Performance
4k
tokens
537.5
PP tok/s
14.1
TG tok/s
7621
TTFT (ms)
7.6
Peak mem (GB)
Hardware
Chip M4 (10c)
Memory 16 GB
GPU Cores 10
Software
oMLX v0.2.10
macOS macOS 26.1
Context 4,096