← Back to community benchmarks

Qwen3-Coder-30B-A3B-Instruct

M3 Max (40c) · 64 GB · 4bit · 2026-03-16

Performance

32k

tokens

537.3

PP tok/s

32.4

TG tok/s

60984

TTFT (ms)

19.7

Peak mem (GB)

Hardware

Chip M3 Max (40c)

Memory 64 GB

GPU Cores 40

Software

oMLX v0.2.13

macOS macOS 26.3.1

Context 32,768