Community Benchmarks

Real performance numbers from oMLX users around the world. Submit yours from the app (v0.2.6+).

#   Chip            RAM     Model                         Quant  Ctx  PP tok/s  TG tok/s  Date
1   M3 Ultra (80c)  256 GB  Qwen3.5-122B-A10B             8bit   1k   863.0     42.9      26-03-09
2   M3 Ultra (80c)  256 GB  Qwen3.5-122B-A10B             8bit   4k   911.3     41.4      26-03-09
3   M2 Max (30c)    64 GB   Qwen3-Coder-Next              4bit   4k   441.6     56.1      26-03-09
4   M2 Max (30c)    64 GB   Qwen3-Coder-Next              4bit   1k   402.4     59.7      26-03-09
5   M2 Max (30c)    64 GB   Qwen3-Coder-Next              4bit   4k   442.0     56.9      26-03-09
6   M2 Max (30c)    64 GB   Qwen3-Coder-Next              4bit   1k   400.0     60.0      26-03-09
7   M1 Max (32c)    64 GB   Qwen3-Coder-30B-A3B-Instr...  8bit   1k   444.5     44.7      26-03-09
8   M1 Max (32c)    64 GB   Qwen3-Coder-30B-A3B-Instr...  8bit   4k   462.4     37.1      26-03-09
9   M1 Max (32c)    32 GB   Qwen3.5-9B                    4bit   1k   312.5     56.6      26-03-09
10  M1 Max (32c)    32 GB   Qwen3.5-9B                    4bit   4k   317.4     53.8      26-03-09
208 results · Page 1 of 21