Community Benchmarks

Real performance numbers from oMLX users around the world. Submit yours from the app (v0.2.6+).

| # | Chip | RAM | Model | Quant | Ctx | PP tok/s | TG tok/s | Date |
|---|---|---|---|---|---|---|---|---|
| 35741 | M3 Ultra (80c) | 512 GB | Qwen3.5-9B | 4bit | 1k | 1,465 | 105.7 | 26-03-09 |
| 35742 | M3 Ultra (80c) | 512 GB | Qwen3.5-9B | 4bit | 4k | 1,485 | 101.0 | 26-03-09 |
| 35743 | M4 (10c) | 16 GB | Qwen3.5-9B | 4bit | 4k | 221.7 | 19.2 | 26-03-09 |
| 35744 | M4 (10c) | 16 GB | Qwen3.5-9B | 4bit | 1k | 221.7 | 19.4 | 26-03-09 |
| 35745 | M5 (10c) | 24 GB | Qwen3.5-0.8B | 8bit | 4k | 2,382 | 130.0 | 26-03-09 |
| 35746 | M5 (10c) | 24 GB | Qwen3.5-0.8B | 8bit | 1k | 2,302 | 132.1 | 26-03-09 |
| 35747 | M4 Pro (16c) | 24 GB | Qwen3.5-9B | 4bit | 1k | 361.4 | 49.7 | 26-03-09 |
| 35748 | M3 (10c) | 24 GB | Llama-3.2-1B-Instruct | 4bit | 64k | 278.2 | 8.1 | 26-03-09 |
| 35749 | M3 (10c) | 24 GB | Llama-3.2-1B-Instruct | 4bit | 32k | 596.2 | 32.2 | 26-03-09 |
| 35750 | M3 (10c) | 24 GB | Llama-3.2-1B-Instruct | 4bit | 16k | 898.6 | 57.1 | 26-03-09 |
36,826 results · Page 3575 of 3683