Community Benchmarks

Real performance numbers from oMLX users around the world. Submit yours from the app (v0.2.6+).

| # | Chip | RAM | Model | Quant | Ctx | PP tok/s | TG tok/s | Date |
|---|---|---|---|---|---|---|---|---|
| 290541 | M4 (10c) | 24 GB | Llama-3.2-3B-Instruct | 4bit | 1k | 520.4 | 45.2 | 26-03-09 |
| 290542 | M4 (10c) | 24 GB | Llama-3.2-3B-Instruct | 4bit | 4k | 478.6 | 37.0 | 26-03-09 |
| 290543 | M4 (10c) | 24 GB | Qwen3.5-4B | 4bit | 4k | 380.6 | 34.1 | 26-03-09 |
| 290544 | M4 (10c) | 24 GB | Qwen3.5-4B | 4bit | 1k | 378.3 | 35.1 | 26-03-09 |
| 290545 | M2 Ultra (76c) | 128 GB | Qwen3.5-0.8B | bf16 | 64k | 4,660 | 83.2 | 26-03-09 |
| 290546 | M4 (10c) | 16 GB | Qwen3.5-4B | 4bit | 1k | 400.3 | 38.8 | 26-03-09 |
| 290547 | M4 (10c) | 16 GB | Qwen3.5-0.8B | 8bit | 1k | 2,051 | 99.8 | 26-03-09 |
| 290548 | M4 Max (40c) | 128 GB | Qwen3.5-9B | 8bit | 4k | 890.4 | 50.9 | 26-03-09 |
| 290549 | M4 Max (40c) | 128 GB | Qwen3.5-9B | 8bit | 1k | 878.9 | 52.1 | 26-03-09 |
| 290550 | M4 (10c) | 16 GB | Qwen3.5-0.8B | 8bit | 1k | 1,656 | 90.4 | 26-03-09 |
290,750 results · Page 29055 of 29075
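
To put the throughput columns in practical terms, the sketch below turns one row into a rough wall-clock estimate. It rests on assumptions the page does not state: that "PP tok/s" is prompt-processing speed and "TG tok/s" is token-generation speed, that "4k" means 4096 tokens, and that both rates stay constant over the run. The function name `estimate_seconds` and the 256-token output length are hypothetical, chosen only for illustration.

```python
def estimate_seconds(prompt_tokens: int, output_tokens: int,
                     pp_tok_s: float, tg_tok_s: float) -> float:
    """Rough end-to-end time: prompt processing plus token generation.

    Assumes PP and TG rates are constant, which real runs only approximate.
    """
    return prompt_tokens / pp_tok_s + output_tokens / tg_tok_s


# Row 290542: M4 (10c), 24 GB, Llama-3.2-3B-Instruct 4bit, 4k context.
# Assumption: "4k" = 4096 prompt tokens; 256 output tokens is an arbitrary choice.
seconds = estimate_seconds(prompt_tokens=4096, output_tokens=256,
                           pp_tok_s=478.6, tg_tok_s=37.0)
print(f"Estimated time: {seconds:.1f} s")
```

Under these assumptions the prompt takes longer to process than the 256 output tokens take to generate, which suggests why both columns matter when comparing rows at larger context sizes.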