Community Benchmarks

Download oMLX →

Real performance numbers from oMLX users around the world. Submit yours from the app (v0.2.6+).

# Chip RAM Model Quant Ctx PP tok/s TG tok/s Date ↓
11381 M1 Max (32c) 64 GB Qwen3.5-35B-A3B 4bit 4k 585.0 60.1 26-03-09
11382 M1 Max (32c) 64 GB Qwen3.5-9B 8bit 1k 304.3 35.6 26-03-09
11383 M1 Max (32c) 64 GB Qwen3.5-9B 8bit 4k 319.1 34.7 26-03-09
11384 M3 Ultra (80c) 512 GB GLM-5-4.8bit 4bit 32k 73.2 12.3 26-03-09
11385 M3 Ultra (80c) 512 GB GLM-5-4.8bit 4bit 16k 119.8 13.0 26-03-09
11386 M3 Ultra (80c) 512 GB GLM-5-4.8bit 4bit 8k 156.3 13.5 26-03-09
11387 M3 Ultra (80c) 512 GB GLM-5-4.8bit 4bit 4k 182.6 13.7 26-03-09
11388 M3 Ultra (80c) 512 GB GLM-5-4.8bit 4bit 1k 188.7 16.6 26-03-09
11389 M1 (7c) 16 GB Qwen3.5-0.8B 4bit 64k 408.6 17.3 26-03-09
11390 M1 (7c) 16 GB Qwen3.5-0.8B 4bit 32k 552.2 47.8 26-03-09
11,553 results · Page 1139 of 1156