Community Benchmarks
Download oMLX →Real performance numbers from oMLX users around the world. Submit yours from the app (v0.2.6+).
| # | Chip | RAM | Model | Quant | Ctx | PP tok/s | TG tok/s | Date ↓ |
|---|---|---|---|---|---|---|---|---|
| 41011 | M3 Ultra (80c) | 512 GB | Qwen3.5-35B-A3B-Heretic | 6bit | 4k | 2,168 | 52.5 | 26-03-11 |
| 41012 | M3 Ultra (80c) | 512 GB | Qwen3.5-35B-A3B-Heretic | 6bit | 1k | 2,018 | 53.7 | 26-03-11 |
| 41013 | M1 (8c) | 16 GB | gemma-3-4b-it-qat | 4bit | 4k | 160.5 | 22.8 | 26-03-11 |
| 41014 | M1 (8c) | 16 GB | gemma-3-4b-it-qat | 4bit | 1k | 161.2 | 23.2 | 26-03-11 |
| 41015 | M4 (10c) | 16 GB | Qwen2.5-Coder-7B-Instruct | 4bit | 1k | 251.0 | 23.7 | 26-03-11 |
| 41016 | M4 (10c) | 16 GB | Qwen2.5-Coder-7B-Instruct | 4bit | 4k | 245.8 | 22.3 | 26-03-11 |
| 41017 | M3 Ultra (60c) | 96 GB | InternVL3-38B | 8bit | 1k | 255.9 | 19.0 | 26-03-11 |
| 41018 | M3 Ultra (60c) | 96 GB | InternVL3-38B | 8bit | 4k | 256.1 | 17.9 | 26-03-11 |
| 41019 | M4 (10c) | 16 GB | Qwen3.5-4B | 4bit | 4k | 276.6 | 31.4 | 26-03-11 |
| 41020 | M4 (10c) | 16 GB | Qwen3.5-4B | 4bit | 1k | 282.3 | 33.4 | 26-03-11 |