Community Benchmarks
Real performance numbers from oMLX users around the world. Submit yours from the app (v0.2.6+).
| # | Chip | RAM | Model | Quant | Context | Prompt processing (tok/s) | Token generation (tok/s) | Date (YY-MM-DD) |
|---|---|---|---|---|---|---|---|---|
| 206601 | M1 Max (32c) | 64 GB | gemma-4-26B-A4B-it-oQ3 | 3bit | 32k | 544.7 | 24.2 | 26-04-16 |
| 206602 | M3 Ultra (80c) | 512 GB | Nemotron-3-Super-120B-A12... | 6bit | 1k | 596.0 | 45.0 | 26-04-16 |
| 206603 | M2 Max (38c) | 64 GB | Qwen3.5-35B-A3B | 8bit | 128k | 375.0 | 17.2 | 26-04-16 |
| 206604 | M2 Max (38c) | 64 GB | Qwen3.5-35B-A3B | 8bit | 64k | 545.8 | 31.4 | 26-04-16 |
| 206605 | M2 Max (38c) | 64 GB | Qwen3.5-35B-A3B | 8bit | 32k | 658.7 | 43.0 | 26-04-16 |
| 206606 | M2 Max (38c) | 64 GB | Qwen3.5-35B-A3B | 8bit | 16k | 727.1 | 52.4 | 26-04-16 |
| 206607 | M2 Max (38c) | 64 GB | Qwen3.5-35B-A3B | 8bit | 8k | 757.4 | 58.4 | 26-04-16 |
| 206608 | M2 Max (38c) | 64 GB | Qwen3.5-35B-A3B | 8bit | 4k | 756.9 | 61.7 | 26-04-16 |
| 206609 | M2 Max (38c) | 64 GB | Qwen3.5-35B-A3B | 8bit | 1k | 625.9 | 65.5 | 26-04-16 |
| 206610 | M3 Ultra (80c) | 512 GB | gemma-4-31b-it | 4bit | 1k | 328.4 | 29.1 | 26-04-16 |