Community Benchmarks
Real performance numbers from oMLX users around the world. Submit yours from the app (v0.2.6+).
| # | Chip | RAM | Model | Quant | Ctx | PP tok/s | TG tok/s | Date |
|---|---|---|---|---|---|---|---|---|
| 3221 | M2 Ultra (76c) | 128 GB | Qwen3.5-4B | 8bit | 1k | 1,423 | 99.9 | 26-03-09 |
| 3222 | M4 (10c) | 24 GB | Llama-3.2-3B-Instruct | 4bit | 1k | 520.4 | 45.2 | 26-03-09 |
| 3223 | M4 (10c) | 24 GB | Llama-3.2-3B-Instruct | 4bit | 4k | 478.6 | 37.0 | 26-03-09 |
| 3224 | M4 (10c) | 24 GB | Qwen3.5-4B | 4bit | 4k | 380.6 | 34.1 | 26-03-09 |
| 3225 | M4 (10c) | 24 GB | Qwen3.5-4B | 4bit | 1k | 378.3 | 35.1 | 26-03-09 |
| 3226 | M2 Ultra (76c) | 128 GB | Qwen3.5-0.8B | bf16 | 64k | 4,660 | 83.2 | 26-03-09 |
| 3227 | M4 (10c) | 16 GB | Qwen3.5-4B | 4bit | 1k | 400.3 | 38.8 | 26-03-09 |
| 3228 | M4 (10c) | 16 GB | Qwen3.5-0.8B | 8bit | 1k | 2,051 | 99.8 | 26-03-09 |
| 3229 | M4 Max (40c) | 128 GB | Qwen3.5-9B | 8bit | 4k | 890.4 | 50.9 | 26-03-09 |
| 3230 | M4 Max (40c) | 128 GB | Qwen3.5-9B | 8bit | 1k | 878.9 | 52.1 | 26-03-09 |
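
To read the throughput columns as wall-clock time, the rough sketch below parses one table row and estimates how long a request might take. It assumes that "PP tok/s" means prompt-processing speed and "TG tok/s" means token-generation speed, and that both rates stay constant over the request, which is a simplification. The names `parse_row` and `estimate_seconds` are hypothetical helpers for illustration and are not part of oMLX.

```python
def parse_row(line: str) -> dict:
    """Split one Markdown table row from the benchmark table into a typed record."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    num, chip, ram, model, quant, ctx, pp, tg, date = cells
    return {
        "id": int(num),
        "chip": chip,
        "ram": ram,
        "model": model,
        "quant": quant,
        "ctx": ctx,
        # Some values use a thousands separator, e.g. "1,423".
        "pp_tok_s": float(pp.replace(",", "")),
        "tg_tok_s": float(tg.replace(",", "")),
        "date": date,
    }


def estimate_seconds(row: dict, prompt_tokens: int, output_tokens: int) -> float:
    """Estimate request time, assuming constant PP and TG rates (a simplification)."""
    return prompt_tokens / row["pp_tok_s"] + output_tokens / row["tg_tok_s"]


row = parse_row(
    "| 3229 | M4 Max (40c) | 128 GB | Qwen3.5-9B | 8bit | 4k | 890.4 | 50.9 | 26-03-09 |"
)
# Estimated seconds for a 4,000-token prompt followed by 500 generated tokens.
print(round(estimate_seconds(row, 4000, 500), 1))
```

Under these assumptions, generation dominates the total for this row: the 500 output tokens take roughly twice as long as the 4,000-token prompt.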