Community Benchmarks
Download oMLX →Real performance numbers from oMLX users around the world. Submit yours from the app (v0.2.6+).
| # | Chip | RAM | Model | Quant | Ctx | PP tok/s | TG tok/s | Date ↓ |
|---|---|---|---|---|---|---|---|---|
| 35371 | M3 (10c) | 24 GB | Llama-3.2-1B-Instruct | 4bit | 64k | 278.2 | 8.1 | 26-03-09 |
| 35372 | M3 (10c) | 24 GB | Llama-3.2-1B-Instruct | 4bit | 32k | 596.2 | 32.2 | 26-03-09 |
| 35373 | M3 (10c) | 24 GB | Llama-3.2-1B-Instruct | 4bit | 16k | 898.6 | 57.1 | 26-03-09 |
| 35374 | M3 (10c) | 24 GB | Llama-3.2-1B-Instruct | 4bit | 8k | 1,130 | 59.3 | 26-03-09 |
| 35375 | M3 (10c) | 24 GB | Llama-3.2-1B-Instruct | 4bit | 4k | 1,216 | 76.7 | 26-03-09 |
| 35376 | M3 (10c) | 24 GB | Llama-3.2-1B-Instruct | 4bit | 1k | 1,249 | 102.4 | 26-03-09 |
| 35377 | M4 (8c) | 16 GB | Llama-3.2-3B-Instruct | 4bit | 1k | 419.8 | 47.9 | 26-03-09 |
| 35378 | M4 (8c) | 16 GB | Llama-3.2-3B-Instruct | 4bit | 4k | 382.4 | 37.8 | 26-03-09 |
| 35379 | M2 Pro (16c) | 16 GB | Qwen3.5-9B | 8bit | 1k | 171.5 | 20.0 | 26-03-09 |
| 35380 | M2 Pro (16c) | 16 GB | Qwen3.5-9B | 8bit | 4k | 172.3 | 18.1 | 26-03-09 |