mlx-Chronos Leaderboard

Community benchmark results for local LLM inference on Apple Silicon. — GitHub · Methodology · Submit your results

Chip RAM Engine Model Quant tok/s ↓ TTFT cold (s) TTFT cached (s) Base RAM Load Engine RAM (GB) Thermal Trials Date