Ollama boosts Mac AI model performance with MLX
Ars Technica
Ollama, a runtime for running large language models locally, now supports Apple's MLX framework, significantly improving performance on Apple Silicon Macs. The update makes better use of unified memory, improves caching, and adds support for Nvidia's NVFP4 compression format. These changes aim to make running sophisticated AI models locally more practical and efficient, addressing user frustration with cloud service costs and rate limits. The MLX support is still in preview and calls for substantial hardware (such as 32 GB of RAM), but it marks a notable step for Mac users seeking powerful, private AI processing.
Tags
ai
product
Original Source
Ars Technica — arstechnica.com