2.78 TFLOPS on a Fanless MacBook Air? Benchmarking Apple's M4 with MLX

(This article is an English translation of a post originally published in Japanese on my blog. You can read the original Japanese version here ) . My fanless M4 MacBook Air hit 2.78 TFLOPS in a matrix multiplication benchmark using Apple's MLX framework. Matrix multiplication (GEMM) isn't just a basic math problem; it's the beating heart of modern Machine Learning and Large Language Models (LLMs). By measuring how fast a machine can multiply huge matrices, we are essentially measuring its raw capability to run AI locally. Let's see what the M4 chip can do. < Test Environment > Machine: M4 Macbook Air Memory: 16GB Python: 3.10.11 Framework: MLX v0.28.0 1. Measuring Execution Time To measure the execution time, I used a simple matrix multiplication operation. 1.1 The Benchmark Script I've published the measurement script on GitHub Gist . Feel free to download it and test it on your own Apple Silicon Mac. Note: I ran the benchmark last summer. I have recently verified that the script stil

2.78 TFLOPS on a Fanless MacBook Air? Benchmarking Apple's M4 with MLX

Related Articles

10 Lessons I Learned from a Principal Engineer That Made Me a Better Developer

The Best Developers I Know Have Stopped Learning.

How to Structure Large Flutter Projects Like Senior Developers

Why the Monolith is a Dead End for the Weekend Indie Developer

Understand OpenClaw by Building One —Part 3

Related Articles

How-To
10 Lessons I Learned from a Principal Engineer That Made Me a Better Developer
Medium Programming • 3h ago

How-To
The Best Developers I Know Have Stopped Learning.
Medium Programming • 3h ago

How-To
How to Structure Large Flutter Projects Like Senior Developers
Medium Programming • 3h ago

How-To
Why the Monolith is a Dead End for the Weekend Indie Developer
Medium Programming • 3h ago

How-To
Understand OpenClaw by Building One —Part 3
Medium Programming • 4h ago