"The human might be asleep." One line in Karpathy's program.md started 100 automatic experiments per night.

The biggest bottleneck in code optimization is the human in the loop. You think of an idea, implement it, test it, check the results, then think again. In March 2026, Andrej Karpathy removed that bottleneck. He released autoresearch, a tool that lets an AI agent edit code, run experiments, evaluate results, and keep or discard changes automatically. It hit 42,921 GitHub stars in under two weeks (GitHub API, 2026-03-19 11:56 UTC).

The surprising part is where it spread. Shopify CEO Tobi Lutke applied the pattern to Liquid, a template engine running in production for 20 years. He reported a 53% reduction in parse+render time in PR #2056. LangChain CEO hwchase17 used it to optimize agent quality scores. Ole Lehmann reported raising a Claude Code skill eval score from 56% to 92%. This is not an ML research tool anymore. It is a pattern for any task with a measurable metric.

Why three files are enough

The architecture is stripped to the minimum. There are three core files. program.md is the i
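The edit, run, evaluate, keep-or-discard cycle described in the introduction can be illustrated with a small loop. This is a minimal sketch of the general pattern, not autoresearch's actual code: the function names (`keep_or_discard_loop`, `toy_metric`, `toy_propose`) are hypothetical, the "code under optimization" is reduced to a single number, and a random tweak stands in for the AI agent's edit. It assumes a metric where lower is better, as with a parse+render time.

```python
import random
from typing import Callable, List, Tuple


def keep_or_discard_loop(
    initial: float,
    propose: Callable[[float, random.Random], float],
    evaluate: Callable[[float], float],
    n_experiments: int,
    seed: int = 0,
) -> Tuple[float, float, List[bool]]:
    """Run n_experiments rounds of propose -> evaluate -> keep or discard.

    Hypothetical stand-in for an agent-driven loop: `propose` plays the
    role of the agent editing the code, `evaluate` plays the role of the
    experiment that produces the metric (lower is better).
    """
    rng = random.Random(seed)
    best = initial
    best_score = evaluate(best)
    history: List[bool] = []  # True where a change was kept

    for _ in range(n_experiments):
        candidate = propose(best, rng)   # "edit code"
        score = evaluate(candidate)      # "run experiment, evaluate result"
        kept = score < best_score        # keep only strict improvements
        if kept:
            best, best_score = candidate, score
        history.append(kept)             # discarded changes leave `best` untouched

    return best, best_score, history


def toy_metric(x: float) -> float:
    """Toy metric (assumption): squared distance from an optimum at 3.0."""
    return (x - 3.0) ** 2


def toy_propose(current: float, rng: random.Random) -> float:
    """Toy edit (assumption): nudge the current value by a random amount."""
    return current + rng.uniform(-1.0, 1.0)


# 100 experiments, echoing the "100 automatic experiments per night" framing.
best, best_score, history = keep_or_discard_loop(
    initial=0.0,
    propose=toy_propose,
    evaluate=toy_metric,
    n_experiments=100,
)
print(f"kept {sum(history)} of {len(history)} changes, final score {best_score:.4f}")
```

The key design choice is that a discarded change never alters the baseline, so the metric can only improve or stay flat from one experiment to the next. That is why the pattern needs a measurable metric: without one, there is no way to decide between keeping and discarding.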

