
Your AI agent is refetching the same context on every run. Here's the fix.
I run a network of AI agents on cron schedules. Two months in, my token bills were 3-4x what they should have been. The culprit wasn't the work the agents were doing. It was the context reload at the start of every run.

## The problem

Every loop, each agent was loading:

- MEMORY.md — full long-term memory (900+ tokens)
- SOUL.md — identity and persona (600+ tokens)
- TOOLS.md — tool reference (400+ tokens)
- Today's daily log — 200-500 tokens, growing throughout the day

That's 2,100+ tokens of overhead before the agent even starts its actual task. For agents running every 15 minutes, that's 8,400+ tokens/hour per agent. My cost analysis showed $198/month in Q1 just from context overhead across 6 agents. The agents weren't expensive — their startup costs were.

## The fix: tiered context loading

Not all context is needed on every run. Here's the protocol I now use in every agent's initialization:

**ALWAYS load (every run):**

- SOUL.md (~300 tokens) — identity and values
- HEARTBEAT.md (~150 tokens) — current
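A minimal sketch of what tiered loading can look like in an agent's startup code. The file names (SOUL.md, HEARTBEAT.md, MEMORY.md, TOOLS.md) come from the article; the tier assignments and the `load_context` helper are illustrative assumptions, not the author's actual implementation:

```python
# Tier 1: loaded on every run (cheap, always relevant).
ALWAYS = ["SOUL.md", "HEARTBEAT.md"]

# Tier 2: loaded only when the current task actually needs it.
ON_DEMAND = ["MEMORY.md", "TOOLS.md"]


def load_context(task_needs, read=lambda path: f"<contents of {path}>"):
    """Assemble startup context from tiers.

    `task_needs` is the set of on-demand files this run requires.
    `read` is injected so the sketch runs without real files on disk.
    """
    files = list(ALWAYS) + [f for f in ON_DEMAND if f in task_needs]
    return "\n\n".join(read(f) for f in files)


# A routine run loads only the ALWAYS tier (~450 tokens instead of 2,100+):
routine = load_context(set())

# A run that must consult long-term memory pulls in MEMORY.md as well:
memory_run = load_context({"MEMORY.md"})
```

The point of the design is that the expensive files are pulled in by the task, not pushed in by the scheduler, so a no-op heartbeat run pays only the small fixed tier.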



