FlareStart
HomeNewsHow ToSources
FlareStart

Where developers start their day. All the tech news & tutorials that matter, in one place.

Quick Links

  • Home
  • News
  • Tutorials
  • Sources
  • Privacy Policy

Connect

© 2026 FlareStart. All rights reserved.

Back to articles
Reducing bootstrap memory cost in LLM agents
How-ToMachine Learning

Reducing bootstrap memory cost in LLM agents

via Dev.toSandy Shen2h ago

LLM agents are stateless by default. To get continuity, the standard approach is to load everything into the system prompt. Logs, past decisions, project state. It works, but it is wasteful. We were spending 3,500+ tokens on memory before the agent even started doing anything useful. If you load nothing, you get the opposite problem. The agent forgets preferences and repeats the same mistakes every session. We stopped trying to tune the context window and changed how memory is handled. Instead of loading everything at once, we split memory into three parts: Hot : A small set of curated facts that are always loaded, around 625 tokens. Warm : Recent logs from the last 7 days, only pulled in when needed. Cold : Older history stored externally and not loaded by default. Most of the time, the agent only needs one or two specific pieces of context. That simple change made a big difference. In our setup, bootstrap memory cost dropped from around 3,500 tokens to about 125 tokens, roughly a 96

Continue reading on Dev.to

Opens in a new tab

Read Full Article
0 views

Related Articles

How-To

Why New Bug Bounty Hunters Get Stuck — And How to Fix It

Medium Programming • 3h ago

Beyond the Code: Why the 7-Step Development Lifecycle is Your Competitive Advantage.‍
How-To

Beyond the Code: Why the 7-Step Development Lifecycle is Your Competitive Advantage.‍

Medium Programming • 4h ago

HadisKu Is Now Ad-Free: Why I Removed Ads From My Islamic App
How-To

HadisKu Is Now Ad-Free: Why I Removed Ads From My Islamic App

Dev.to • 6h ago

How-To

How To Be Productive — its not all about programming :)

Medium Programming • 6h ago

Welcome Thread - v371
How-To

Welcome Thread - v371

Dev.to • 6h ago

Discover More Articles