GoodMonkey - 57% Reduction* in Claude Code Context via Extensible Proxy

via Dev.to Python · Scot Campbell

I've been running Claude Code since it launched: long sessions, heavy tool use, complex multi-file work. And I kept hitting the same wall. Around 200 turns, the model starts losing track. Responses slow down. It forgets decisions from 50 turns ago. Eventually, compaction kicks in and summarizes everything, which loses detail I actually need.

So I dug into what was actually being sent to the API. It turns out that 89-91% of every request payload is content the model has already processed and will never reference again: old grep results, file reads from 100 turns back, thinking blocks that produced a response which is already in the conversation. All of it rides along on every single API call, diluting the model's attention. I decided to do something about it.

What GoodMonkey Does

GoodMonkey is a local HTTP proxy that sits between your LLM agent and the Anthropic API. Requests and responses flow through a plugin pipeline where each plugin can inspect, transform, or block content, transparently.
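The plugin-pipeline idea above can be sketched in a few lines. This is a minimal illustration, not GoodMonkey's actual code: the `Pipeline` class, the `strip_old_tool_results` plugin, and the 10-turn window are all hypothetical, chosen only to show how plugins can transform or block a request before it reaches the API.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

# A plugin takes a request dict and returns a (possibly transformed)
# request, or None to block the request entirely.
Plugin = Callable[[dict], Optional[dict]]

@dataclass
class Pipeline:
    plugins: list = field(default_factory=list)

    def process(self, request: dict) -> Optional[dict]:
        """Run the request through each plugin in order."""
        for plugin in self.plugins:
            request = plugin(request)
            if request is None:  # a plugin blocked the request
                return None
        return request

def _without_tool_results(msg: dict) -> dict:
    """Drop tool_result content blocks from one message."""
    content = msg.get("content")
    if not isinstance(content, list):
        return msg
    kept = [b for b in content if b.get("type") != "tool_result"]
    # Leave a stub so the message is never empty.
    return {**msg, "content": kept or [{"type": "text", "text": "[pruned]"}]}

def strip_old_tool_results(request: dict) -> dict:
    """Illustrative plugin: prune tool results older than the last N turns."""
    keep = 10  # hypothetical window size
    msgs = request["messages"]
    cutoff = len(msgs) - keep
    request["messages"] = [
        m if i >= cutoff else _without_tool_results(m)
        for i, m in enumerate(msgs)
    ]
    return request

pipeline = Pipeline([strip_old_tool_results])
```

A proxy built this way would call `pipeline.process(request)` on each incoming payload and forward the result (or return early when a plugin blocks it), which is how stale tool output could be dropped without the agent ever noticing.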
