
Why Browser Agents Waste 99% of Their Tokens (And How to Fix It)
Every browser agent pays a hidden tax: tokens. When an agent visits a webpage, it dumps the DOM into an LLM. The LLM reads thousands of elements, reasons about which button to click, and generates a tool call. Then it does it again. And again.

For a 10-step workflow, that's 25+ LLM round trips. Context grows with each step because conversation history accumulates. By step 10, you're sending 175,000 tokens per action. At frontier model pricing, that's roughly $4 for one workflow execution. Run it 1,000 times a day and you're burning $4,000 daily, all on clicking buttons.

The compounding problem

The issue isn't that LLMs are expensive. It's that agent architectures make them more expensive with every step, because each call re-sends everything that came before it:

Step 1: Inspect DOM (4,000 tokens) → Reason → Act
Step 2: Inspect DOM + step 1 context (6,000 tokens) → Reason → Act
Step 3: Inspect DOM + steps 1-2 context (8,000 tokens) → Reason → Act

By step 10, your context window is carrying the entire conversation history. Each action
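
The growth pattern in the three steps listed above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: one LLM call per step, a 4,000-token first step, and a fixed 2,000-token increase per step, which is the pattern the listed steps follow. The function names and the `price_per_million` parameter are hypothetical and chosen for illustration. The sketch does not attempt to reproduce the 175,000-token or $4 figures quoted earlier, which depend on details not shown here.

```python
def context_tokens(step: int, base: int = 4_000, per_step: int = 2_000) -> int:
    """Tokens sent on a given step (1-indexed), assuming linear history growth."""
    return base + per_step * (step - 1)


def cumulative_tokens(steps: int, base: int = 4_000, per_step: int = 2_000) -> int:
    """Total input tokens across all steps, assuming one LLM call per step."""
    return sum(context_tokens(s, base, per_step) for s in range(1, steps + 1))


def workflow_cost(steps: int, price_per_million: float) -> float:
    """Input-token cost of one workflow run; the price is a caller-supplied assumption."""
    return cumulative_tokens(steps) / 1_000_000 * price_per_million


if __name__ == "__main__":
    for step in (1, 2, 3, 10):
        print(f"step {step}: {context_tokens(step):,} tokens")
    print(f"total over 10 steps: {cumulative_tokens(10):,} tokens")
```

Even in this simplified model, the per-step context grows linearly while the total billed across the workflow grows quadratically with the number of steps, since every step pays again for the history of all earlier ones.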
Continue reading on Dev.to Webdev



