HTML vs Markdown vs SOM: Which Format Should Your AI Agent Use?

Every AI agent that browses the web faces the same question: how do you represent a web page to a language model? The default answer, raw HTML, is expensive and slow. A typical page dumps 30,000+ tokens into your context window, most of it CSS classes and layout divs.

But what are the actual alternatives? And do they work? We ran WebTaskBench, 100 tasks across GPT-4o and Claude Sonnet 4, to find out. The results surprised us.

The Three Representations

When an agent needs to understand a web page, there are three common approaches:

1. Raw HTML

The DOM as-is. Every <div>, every class="sc-1234 flex items-center gap-2", every inline script. This is what most agents send today.

    <div class="sc-1234 flex items-center gap-2 px-4 py-2">
      <a href="/about" class="text-blue-500 hover:underline font-medium tracking-tight text-sm">About</a>
      <span class="text-gray-400">|</span>
      <a href="/pricing" class="text-blue-500 hover:underline font-medium tracking-tight text-sm">Pricing</a>
    </d
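To get a rough feel for how much of the navigation snippet above is styling overhead, here is a minimal Python sketch that keeps only the visible text and link targets and compares character counts. It is an illustration, not the method used in WebTaskBench: the `TextAndLinks` class and `compact` helper are hypothetical names, the closing `</div>` is assumed because the excerpt is cut off, and character counts are only a loose proxy for tokens since no real tokenizer is involved.

```python
from html.parser import HTMLParser


class TextAndLinks(HTMLParser):
    """Hypothetical helper: collect visible text and link targets,
    dropping class attributes and every other piece of markup."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._href = None  # href of the <a> we are currently inside, if any

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")

    def handle_endtag(self, tag):
        if tag == "a":
            self._href = None

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        if self._href:
            # Emit links in a Markdown-style form: [label](target)
            self.parts.append(f"[{text}]({self._href})")
        else:
            self.parts.append(text)


def compact(html: str) -> str:
    """Return a one-line, text-and-links view of an HTML fragment."""
    parser = TextAndLinks()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


# The nav fragment from the article; the closing </div> is an assumption,
# because the source excerpt is truncated at that point.
SNIPPET = (
    '<div class="sc-1234 flex items-center gap-2 px-4 py-2">'
    '<a href="/about" class="text-blue-500 hover:underline font-medium '
    'tracking-tight text-sm">About</a>'
    '<span class="text-gray-400">|</span>'
    '<a href="/pricing" class="text-blue-500 hover:underline font-medium '
    'tracking-tight text-sm">Pricing</a>'
    "</div>"
)

compact_text = compact(SNIPPET)
ratio = len(compact_text) / len(SNIPPET)  # character ratio, not a token ratio

print(compact_text)
print(f"compact form is {ratio:.0%} of the raw HTML by character count")
```

For this fragment the compact form is `[About](/about) | [Pricing](/pricing)`, a small fraction of the original length. Real pages also carry scripts, inline styles, and nested layout wrappers, so a production converter would need to handle far more than this sketch does.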


