50 Lines of TypeScript to Automate Any Website with AI

Most AI browser automation tools fall into two camps: vision-based (screenshot the page, send it to a model, get click coordinates) or selector-based (CSS/XPath targeting). Both have fundamental problems. Vision is slow and expensive: every action requires a screenshot round-trip to a vision model. Selectors are brittle and meaningless to an LLM: div.container > ul > li:nth-child(3) > a tells the AI nothing about what it's clicking.

There's a third approach: text snapshots with numbered refs. The browser's accessibility tree is already a structured, semantic representation of the page. Give it to the AI as text, let the AI pick a ref, execute the action. No vision, no selectors, deterministic targeting.

Here's a complete AI browser agent using browserclaw and Claude:

    import Anthropic from "@anthropic-ai/sdk";
    import { BrowserClaw } from "browserclaw";

    const anthropic = new Anthropic();

    const SYSTEM = `You are a browser automation agent. You receive a text snapshot of a web pa
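The snapshot-and-ref idea described above can be illustrated without a browser. The following is a minimal sketch, not browserclaw's API: the `AXNode` type, the `snapshot` and `resolve` functions, the `e1`-style ref format, and the set of interactive roles are all assumptions made up for illustration. It renders a tiny hand-written accessibility tree as indented text, tags each interactive node with a numbered ref, and maps a ref chosen by the model back to exactly one node.

```typescript
// Hypothetical shape of an accessibility-tree node (illustration only).
interface AXNode {
  role: string;
  name: string;
  children?: AXNode[];
}

// A text snapshot plus the lookup table from ref to node.
interface Snapshot {
  text: string;
  refs: Map<string, AXNode>;
}

// Assumption: only these roles are worth targeting, so only they get refs.
const INTERACTIVE = new Set(["link", "button", "textbox", "checkbox"]);

// Walk the tree depth-first, emit one indented line per node,
// and assign refs e1, e2, ... to interactive nodes in document order.
function snapshot(root: AXNode): Snapshot {
  const lines: string[] = [];
  const refs = new Map<string, AXNode>();
  let next = 1;

  const walk = (node: AXNode, depth: number): void => {
    let line = `${"  ".repeat(depth)}- ${node.role} "${node.name}"`;
    if (INTERACTIVE.has(node.role)) {
      const ref = `e${next++}`;
      refs.set(ref, node);
      line += ` [ref=${ref}]`;
    }
    lines.push(line);
    for (const child of node.children ?? []) walk(child, depth + 1);
  };

  walk(root, 0);
  return { text: lines.join("\n"), refs };
}

// Resolve a ref picked by the model. An unknown ref fails loudly
// instead of guessing, which keeps targeting deterministic.
function resolve(snap: Snapshot, ref: string): AXNode {
  const node = snap.refs.get(ref);
  if (!node) throw new Error(`Unknown ref: ${ref}`);
  return node;
}

// A stand-in page; a real agent would read this from the browser.
const page: AXNode = {
  role: "main",
  name: "Search",
  children: [
    { role: "heading", name: "Find a package" },
    { role: "textbox", name: "Query" },
    { role: "button", name: "Search" },
    { role: "link", name: "Advanced options" },
  ],
};

const snap = snapshot(page);
console.log(snap.text);

// Pretend the model answered with "e2".
const target = resolve(snap, "e2");
console.log(`${target.role}: ${target.name}`); // prints "button: Search"
```

The text in `snap.text` is what would be sent to the model, and the model's reply only needs to contain a ref such as `e2`. Because the ref table is built in the same pass as the text, a ref either maps to exactly one node or is rejected.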

