
PageAgent: The GUI Agent Living in Your Web Page
Most AI agent frameworks need a server, a headless browser, and a whole automation stack just to click a button on a web page. The page itself has no say in the process.

PageAgent takes a different approach. It's a JavaScript library that runs directly in your page. Add it, and users can give natural language commands: the AI reads the live DOM, understands the UI, and acts. No server, no external process, no automation stack.

This means your web app isn't being automated; it's doing the automating. You control what the AI sees, how it behaves, which LLM powers it. The intelligence lives in your page, not on someone else's server.

⭐ Star PageAgent on GitHub: MIT licensed, open source, 600+ stars.

Zero Infrastructure

For npm projects, the programmatic API is just as clean:

    import { PageAgent } from 'page-agent'

    const agent = new PageAgent({
      model: 'gpt-5.1',
      baseURL: 'https://api.openai.com/v1',
      apiKey: YOUR_KEY,
    })

    await agent.execute('Fill the expense report for l
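To make the "reads the live DOM, understands the UI, and acts" loop more concrete, here is a minimal sketch of the general idea. It is not PageAgent's actual implementation: the function names `describeElements` and `applyAction`, the indexed summary format, and the `{ type, index, text }` action shape are all assumptions made up for illustration. The sketch turns a list of interactive elements into a short text summary an LLM could read, then maps an index-based action chosen by the model back onto the matching element.

```javascript
// Hypothetical sketch, NOT PageAgent's real code or API.

// Build a compact, indexed text summary of interactive elements.
// Each element only needs tagName, textContent and getAttribute,
// which real DOM elements provide.
function describeElements(elements) {
  return elements
    .map((el, i) => {
      const tag = el.tagName.toLowerCase();
      const label = (
        el.getAttribute("aria-label") ||
        el.textContent ||
        el.getAttribute("placeholder") ||
        ""
      ).trim();
      return `[${i}] <${tag}> ${label}`;
    })
    .join("\n");
}

// Apply an action (assumed shape: { type, index, text? }) to the
// element the model selected by index.
function applyAction(elements, action) {
  const el = elements[action.index];
  if (!el) {
    throw new RangeError(`No element at index ${action.index}`);
  }
  if (action.type === "click") {
    el.click();
  } else if (action.type === "type") {
    el.value = action.text;
    // Notify listeners (e.g. framework bindings) when running in a real page.
    if (typeof el.dispatchEvent === "function" && typeof Event !== "undefined") {
      el.dispatchEvent(new Event("input", { bubbles: true }));
    }
  } else {
    throw new Error(`Unsupported action type: ${action.type}`);
  }
}
```

In a browser, the element list could come from something like `Array.from(document.querySelectorAll("button, a[href], input, select, textarea"))`. The summary string would go into the LLM prompt, and the model's reply would be parsed into an action before being passed to `applyAction`. Because the page owns both functions, it decides which elements are exposed and which actions are allowed.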
Continue reading on Dev.to Webdev




