
I built a tool that lets AI agents browse the real internet — and you can watch them do it
AI agents can write code and analyze data, but they can't browse a website, click a button, or fill out a form. They don't have a browser. So I built one.

What is vscreen?

A Rust service that gives AI agents a real Chromium browser and streams it to you live over WebRTC. You see what the AI sees in real time: video, audio, everything. Mouse and keyboard input is relayed back, so you can take over at any time.

AI agents connect via MCP (Model Context Protocol) with 63 automation tools: navigate, screenshot, click, type, find elements, wait for page loads, dismiss cookie banners, solve CAPTCHAs, manage cookies, and more.

Spin up multiple isolated instances: different agents working different tasks in parallel, with lease-based locking so they don't step on each other.

The tech

Written in Rust from scratch. Not a Puppeteer wrapper. A purpose-built media pipeline: tokio, axum, webrtc-rs, openh264/libvpx, Opus audio.

unsafe_code = "forbid" across the workspace
unwrap() denied, panic
Continue reading on Dev.to


