
Urban Recon: Breaking through the Grid with Local Vision AI
Exploring the architecture of a text-free geolocation engine powered by OpenStreetMap and LM Studio. The Concept Standard Geoguessing is easy for humans—look for a flag, a language, or a unique license plate. But what if all those labels were stripped away? What if all you had was the urban blueprint —the skeletal structure of a city’s streets, plazas, and canals? Urban Recon is an experiment in "pure" urban recognition. It challenges a local vision-capable AI model to identify famous cities based on anonymized map data. 🏗️ Technical Architecture The application follows a "Zero-Trust Visual" architecture. We don't send coordinates to the agent; we send a "mission blueprint." Anonymized Map Engine : Using Leaflet.js , we pull specific CartoDB Voyager No Labels tiles. This ensures that even the most zoomed-in view contains no text—just pixels of urban form. Visual Capture Pipeline : Because local LLMs (running in LM Studio) can't easily fetch browser URLs, we use html-to-image to capture
Continue reading on Dev.to JavaScript
Opens in a new tab

