
I Built an API That Turns Any URL Into Structured JSON — Here's How
Most web scrapers give you raw HTML. I wanted clean, structured JSON from any URL — no configuration, no selectors, no parsing. Just pass a URL and get organized data back. So I built it. What It Does The Web Content Extractor API takes any URL and returns structured JSON. It automatically detects the content type: Articles → title, author, date, full text, headings Products → name, price, rating, reviews, SKU, images Recipes → ingredients, instructions, prep time, servings Job postings → title, company, salary, location Events → name, date, location, description Any webpage → metadata, content, links, images How It Works Fetch the HTML Auto-detect content type using Open Graph tags, Schema.org markup, and DOM signals Score content blocks (Readability-style algorithm) to find the main content Extract structured data: metadata, headings, images, links, JSON-LD Return clean JSON Quick Example curl "https://george-the-developer--web-content-extractor-api.apify.actor/extract?url=https://te
Continue reading on Dev.to Webdev
Opens in a new tab



