FlareStart
Show HN: Robust LLM Extractor for Websites in TypeScript

via Hacker News · andrew_zhong · 4h ago

We've been building data pipelines that scrape websites and extract structured data for a while now. If you've done this, you know the drill: you write CSS selectors, the site changes its layout, everything breaks at 2am, and you spend your morning rewriting parsers.

LLMs seemed like the obvious fix: just throw the HTML at GPT and ask for JSON. Except in practice, it's more painful than that:

- Raw HTML is full of nav bars, footers, and tracking junk that eats your token budget. A typical product page is 80% noise.
- LLMs return malformed JSON more often than you'd expect, especially with nested arrays and complex schemas. One bad bracket and your pipeline crashes.
- Relative URLs, markdown-escaped links, tracking parameters: the "small" URL issues compound fast when you're processing thousands of pages.
- You end up writing the same boilerplate: HTML cleanup → markdown conversion → LLM call → JSON parsing → error recovery → schema validation. Over and over.

We got tired of rebuildin
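To make the tail end of that boilerplate concrete, here is a minimal sketch of the JSON parsing, error recovery, schema validation, and URL cleanup steps. It is not the API of the library the post announces: every name in it (`normalizeUrl`, `extractJson`, `parseProducts`, the `Product` shape, the tracking-parameter list) is an assumption made for illustration, and the HTML cleanup, markdown conversion, and LLM call are left out. It relies only on the standard `URL` and `JSON` globals.

```typescript
// Hypothetical helpers for illustration only; not the announced library's API.

// Assumed list of tracking parameters to strip (matched case-sensitively).
const TRACKING_PARAMS = ["utm_source", "utm_medium", "utm_campaign", "utm_term", "utm_content", "fbclid", "gclid"];

/** Resolve a possibly relative URL against the page URL and drop tracking parameters. */
function normalizeUrl(raw: string, pageUrl: string): string | null {
  // Undo markdown escapes such as "\_" that an HTML-to-markdown step may leave in links.
  const unescaped = raw.trim().replace(/\\([_*()[\]~])/g, "$1");
  let url: URL;
  try {
    url = new URL(unescaped, pageUrl);
  } catch {
    return null; // unparseable URL: let the caller decide what to do
  }
  for (const param of TRACKING_PARAMS) {
    url.searchParams.delete(param);
  }
  return url.toString();
}

/** Pull a JSON value out of a model reply that may be wrapped in prose or a code fence. */
function extractJson(reply: string): unknown {
  const candidate = reply.trim();
  try {
    return JSON.parse(candidate);
  } catch {
    // Fall through: try the span from the first opening bracket to its last closer.
  }
  const start = candidate.search(/[\[{]/);
  if (start === -1) return undefined;
  const closer = candidate[start] === "{" ? "}" : "]";
  const end = candidate.lastIndexOf(closer);
  if (end <= start) return undefined;
  try {
    return JSON.parse(candidate.slice(start, end + 1));
  } catch {
    return undefined;
  }
}

// Assumed target schema for the example.
interface Product {
  name: string;
  price: number;
  url: string;
}

function isProduct(value: unknown): value is Product {
  if (typeof value !== "object" || value === null) return false;
  const record = value as Record<string, unknown>;
  return typeof record.name === "string" && typeof record.price === "number" && typeof record.url === "string";
}

/** Keep the items that match the schema instead of failing the whole batch. */
function parseProducts(reply: string, pageUrl: string): Product[] {
  const data = extractJson(reply);
  const items: unknown[] = Array.isArray(data) ? data : [];
  const products: Product[] = [];
  for (const item of items) {
    if (!isProduct(item)) continue; // drop invalid items, keep the rest
    const url = normalizeUrl(item.url, pageUrl);
    if (url !== null) products.push({ ...item, url });
  }
  return products;
}
```

The design choice worth noting is that one bad item is dropped rather than treated as a pipeline failure. A reply with a single mistyped field still yields the valid records, and a reply with no JSON at all yields an empty array rather than an exception.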

Continue reading on Hacker News

