Back to articles
I Built My Own Site Crawler

I Built My Own Site Crawler

via Dev.toCarlos Arias

I lost my job last year. So I did what most engineers do when they don’t have a safety net. I started taking on clients. Mostly in the legal space, doing SEO and digital marketing. Very quickly, I noticed something stupid. Every project had the same workflow: Open the client’s website → copy content → paste into GPT → try to “train” it on their business. Over and over again. No structure. No consistency. Just manual work pretending to be “AI-powered.” So I looked for tools that could automate this. I found things like Firecrawl. On paper, it solves the problem. In reality: It gets expensive fast if you're doing this at scale It’s not always reliable; And it’s still not really built for how people actually use LLMs day-to-day Most of these tools feel like they were built for demos, not production workflows. So I built my own crawler. Not a “vibe coded” wrapper. An actual tool designed for one job: Extract clean, structured content from websites so it can be used directly with LLMs. No f

Continue reading on Dev.to

Opens in a new tab

Read Full Article
3 views

Related Articles