
Next.js 14 cron scraping: rate limits + retries
Run a daily scraper on Vercel. Without melting sources. Enforce per-host rate limits in Node. Not "sleep(1000)". Make retries idempotent with Postgres locks. Store failures so I can re-run only the broken ones.

Context

I'm building a job board for Psychiatric Mental Health Nurse Practitioners. 8,000+ active listings. 2,000+ companies. The pipeline scrapes 200+ jobs daily from multiple sources. Some are nice JSON feeds. Most aren't.

My first version was dumb. One cron. One loop. Fetch everything. It worked. Until it didn't. 429s. Random 403s. Timeouts. Worse: half a run would succeed, then retries would duplicate work and waste time.

This post is how I stabilized it. Rate limiting by host. Jitter. Backoff. And a Postgres lock so reruns don't stomp each other.

1) I stopped using "one cron to rule them all"

I used to do this: "Cron hits /api/scrape and that endpoint scrapes everything." Brutal. One slow host makes the whole run slow. And Vercel timeouts become your scheduler. Now I split
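The "per-host rate limits, not sleep(1000)" idea can be sketched like this. This is my illustration, not the post's actual code: the `throttle` helper, the host budgets, and the +50% jitter factor are all assumptions.

```typescript
// Per-host throttle with jitter (hypothetical sketch).
// Tracks the last request time per host so different hosts
// don't share one global delay.
const lastHit = new Map<string, number>();

const sleep = (ms: number) => new Promise<void>((r) => setTimeout(r, ms));

// Block until `host` is allowed another request, then record the hit.
// `baseMs` is the minimum gap between requests to that host; a random
// jitter of up to +50% is added so runs don't hit hosts in lockstep.
async function throttle(host: string, baseMs = 3000): Promise<void> {
  const gap = baseMs + Math.random() * baseMs * 0.5;
  const due = (lastHit.get(host) ?? 0) + gap;
  const wait = due - Date.now();
  if (wait > 0) await sleep(wait);
  lastHit.set(host, Date.now());
}

// Usage inside a scrape loop (job shape assumed):
//   await throttle(new URL(job.url).host, 2000);
//   const res = await fetch(job.url);
```

The point of the per-host map is that a slow or strict source only slows itself down; a fast JSON feed on another host isn't held hostage by it.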
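For the backoff-plus-failure-storage part, a minimal retry wrapper might look like the following. Again a sketch under my own assumptions: the attempt count, delays, and `withRetries` name are invented, and the failure table is only described in the comment.

```typescript
// Bounded retries with exponential backoff and jitter (hypothetical sketch).
const delay = (ms: number) => new Promise<void>((r) => setTimeout(r, ms));

// Retry `fn` up to `attempts` times, doubling the wait each round and
// adding jitter. Rethrows the last error so the caller can record a
// failure row (e.g. url, error, failed_at) and re-run only broken jobs.
async function withRetries<T>(
  fn: () => Promise<T>,
  attempts = 3,
  baseMs = 500,
): Promise<T> {
  let lastErr: unknown;
  for (let i = 0; i < attempts; i++) {
    try {
      return await fn();
    } catch (err) {
      lastErr = err;
      const backoff = baseMs * 2 ** i + Math.random() * baseMs;
      await delay(backoff);
    }
  }
  throw lastErr;
}
```

Persisting the final failure instead of swallowing it is what makes "re-run only the broken ones" possible: the rerun queries the failure table rather than scraping everything again.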
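One way to get the "Postgres lock so reruns don't stomp each other" behavior is a per-job advisory lock. The sketch below is my guess at the shape, not the post's implementation: the `Db` interface stands in for a real `pg` client, and `withJobLock`/`lockKey` are invented names. Note that `pg_try_advisory_lock` is session-scoped, so with a connection pool the lock, the work, and the unlock must all run on the same client.

```typescript
import { createHash } from "node:crypto";

// Minimal query interface standing in for a `pg` client (assumption).
type Db = {
  query(sql: string, params: unknown[]): Promise<{ rows: Array<Record<string, unknown>> }>;
};

// Derive a stable signed 64-bit advisory-lock key from the job URL.
function lockKey(url: string): bigint {
  return createHash("sha256").update(url).digest().readBigInt64BE(0);
}

// Run `work` only if no other process holds this job's lock.
// pg_try_advisory_lock returns false immediately instead of blocking,
// so a rerun skips jobs that a live run is already processing.
async function withJobLock(
  db: Db,
  url: string,
  work: () => Promise<void>,
): Promise<boolean> {
  const key = lockKey(url).toString();
  const { rows } = await db.query("SELECT pg_try_advisory_lock($1) AS locked", [key]);
  if (!rows[0].locked) return false; // another run owns this job; skip it
  try {
    await work();
    return true;
  } finally {
    await db.query("SELECT pg_advisory_unlock($1)", [key]);
  }
}
```

Because the lock attempt is try-not-wait, an overlapping rerun degrades to a cheap no-op per contested job instead of duplicating scrapes.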
Continue reading on Dev.to Webdev



