
How I Fingerprint My Own API to Catch Scrapers
TL;DR: Once you've stripped fingerprints from your data sources (Part 7), flip the script. Add your own watermarks so you can trace leaks back to specific customers. Coordinate jitter, price bucket skew, phantom records, and invisible text markers. All deterministic, all traceable, all invisible to users. In Part 7 , I discussed how to remove inbound fingerprints from your API responses. This includes things such as coordinates, addresses, pricing, etc. This was defense. This is offense. Now that you have paying customers, each with a unique API key, you can add a watermark to each API response that will allow you to track who is using your information. Want to know who's selling your data on a competitor's site after six months? Well, you'll know. These same techniques that catch plagiarizers, these same techniques that Google Maps uses to catch copycats, these same techniques that encyclopedias use to catch thieves. Here's some ideas on how you can implement them, and one important n
Continue reading on Dev.to Python
Opens in a new tab



