I ran 765 controlled experiments to prove AI agents are leaking your data — and built the tool that catches it


via Dev.to (Dre)

Every AI agent that can read private data, fetch external content, and send outbound messages is one injected instruction away from exfiltrating everything it knows. This isn't theoretical. Here's the attack in three tool calls:

Turn 0: readPrivateData() → 5 customer records loaded (SSNs, emails, phones); fetchExternalContent(url) → attacker's webpage, payload embedded in the HTML
Turn 1: sendOutboundReport() → all PII sent to the attacker's address
Turn 2: "Report sent successfully!"

Total time: ~12 seconds. Cost: $0.001. No exploits. No credentials. Just a fetched webpage and a compliant model.

We measured it. Rigorously. 30 injection payloads across 6 categories:

- direct injection
- encoded/obfuscated (Base64, ROT13, hex, Unicode)
- social engineering (CEO fraud, IT impersonation, legal threats)
- multi-turn (persistent rules, delayed triggers, context poisoning)
- multilingual (Spanish, Mandarin, Arabic, Russian)
- advanced techniques

Tested against three major LLM providers. N=285 total runs
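To make the encoded/obfuscated category concrete, here is a minimal sketch of the kind of check a detection tool might run before fetched content reaches the model: normalize common encodings (Base64, ROT13, hex) and scan the results for directive-like phrases. The pattern list and function names are illustrative assumptions, not the actual tool described in the article.

```python
import base64
import codecs
import re

# Illustrative phrase patterns; a real detector would use a far richer set.
INJECTION_PATTERNS = [
    r"ignore (?:all |any |previous )+instructions",
    r"exfiltrate",
    r"send .* to",
    r"system prompt",
]


def candidate_decodings(text: str) -> list[str]:
    """Return the raw text plus best-effort decodings of embedded encodings."""
    out = [text]
    # Base64: try any long base64-looking token embedded in the content.
    for tok in re.findall(r"[A-Za-z0-9+/=]{16,}", text):
        try:
            out.append(base64.b64decode(tok, validate=True).decode("utf-8"))
        except Exception:
            pass  # not valid base64 / not valid UTF-8; ignore
    # ROT13: cheap enough to try on the whole text.
    out.append(codecs.decode(text, "rot_13"))
    # Hex: try long runs of hex digit pairs.
    for tok in re.findall(r"(?:[0-9a-fA-F]{2}){8,}", text):
        try:
            out.append(bytes.fromhex(tok).decode("utf-8"))
        except Exception:
            pass
    return out


def flag_injection(fetched_content: str) -> bool:
    """True if any decoding of the fetched content matches an injection pattern."""
    for variant in candidate_decodings(fetched_content):
        lowered = variant.lower()
        if any(re.search(p, lowered) for p in INJECTION_PATTERNS):
            return True
    return False
```

The design choice worth noting: scanning only the raw text misses the encoded payloads entirely, which is precisely why the obfuscated category exists as a distinct attack class.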

Continue reading on Dev.to
