I Poisoned My Own MCP Server in 5 Minutes. Here's How.

Last week I set up a simple MCP server for file operations. Then I wondered: what happens if I put instructions in the tool description that the LLM isn't supposed to follow? Turns out, it follows them. Every time. This post walks through three attacks I ran against my own AI agent. All of them worked. No exploits, no buffer overflows — just text in the wrong place. Setup: a normal MCP server Here's a minimal MCP server that reads files. Nothing unusual. # server.py — a "safe" file reader from mcp.server.fastmcp import FastMCP mcp = FastMCP ( " file-reader " ) @mcp.tool () def read_file ( path : str ) -> str : """ Read a file and return its contents. """ with open ( path ) as f : return f . read () if __name__ == " __main__ " : mcp . run () You register it in Claude Desktop or Cursor, approve the tool, and start using it. The tool description says "Read a file and return its contents." That's what the LLM sees. Here's the thing: the LLM trusts that description completely. It's part of

I Poisoned My Own MCP Server in 5 Minutes. Here's How.

Related Articles

Switzerland — Best Crypto Exchange (2026)

The Difference between `let`, `var` and `const`

Circulation Metrics Framework for Living Systems

Red Rooms makes online poker as thrilling as its serial killer

Don’t Know What Project to Build? Here Are Developer Projects That Actually Make You Better

Related Articles

How-To
Switzerland — Best Crypto Exchange (2026)
Dev.to Beginners • 3h ago

How-To
The Difference between `let`, `var` and `const`
Medium Programming • 12h ago

How-To
Circulation Metrics Framework for Living Systems
Medium Programming • 14h ago

How-To
Red Rooms makes online poker as thrilling as its serial killer
The Verge • 17h ago

How-To
Don’t Know What Project to Build? Here Are Developer Projects That Actually Make You Better
Medium Programming • 18h ago