Designing an AI approval system: when should your agent ask for permission?

An AI agent that can only read data is safe but useless. An AI agent that can send emails, delete files, format disks, and configure VPNs is useful but terrifying. The entire value of a personal AI operating system comes from the agent acting on your behalf, and the entire risk comes from the same thing.

We needed a system that says "yes" fast to everyday operations and "are you sure?" to dangerous ones. Not a blanket confirmation on everything (that just trains the user to click "approve" without reading). Not unrestricted access either (one prompt injection away from rm -rf /).

This post is about the 4-level approval system we built, the dual-layer architecture (shell + API), and the surprisingly difficult design decision of where to draw the line between "just do it" and "ask me first."

The two attack surfaces

An AI agent in our system can cause damage in two completely different ways:

Shell execution. The agent uses the runtime's exec tool to run commands on the host machine. Thi
