FlareStart
Show HN: Open-source playground to red-team AI agents with exploits published

via Hacker News • zachdotai • 3h ago

We build runtime security for AI agents. The playground started as an internal tool for testing our own guardrails. But we kept finding the same types of vulnerabilities, because we think about attacks a certain way. At some point you need people who don't think like you. So we open-sourced it.

Each challenge is a live agent with real tools and a published system prompt. When a challenge ends, the full winning conversation transcript and the guardrail logs are published.

Building the general-purpose agent itself was probably the most fun part. Getting it to reliably use tools, stay in character, and follow instructions while still being useful is harder than it sounds. That alone reminded us how early we all are in understanding and deploying these systems at scale.

The first challenge was to get an agent to call a tool it had been told never to call. Someone got through in around 60 seconds without ever asking for the secret directly (which taught us a lot). Next challe
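To make the "tool it's been told to never call" setup concrete, here is a minimal sketch of one way a runtime guardrail could enforce such a rule in code instead of relying on the system prompt alone. This is not the playground's implementation. The names `ToolCall`, `GuardedToolbox`, `search_docs`, and `reveal_secret` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, FrozenSet, List


@dataclass
class ToolCall:
    """A tool invocation proposed by the model (hypothetical shape)."""
    name: str
    arguments: Dict[str, Any]


@dataclass
class GuardedToolbox:
    """Executes tool calls, refusing deny-listed tools and logging every decision."""
    tools: Dict[str, Callable[..., Any]]
    forbidden: FrozenSet[str]
    log: List[str] = field(default_factory=list)

    def execute(self, call: ToolCall) -> Any:
        # The check runs outside the model, so a persuasive prompt cannot skip it.
        if call.name in self.forbidden:
            self.log.append(f"BLOCKED {call.name}")
            raise PermissionError(f"tool '{call.name}' is forbidden")
        if call.name not in self.tools:
            self.log.append(f"UNKNOWN {call.name}")
            raise KeyError(call.name)
        self.log.append(f"ALLOWED {call.name}")
        return self.tools[call.name](**call.arguments)


# Assumed example tools: one harmless, one the agent must never call.
toolbox = GuardedToolbox(
    tools={
        "search_docs": lambda query: f"results for {query}",
        "reveal_secret": lambda: "placeholder-secret",
    },
    forbidden=frozenset({"reveal_secret"}),
)
```

A deny-list like this only covers the direct call. An attacker who never asks for the secret directly may route around it through an allowed tool, which is why published transcripts and guardrail logs are useful for finding the gaps.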

