
Will it roast? We tested Kilo Code Reviewer's Roast Mode on 5 Levels of Terrible Code
Kilo Code's Code Reviews now has a Roast Mode. Instead of polite suggestions, it reviews your PRs with brutal honesty. We've previously tested Code Reviews for accuracy with both free and frontier models. This time, we wanted to see how far the roasting goes. We built a clean bookstore API, created 5 PRs with escalating code crimes, and ran Roast Mode with Claude Opus 4.6 on each one.

TL;DR: Roast Mode found 39 real issues across 5 PRs with zero false positives. It escalated from "address before merge" to "do NOT merge" as the code got worse, and every finding included a fix recommendation alongside the roast.

The Setup

We built a bookstore API using TypeScript with Hono, Prisma, and SQLite. The base code was clean: proper types, Zod validation, JWT auth middleware, consistent patterns across all routes.

From that base, we created 5 PRs. Each one added a real feature with increasingly terrible code. All 5 PRs were reviewed with Roast Mode enabled, all focus areas on, using Claude
Continue reading on Dev.to




