![[AutoBe] We Built an AI That Writes Full Backend Apps — Then Broke Its 100% Success Rate on Purpose with Weak Local LLMs](/_next/image?url=https%3A%2F%2Fmedia2.dev.to%2Fdynamic%2Fimage%2Fwidth%3D800%252Cheight%3D%252Cfit%3Dscale-down%252Cgravity%3Dauto%252Cformat%3Dauto%2Fhttps%253A%252F%252Fdev-to-uploads.s3.amazonaws.com%252Fuploads%252Farticles%252Fttv46fap8j4z8wt0nr6l.png&w=1200&q=75)
[AutoBe] We Built an AI That Writes Full Backend Apps — Then Broke Its 100% Success Rate on Purpose with Weak Local LLMs
TL;DR Github Repository: https://github.com/wrtnlabs/autobe Generated Examples: https://github.com/wrtnlabs/autobe-examples AutoBe is an open-source AI agent that generates complete backend applications (TypeScript + NestJS + Prisma) from natural language. We adopted Korean SI methodology (no code reuse) and hit 100% compilation + near-100% runtime success Real-world use exposed it as unmaintainable, so we rebuilt everything around modular code generation Success rate cratered to 40% — we clawed it back by: RAG optimization for context management Stress-testing with weak local LLMs (30B, 80B) to discover edge cases Killing the system prompt — replacing prose instructions with strict function calling schemas and validation feedback A 6.75% raw function calling success rate becomes 100% through validation feedback alone With GLM v5 (local LLM), we're back to 100% compilation success AutoBe is no longer a one-shot prototype builder — it now supports incremental feature addition, removal,
Continue reading on Dev.to
Opens in a new tab



