I Used Three AI Agents on a Real Project. Here's What Each One Is Actually Good At.

The Setup I've been building k3d-manager — a shell CLI that stands up a full local Kubernetes stack: Vault, ESO, OpenLDAP, Istio, Jenkins, ArgoCD, Keycloak. The kind of thing that takes a week to wire up manually. I wanted it done in one command. At some point the project got complex enough that I stopped being able to hold it all in my head at once. So I brought in three agents: Claude handles planning and code review. Codex writes and modifies code. Gemini runs commands on the live cluster and verifies things actually work. That's been the theory for about three months. Here's what I've actually observed. Each Agent Has a Real Strength Profile This is the part most AI workflow articles skip. They talk about what agents can do. I want to talk about what each one is reliably good at versus where they consistently break down. Codex is a strong implementer. Give it a well-specified task — "add this function," "change these three lines," "apply this YAML fix" — and it does it cleanly. It

I Used Three AI Agents on a Real Project. Here's What Each One Is Actually Good At.

Related Articles

Learn Something Old Every Day, Part XVIII: How Does FPU Detection Work?

“Learn to Code” Is Dead… Learn to Think Instead

How One File Makes Claude Code Actually Follow Your Instructions

LeetCode Solution: 121. Best Time to Buy and Sell Stock

The Feature Took 2 Hours to Build — and 2 Weeks to Fix

Related Articles

How-To
Learn Something Old Every Day, Part XVIII: How Does FPU Detection Work?
Lobsters • 4d ago

How-To
“Learn to Code” Is Dead… Learn to Think Instead
Medium Programming • 4d ago

How-To
How One File Makes Claude Code Actually Follow Your Instructions
Medium Programming • 4d ago

How-To
LeetCode Solution: 121. Best Time to Buy and Sell Stock
Dev.to Tutorial • 4d ago

How-To
The Feature Took 2 Hours to Build — and 2 Weeks to Fix
Medium Programming • 4d ago