
How I reduced AI coding costs by 94% — and built a CLI to do it automatically
Every time you ask Claude or ChatGPT about your code, it reads everything. Your entire codebase. Every single query. On a 44-file Python project that's 41,160 tokens per query. At GPT-4o prices ($2.50/1M tokens) — $0.10 every time you ask a question. At 50 queries a day,that's $147/month. Per developer.I got tired of this and built https://getkodara.dev . What it does Kodara scans your repo once, builds a dependency graph and architectural memory, then returns only the 2–8 files actually relevant to your question. pip install kodara cd your-project kodara init kodara ask "How does authentication work?" Output: ## auth/middleware.py Defines AuthMiddleware. Exports: verify_token, require_auth. Depends on: jwt_service.py, models/user.py ## auth/jwt_service.py Defines JWTService. Exports: encode, decode, refresh. [3/44 modules · 1,840 tokens · 94% reduction] Paste that into Claude, ChatGPT, Cursor — whatever you use. Your AI now has surgical context instead of reading everything blindly. R
Continue reading on Dev.to Python
Opens in a new tab



