I built a 250-tool AI API on a Raspberry Pi 5 — architecture, economics, and what I learned

Six months ago I started building an AI tool API in my apartment. The idea: I was paying $60/month across OpenAI, Anthropic, and various scraping tools, and using maybe 2% of what I paid for. Why isn't there a pay-per-call option?

So I built one. AiPayGen is a single API with 250+ pre-built tools and 15 AI models from 7 providers. You call /research, /scrape_website, /sentiment, or /translate rather than raw model completions. Pay per call, starting at $0.004.

The Architecture

    Client -> Cloudflare Tunnel -> Gunicorn (2 workers, 4 threads)
      -> Flask app -> Model Router -> [Anthropic|OpenAI|Google|DeepSeek|xAI|Together]
      -> SQLite (WAL mode) for everything: auth, billing, memory, job queue

Why SQLite? One file, zero config, and WAL mode lets readers keep going while a write is in progress. At my scale (< 1000 req/day), it can beat a separate Postgres server simply because every query is an in-process call with no network round trip. The entire billing system is atomic deductions in SQLite.

Why Flask? I know it. Shipping speed > perfect architecture for a solo project.

The Model Router
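To make the "atomic deductions in SQLite" point from the architecture section concrete, here is a minimal sketch of what a pay-per-call charge could look like. It is not the AiPayGen code: the accounts table, its column names, the micro-dollar integer units, and the open_db and charge helpers are all assumptions made for illustration. The one idea it shows is that a single conditional UPDATE both checks the balance and deducts from it, so two concurrent requests cannot overspend the same account.

```python
import sqlite3


def open_db(path):
    """Open a SQLite database in WAL mode (hypothetical schema, for illustration)."""
    # isolation_level=None puts the connection in autocommit mode,
    # so each statement below commits on its own.
    conn = sqlite3.connect(path, isolation_level=None)
    conn.execute("PRAGMA journal_mode=WAL")
    conn.execute(
        "CREATE TABLE IF NOT EXISTS accounts ("
        " api_key TEXT PRIMARY KEY,"
        " balance_microusd INTEGER NOT NULL)"
    )
    return conn


def charge(conn, api_key, cost_microusd):
    """Deduct cost_microusd if the balance covers it; return True on success.

    The balance check lives in the WHERE clause, so check-and-deduct
    happens in one statement instead of a separate SELECT then UPDATE.
    """
    cur = conn.execute(
        "UPDATE accounts"
        " SET balance_microusd = balance_microusd - ?"
        " WHERE api_key = ? AND balance_microusd >= ?",
        (cost_microusd, api_key, cost_microusd),
    )
    # rowcount is 1 when the row matched and was updated, 0 otherwise.
    return cur.rowcount == 1


if __name__ == "__main__":
    conn = open_db(":memory:")
    # Assumed example account holding $0.01, stored as integer micro-dollars.
    conn.execute("INSERT INTO accounts VALUES (?, ?)", ("demo-key", 10_000))
    # Each call costs $0.004 (4,000 micro-dollars), the starting price above.
    for _ in range(3):
        print(charge(conn, "demo-key", 4_000))
```

Storing money as integers avoids floating-point drift on small amounts like $0.004. In a Flask handler, a False return from charge would typically translate into a payment-required style error before any model call is made.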
