Save money on AI using those permanent free LLM APIs

Those LLM APIs offer permanent free tiers for text inference (no trial or initial credits, permanent tier only). Contents Provider APIs Inference providers Provider APIs APIs run by the companies that train or fine-tune the models themselves. Cohere 🇺🇸 - Command A, Command R+, Aya Expanse 32B +9 more. 20 RPM, 1K/mo. Google Gemini 🇺🇸 - Gemini 2.5 Pro, Flash, Flash-Lite +4 more. 5-15 RPM, 100-1K RPD. Mistral AI 🇪🇺 - Mistral Large 3, Small 3.1, Ministral 8B +3 more. 1 req/s, 1B tok/mo. Zhipu AI 🇨🇳 - GLM-4.7-Flash, GLM-4.5-Flash, GLM-4.6V-Flash. Limits undocumented. Inference providers Third-party platforms that host open-weight models from various sources. Cerebras 🇺🇸 - Llama 3.3 70B, Qwen3 235B, GPT-OSS-120B +3 more. 30 RPM, 14,400 RPD. Cloudflare Workers AI 🇺🇸 - Llama 3.3 70B, Qwen QwQ 32B +47 more. 10K neurons/day. GitHub Models 🇺🇸 - GPT-4o, Llama 3.3 70B, DeepSeek-R1 +more. 10-15 RPM, 50-150 RPD. Groq 🇺🇸 - Llama 3.3 70B, Llama 4 Scout, Kimi K2 +17 more. 30 RPM, 1K RPD (14,400 for Llam

Save money on AI using those permanent free LLM APIs

Related Articles

Mark Zuckerberg texted Elon Musk to offer help with DOGE

When All You Can Do Is All or Nothing, Do Nothing

“# Epilogue of the Five Nations Chronicle (Part 7)

How Programming Paradigms Are Born

Tech Companies Are Quietly Becoming Banks — And No One’s Talking About It

Related Articles

News
Mark Zuckerberg texted Elon Musk to offer help with DOGE
TechCrunch • 1h ago

News
When All You Can Do Is All or Nothing, Do Nothing
Lobsters • 1h ago

News
“# Epilogue of the Five Nations Chronicle (Part 7)
Medium Programming • 2h ago

News
How Programming Paradigms Are Born
Medium Programming • 2h ago

News
Tech Companies Are Quietly Becoming Banks — And No One’s Talking About It
Medium Programming • 3h ago