Why I Switched From GPT-4 to Small Language Models for Two of My Products

GPT-4 and Claude Sonnet are not always the right model for the job. After 18 months of running AI products in production, I've moved two of my products from frontier models to small language models — and the results have been better latency, lower cost, and in one case, higher accuracy on the specific task. Here is exactly what I did and why. Background: The Two Products That Changed Product 1: AgriIntel — Crop recommendation classification AgriIntel uses AI to classify incoming sensor data events and route them to the appropriate recommendation workflow. The classification task is: Given a set of sensor readings (soil moisture, temperature, nutrient levels, weather forecast), classify what type of agronomic decision is needed: Irrigation Fertilization Pest management Harvest timing No action This is a classification task with a fixed taxonomy. GPT-4o was doing it well — but at $0.005 per classification , at 15,000+ classifications per day , the cost was significant. Latency was also 8

Why I Switched From GPT-4 to Small Language Models for Two of My Products

Related Articles

Read Receipts: An iMessage Simulator

Why 60,000 Repos Adopted AGENTS.md

Intel and LG Display may have beaten Apple and Qualcomm with the best laptop battery life ever

FiberBills: A Complete Billing & Collection System for ISPs and Subscription Businesses

Prompting as Probabilistic Programming

Related Articles

News
Read Receipts: An iMessage Simulator
Lobsters • 1h ago

News
Why 60,000 Repos Adopted AGENTS.md
Medium Programming • 1h ago

News
Intel and LG Display may have beaten Apple and Qualcomm with the best laptop battery life ever
The Verge • 2h ago

News
FiberBills: A Complete Billing & Collection System for ISPs and Subscription Businesses
Medium Programming • 3h ago

News
Prompting as Probabilistic Programming
Medium Programming • 3h ago