
# LLM Context Windows: Managing Tokens in Production AI Apps
## The Token Budget Problem

Claude (claude-sonnet-4-6) has a 200k-token context window. GPT-4o has 128k. These sound enormous until you're building a RAG application that needs to pass document context, conversation history, system prompts, and tool definitions simultaneously. Running out of context window mid-conversation is an unrecoverable failure. Managing it is an engineering discipline.

## Counting Tokens

```typescript
import Anthropic from '@anthropic-ai/sdk';
import { encoding_for_model, type TiktokenModel } from 'tiktoken'; // for OpenAI

// Anthropic: use the API's token counting endpoint
const anthropic = new Anthropic();

async function countTokens(messages: Anthropic.MessageParam[]) {
  const response = await anthropic.messages.countTokens({
    model: 'claude-sonnet-4-6',
    messages,
    system: 'You are a helpful assistant.',
  });
  return response.input_tokens;
}

// OpenAI: use tiktoken locally (no API call needed)
function countOpenAITokens(text: string, model = 'gpt-4o'): number {
  const encoder = encoding_for_model(model as TiktokenModel);
  const count = encoder.encode(text).length;
  encoder.free(); // tiktoken encoders are WASM-backed and must be freed
  return count;
}
```
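Once you can count tokens, the next step is enforcing a budget. A minimal sketch, assuming a hypothetical `trimToBudget` helper (not from the article): walk the conversation history newest-to-oldest and keep only the turns that fit. The `estimateTokens` heuristic here (roughly 4 characters per token) stands in for a real tokenizer call.

```typescript
// Hypothetical message shape for illustration.
type Message = { role: 'user' | 'assistant'; content: string };

// Crude chars/4 approximation -- a stand-in, not tiktoken.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

// Keep the newest turns that fit within `budget` tokens, dropping
// the oldest first. Pass a real counter (tiktoken, the Anthropic
// countTokens endpoint) in place of the default heuristic.
function trimToBudget(
  history: Message[],
  budget: number,
  count: (t: string) => number = estimateTokens,
): Message[] {
  const kept: Message[] = [];
  let used = 0;
  for (let i = history.length - 1; i >= 0; i--) {
    const cost = count(history[i].content);
    if (used + cost > budget) break; // next-oldest turn no longer fits
    kept.unshift(history[i]);
    used += cost;
  }
  return kept;
}
```

Dropping whole turns (rather than truncating mid-message) keeps each remaining turn coherent, at the cost of a coarser budget fit.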



