FlareStart
Zero-Downtime Embedding Migration: Switching from text-embedding-004 to text-embedding-3-large in Production

via Dev.to Python · Humza Tareen · 1mo ago

Our embedding model got deprecated overnight. Every RAG query started returning 404s. Here's the exact playbook we used to migrate to a new model in 48 hours with zero downtime.

The Situation

  • Service: RAG retrieval service using pgvector on PostgreSQL
  • Old model: text-embedding-004 (deprecated)
  • New model: text-embedding-3-large (768 dimensions)
  • Data volume: Thousands of embedded documents
  • Constraint: Zero downtime, zero data loss, production traffic must keep flowing

Step 1: Make the Model Configurable

Before anything else, stop hardcoding:

    # Before (hardcoded in 6 places)
    response = openai.embeddings.create(
        model="text-embedding-004",
        input=text,
    )

    # After (configured once)
    EMBED_MODEL = os.getenv("EMBED_MODEL", "text-embedding-3-large")
    EMBED_DIMENSIONS = int(os.getenv("EMBED_DIMENSIONS", "768"))

    response = openai.embeddings.create(
        model=EMBED_MODEL,
        input=text,
        dimensions=EMBED_DIMENSIONS,
    )

Two environment variables. This is what ma
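One way to keep the six call sites in Step 1 from drifting apart is to read the two environment variables in a single helper and hand every caller the same request arguments. The sketch below is an illustration, not the author's code: `EmbedConfig`, `load_embed_config`, and `embedding_request` are hypothetical names, and the defaults are the ones shown in the excerpt. It makes no network call, so it can be checked without an API key.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class EmbedConfig:
    """Embedding settings read from the environment (hypothetical helper)."""
    model: str
    dimensions: int


def load_embed_config(env=None) -> EmbedConfig:
    """Read EMBED_MODEL and EMBED_DIMENSIONS, using the excerpt's defaults."""
    env = os.environ if env is None else env
    model = env.get("EMBED_MODEL", "text-embedding-3-large")
    raw_dims = env.get("EMBED_DIMENSIONS", "768")
    try:
        dimensions = int(raw_dims)
    except ValueError:
        # Fail at startup rather than on the first production request.
        raise ValueError(
            f"EMBED_DIMENSIONS must be an integer, got {raw_dims!r}"
        ) from None
    if dimensions <= 0:
        raise ValueError(f"EMBED_DIMENSIONS must be positive, got {dimensions}")
    return EmbedConfig(model=model, dimensions=dimensions)


def embedding_request(text: str, config: EmbedConfig) -> dict:
    """Build keyword arguments for embeddings.create(); no request is sent."""
    return {
        "model": config.model,
        "input": text,
        "dimensions": config.dimensions,
    }
```

Assuming the same `openai` client as in the excerpt, a call site would then look like `openai.embeddings.create(**embedding_request(text, load_embed_config()))`, so switching models remains an environment change rather than a code change.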

Continue reading on Dev.to Python
