
I Built an AI Audio Dubbing Service Using Claude
"What if I let an AI coding agent build an entire production app from scratch?" β So I tried it. Instead of building something trivial like a to-do list, I wanted to create something actually useful. The idea: upload any audio or video file, pick a target language, and get back a fully dubbed MP3. I called it AgentDub ποΈ How It Works The core is a 3-step AI pipeline: Uploaded File β βΌ β Speech-to-Text ElevenLabs Scribe API β Extracts speech from the uploaded file as text β βΌ β‘ Translation Google Gemini 2.5 Flash β Translates the text into the target language β βΌ β’ Text-to-Speech ElevenLabs TTS (Multilingual v2) β Converts translated text back into natural audio β βΌ Download dubbed MP3 For example: upload a 30-second English voice memo, select Korean, and within a couple of minutes you get a Korean-dubbed MP3 back. Screenshots Dashboard β File Upload & Language Selection Drag and drop your file, pick from 18 languages, hit Generate. The dubbed audio plays directly in the browser. One cl
Continue reading on Dev.to


