
How I Built a Free AI Voice Separator for Podcasts and Interviews
My first Dev.to post about building ToolsOnFire covered the broad overview. This time I want to go deep on one specific tool: the Voice Separator.

The Problem

I kept seeing the same requests in podcasting and journalism communities: "I recorded an interview and need to edit just one speaker's audio" or "I need a transcript that shows who said what." The existing options were either expensive (Descript at $24/month), required desktop software, or didn't actually separate the audio - they just labelled who spoke when.

I wanted to build something that:

1. Identifies each speaker in a recording
2. Creates separate downloadable audio files per speaker
3. Allows users to separate background music into a separate file
4. Produces a timestamped transcript with speaker labels
5. Is free to try without creating an account

The Challenges

Transcript accuracy is never perfect. This was the biggest reality check. No matter which AI model you use, transcripts will have errors - especially with accents, tech




