Back to articles
Building a Real-Time Speech-to-Text Pipeline with Deepgram + Next.js

Building a Real-Time Speech-to-Text Pipeline with Deepgram + Next.js

via Dev.toProRecruit

Building a Real-Time Speech-to-Text Pipeline with Deepgram + Next.js Real-time speech-to-text (STT) converts spoken audio into text as it is being spoken, with latency under 300 milliseconds. Deepgram Nova-2 model offers 98.7% accuracy for English at .0043 per minute - 3x cheaper than AWS Transcribe. Prerequisites Node.js 20+, Next.js 15, Deepgram API key (free tier: 45K minutes) Step 1: Project Setup ash npx create-next-app@latest stt-demo --typescript --tailwind --app cd stt-demo npm install @deepgram/sdk Step 2: Backend WebSocket Route ` ypescript import { createClient, LiveTranscriptionEvents } from "@deepgram/sdk"; const deepgram = createClient(process.env.DEEPGRAM_API_KEY); const connection = deepgram.listen.live({ model: "nova-2", language: "en", smart_format: true, interim_results: true, vad_events: true, endpointing: 300, }); connection.on(LiveTranscriptionEvents.Transcript, (data) => { const transcript = data.channel.alternatives[0]?.transcript; if (transcript) console.log("T

Continue reading on Dev.to

Opens in a new tab

Read Full Article
3 views

Related Articles