Building a Real-Time Speech-to-Text Pipeline with Deepgram + Next.js

Real-time speech-to-text (STT) converts spoken audio into text as it is being spoken, with latency under 300 milliseconds. Deepgram's Nova-2 model offers 98.7% accuracy for English at $0.0043 per minute, 3x cheaper than AWS Transcribe.

Prerequisites

Node.js 20+, Next.js 15, and a Deepgram API key (free tier: 45K minutes).

Step 1: Project Setup

```bash
npx create-next-app@latest stt-demo --typescript --tailwind --app
cd stt-demo
npm install @deepgram/sdk
```

Step 2: Backend WebSocket Route

```typescript
import { createClient, LiveTranscriptionEvents } from "@deepgram/sdk";

// Create the SDK client from the API key stored in the environment.
const deepgram = createClient(process.env.DEEPGRAM_API_KEY);

// Open a live (streaming) transcription connection.
const connection = deepgram.listen.live({
  model: "nova-2",
  language: "en",
  smart_format: true,
  interim_results: true,
  vad_events: true,
  endpointing: 300,
});

// Log each non-empty transcript as results arrive.
connection.on(LiveTranscriptionEvents.Transcript, (data) => {
  const transcript = data.channel.alternatives[0]?.transcript;
  if (transcript) console.log("T
```
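Because the connection in Step 2 sets `interim_results: true`, the `Transcript` handler receives both provisional and finalized results for the same stretch of audio. The following is a minimal sketch of one way to keep the two apart, not the article's own implementation. It assumes each message carries an `is_final` flag alongside `channel.alternatives`. The `TranscriptMessage` interface is a simplified stand-in for the SDK's payload type, and `TranscriptAccumulator` is a hypothetical helper introduced here for illustration.

```typescript
// Assumed, simplified shape of a live transcript message (only the fields used here).
interface TranscriptMessage {
  is_final?: boolean;
  channel: { alternatives: { transcript: string }[] };
}

// Hypothetical helper: stores finalized segments and the latest interim guess separately.
class TranscriptAccumulator {
  private finalized: string[] = [];
  private interim = "";

  handle(data: TranscriptMessage): void {
    const transcript = data.channel.alternatives[0]?.transcript;
    if (!transcript) return; // skip empty results, e.g. during silence

    if (data.is_final) {
      // A finalized segment replaces whatever interim text preceded it.
      this.finalized.push(transcript);
      this.interim = "";
    } else {
      // Each interim result supersedes the previous one for the same audio.
      this.interim = transcript;
    }
  }

  // Finalized text followed by the current interim guess, if any.
  text(): string {
    return [...this.finalized, this.interim].filter(Boolean).join(" ");
  }
}
```

An instance could be fed from the existing handler, for example by calling `accumulator.handle(data)` inside the `LiveTranscriptionEvents.Transcript` callback, so a UI can show `text()` without repeating words as interim results are revised.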
