
What Broke When We Pushed WebSockets From 100k to 1M Users
A post-mortem on OOM kills, GC pauses, and the slow consumers that ate our RAM.

We thought we had a leak. Turns out, we just didn’t know how to turn off the tap.

At 100k users, everything looked perfect. The dashboards were green, latency was flat, and we felt like geniuses. At 1M users, the exact same architecture started killing nodes like clockwork.

We were building a live commentary platform for a massive sports event. The premise was simple: ingest scores, push them to the browser. We tested it. We load-tested it. We thought we were ready.

Then the finals started. The user count ticked past 300k, and latency jittered. By 600k, the alerts weren’t just pinging; they were screaming. By 800k, our nodes turned into zombies — connected, technically “alive,” but totally unresponsive — before being abruptly shot in the head by the Linux Out-of-Memory (OOM) killer.
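The failure mode described here, slow consumers silently accumulating unbounded outbound buffers until the OOM killer intervenes, has a standard mitigation: cap each connection's send queue and evict clients that can't keep up, rather than buffering on their behalf. A minimal sketch of that idea, with all names (`MAX_QUEUE`, `Client`, `Broadcaster`) hypothetical and no real WebSocket library involved:

```python
import asyncio

MAX_QUEUE = 8  # hypothetical per-client cap on pending messages


class Client:
    """Stand-in for one WebSocket connection."""

    def __init__(self, name: str):
        self.name = name
        # Bounded queue: memory per client is now a constant, not a function
        # of how slowly the client reads.
        self.queue: asyncio.Queue = asyncio.Queue(maxsize=MAX_QUEUE)
        self.alive = True


class Broadcaster:
    """Fan out messages without ever blocking on a slow reader."""

    def __init__(self):
        self.clients: list[Client] = []

    def publish(self, msg) -> None:
        for c in self.clients:
            if not c.alive:
                continue
            try:
                # Never await here: the hot path must not stall on one client.
                c.queue.put_nowait(msg)
            except asyncio.QueueFull:
                # Slow consumer: disconnect it instead of buffering forever.
                c.alive = False
        self.clients = [c for c in self.clients if c.alive]
```

The key design choice is that `publish` is non-blocking and the queue bound turns "RAM grows with the slowest reader" into "the slowest reader gets dropped," which is the tap this post-mortem says the team didn't know how to turn off.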
Continue reading on Dev.to




