Back to articles
The 18 Minutes That Tested Me
NewsDevOps

The 18 Minutes That Tested Me

via Dev.toNema Chandra Goswami

Monday. 10:15 AM. Coffee in hand. Life was good. Then my phone buzzed. It was Jaya from Sales. “The client portal is down. Customers are calling. What’s happening?” That one sentence changes the atmosphere in seconds. I opened the production URL. 502 Bad Gateway. Silence. The Situation was... Our platform was running on Amazon EC2, served through Nginx. It had been stable for months. No recent risky deployments. No alerts overnight. So why now? I immediately looped in: My manager, Aman Jaya from Sales Backend & DevOps team Aman asked the question every leader asks: “What’s the impact?” Jaya didn’t sugarcoat it. “Two new companies was onboarded recently.” Pressure level? Maximum. The Investigation... ✔ EC2 instance — Running ✔ CPU — Normal ✔ Memory — Stable ✔ Nginx — Active Everything looked… fine. But production doesn’t lie. We checked application logs. And there it was. Database connection failures... At the same time, Sales had launched a marketing campaign that morning. Traffic spik

Continue reading on Dev.to

Opens in a new tab

Read Full Article
2 views

Related Articles