Back to articles
Final Chapter: Jenkins EFS Problem Solved - From 100% to 0% Throughput Usage
How-ToDevOps

Final Chapter: Jenkins EFS Problem Solved - From 100% to 0% Throughput Usage

via Dev.toYuto Takashi

TL;DR After three articles tracking down a Jenkins EFS performance issue, enabling Shared Library cache reduced throughput usage from 100% spikes to near 0% . This article covers the final results and the complete SRE process from emergency response to permanent fix. Previous Episodes (Quick Recap) This is the final article in a 4-part series: Episode 1 : How Git Temp Files Killed Our Jenkins Performance Problem: Jenkins slowed down, Git clone failures, 504 errors Discovery: EFS metadata IOPS exhaustion Culprit: ~15GB of tmp_pack_* files accumulating Emergency fix: Provisioned throughput 300 MiB/s + cleanup job Episode 2 : How I Spent $69 in 26 Hours (and How to Avoid It) Cost: $69 in 26 hours Lesson: Didn't know about Elastic Throughput (1/20th the cost) Learning: Decision process was sound, but not the optimal solution Episode 3 : How Jenkins Slowly Drained Our EFS Burst Credits Over 2 Weeks Key finding: Symptom appeared on 1/26, but root cause started on 1/13 Multiple factors: Share

Continue reading on Dev.to

Opens in a new tab

Read Full Article
6 views

Related Articles