
What Is Crash Recovery? How Process Managers Keep Your App Online After Failures
What Is Crash Recovery? Your production app crashes. A bug slips through, memory spikes, a network dependency times out and throws an unhandled exception — it doesn't matter why. What matters is what happens next. Crash recovery is the automatic process of detecting that an application has died and restarting it as fast as possible, before your users have time to notice. Without crash recovery, a process that crashes stays dead until a human intervenes. With it, the same crash can be invisible — the process restarts in milliseconds and keeps serving traffic. Crash recovery is one of the core reasons you need a process manager in production — without one, there's nothing watching your app to trigger a restart. How Crash Recovery Works Every operating system gives processes a way to signal their exit. When a process terminates — whether it crashes, runs out of memory, or is killed — it emits an exit event with a status code. A process manager listens for these events: App process exits (
Continue reading on Dev.to
Opens in a new tab




