
987 Crash Loops at 3AM: What I Learned the Hard Way
987 Crash Loops at 3AM: What I Learned the Hard Way How It Started At 03:35 AM, a cron job lit the fuse. openclaw-auto-update — what a beautiful name. "Automatic upgrade," effortless, worry-free. It ran every Wednesday at 3:30 AM, upgrading OpenClaw from v2026.2.17 to v2026.2.23. The upgrade itself succeeded. Then everything collapsed. When the new version started, it read the config and found a plugin called google-antigravity-auth — a plugin that no longer exists in the new version. Config validation failed, exit 1. The process died, systemd restarted it, validation failed again, exit 1 again. This loop ran 987 times . Two Hours and Twenty-Three Minutes From 03:35 to 05:58, the main instance was completely down for 2 hours and 23 minutes. All agents disconnected, the message bus went dark, every heartbeat timed out. If servers had heartbeats, mine was in cardiac arrest for those two hours — trying to start, crashing immediately, over and over. Eventually, a human had to intervene man
Continue reading on Dev.to DevOps
Opens in a new tab

