
The Case for Leaky Locks: Redis TTL as Failure Cooldown for Expensive AI Jobs
I'm probably not the only one who's been told "always release your locks in a finally block." It's one of those conventions we follow without much thought, and for most situations it's completely right. But I recently ran into a case where doing the opposite was actually the better call.

The Problem I Didn't See Coming: How Releasing Locks Cost Me Money

My job queue was simple: user submits a document → AI evaluates it → result gets stored. The issue was that AI calls can fail. Rarely, but they do. Out of nowhere, the model might ignore the expected output format, or a rate limit might kick in. So I'd catch the exception, log it, mark the job as failed, and very responsibly release the lock in the finally block.

Then the user would hit retry. And the AI would fail again. And they'd hit retry again. Each retry triggered another LLM call, and each one cost real money. What I had was essentially a retry storm hitting my API at exactly the moment my system was already struggling.

Using Lock Expi
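To make the "leaky lock" idea concrete, here is a minimal sketch of the pattern as I understand it: acquire a per-job lock with a TTL, release it on success, but deliberately leave it in place on failure so the TTL acts as a retry cooldown. This is an assumption-laden illustration, not the article's actual code: `FakeRedis` is an in-memory stand-in for a real Redis client's `SET key value NX EX ttl` / `DEL`, and `run_job`, `evaluate`, and `FAILURE_COOLDOWN_SECONDS` are hypothetical names.

```python
import time


class FakeRedis:
    """In-memory stand-in for a Redis client (supports SET with NX/EX, and DELETE)."""

    def __init__(self):
        self._store = {}  # key -> (value, expires_at)

    def set(self, key, value, nx=False, ex=None):
        now = time.monotonic()
        entry = self._store.get(key)
        if nx and entry is not None and entry[1] > now:
            return None  # key still alive; NX refuses to overwrite
        expires_at = now + ex if ex is not None else float("inf")
        self._store[key] = (value, expires_at)
        return True

    def delete(self, key):
        self._store.pop(key, None)


FAILURE_COOLDOWN_SECONDS = 60  # hypothetical cooldown window


def run_job(client, job_id, evaluate):
    """Run one expensive job; on failure, leak the lock so its TTL throttles retries."""
    lock_key = f"lock:job:{job_id}"
    if not client.set(lock_key, "1", nx=True, ex=FAILURE_COOLDOWN_SECONDS):
        return "cooling_down"  # a recent attempt failed; reject the retry cheaply
    try:
        result = evaluate(job_id)
    except Exception:
        # Deliberately NOT deleting the lock: the TTL becomes the failure cooldown.
        return "failed"
    client.delete(lock_key)  # success: release immediately so later jobs aren't blocked
    return result
```

The asymmetry is the whole point: a successful run releases the lock right away, but a failed run leaves it to expire on its own, so repeated retry clicks inside the cooldown window are rejected before they can trigger another paid LLM call.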


