Why I Split Minutes Prediction Into Two Models Instead of One

A single regression model trained on NBA game logs predicts that Joel Embiid will play 11 minutes in a game where he's listed as OUT. The model has never seen a confident zero. Every row in the training data has some minutes played, because the standard NBA API endpoint only returns logs for games where a player was active. The model knows what 28 minutes looks like and what 34 minutes looks like. It has no idea what zero looks like. This is the root problem behind the two-stage minutes engine in CourtVision. The Training Data Gap The NBA API's PlayerGameLog endpoint returns one row per game for every game a player appeared in. If Embiid sits, there's no row. If Tyrese Maxey plays 38 minutes, there's a row. The dataset is survivor-biased: it only contains games where players actually played. Train a regressor on this dataset, feed it features for a player who's clearly going to sit, and the model interpolates. It finds the nearest neighborhood in the feature space and returns a plausib

Why I Split Minutes Prediction Into Two Models Instead of One

Related Articles

Best Theraguns and Therabody Tools for Smarter Recovery (2026)

10 Top Software Medium Publications 2026

Shamir’s Secret Sharing (Explanation, Cryptography, Math, and Scripts)

Day 13. I Built the Systems. Now I Scale.

dotenv Hasn’t Evolved in 10 Years. So I Built Something Better.

Related Articles

News
Best Theraguns and Therabody Tools for Smarter Recovery (2026)
Wired • 3h ago

News
10 Top Software Medium Publications 2026
Medium Programming • 3h ago

News
Shamir’s Secret Sharing (Explanation, Cryptography, Math, and Scripts)
Medium Programming • 4h ago

News
Day 13. I Built the Systems. Now I Scale.
Medium Programming • 4h ago

News
dotenv Hasn’t Evolved in 10 Years. So I Built Something Better.
Medium Programming • 4h ago