Can Statcast Data Improve MLB Player Performance Predictions? — Beating Marcel with LightGBM

Introduction This article is a continuation of my NPB Bayesian prediction series. Along the way, I reached a conclusion: "Without tracking data like Statcast, we can't break through the next wall." In my NPB project, I added Bayesian regression (Stan/Ridge) on top of Marcel projections. At the player level there was consistent improvement (p=0.06), but at the team level the gains disappeared. The reason: Marcel's 3-year weighted average is already accurate for high-PA regulars, leaving no margin for improvement using only aggregate stats like K%/BB%/BABIP. MLB has Statcast . This article tests whether Statcast tracking features can beat Marcel. GitHub : https://github.com/yasumorishima/baseball-mlops Streamlit : https://baseball-mlops.streamlit.app/ What is Marcel? Marcel is a simple projection system from the 1980s: weighted average of the past 3 years (weights 5:4:3) + regression to the mean + age adjustment. Despite its simplicity, it's remarkably accurate — especially for regular p

Can Statcast Data Improve MLB Player Performance Predictions? — Beating Marcel with LightGBM

Related Articles

My first Medium article

X is testing a new ad format that connects posts with products

Mycelium Framework

Life EV officially owns Rad Power Bikes now

Claude’s Cycles

Related Articles

News
My first Medium article
Medium Programming • 4h ago

News
X is testing a new ad format that connects posts with products
TechCrunch • 4h ago

News
Mycelium Framework
Lobsters • 5h ago

News
Life EV officially owns Rad Power Bikes now
TechCrunch • 5h ago

News
Claude’s Cycles
Medium Programming • 5h ago