
Part 5: Gradient Descent in the Trenches — Optimizing and Tying Weights
via Medium Python, by Daniel Kolawole Aina
How I used multivariable calculus and architectural hacks to shrink parameter bloat and accelerate learning.


