Deep Learning

Deep Learning - Auto Differentiator From Scratch

Auto Diff Is The Dark Magic Of All Dark Magics Of Deep Learning

Posted by Rico's Nerd Cluster on January 6, 2022

Introduction

Gradients here refer to scalar to matrix gradient.
We need to accumulate gradients for mini-batch training.

Elementwise Multiplication gradients: A * B = C

del C / del A_ij = B_ij -> del C/ del A = B