Deep Learning - Auto Differentiator From Scratch

Auto Diff Is The Dark Magic Of All Dark Magics Of Deep Learning

Posted by Rico's Nerd Cluster on January 6, 2022

Introduction

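  • Gradients here are scalar-to-matrix gradients: the derivative of a scalar (e.g. the loss) with respect to every entry of a matrix, so the gradient has the same shape as that matrix.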
  • We need to accumulate gradients for mini-batch training.
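
As a rough illustration of gradient accumulation, here is a minimal sketch in plain Python. The `Param` class, the `accumulate_batch_grad` helper, and the squared-error loss are hypothetical names and choices made up for this example, not part of any library, and a single scalar weight stands in for a full matrix. Averaging the sum over the batch is one common convention, assumed here.

```python
class Param:
    """Hypothetical parameter holder: a value plus an accumulated gradient."""

    def __init__(self, value):
        self.value = value
        self.grad = 0.0

    def zero_grad(self):
        # Reset before each mini-batch so old gradients do not leak in.
        self.grad = 0.0


def accumulate_batch_grad(w, batch):
    """Accumulate d(loss)/dw over a mini-batch, then average.

    Assumed per-sample loss: 0.5 * (w * x - y) ** 2,
    whose derivative with respect to w is (w * x - y) * x.
    """
    w.zero_grad()
    for x, y in batch:
        w.grad += (w.value * x - y) * x  # add this sample's contribution
    w.grad /= len(batch)  # mean over the batch (an assumed convention)
    return w.grad


w = Param(2.0)
batch = [(1.0, 1.0), (2.0, 3.0)]
mean_grad = accumulate_batch_grad(w, batch)
print(mean_grad)
```

The key point is the `+=`: each sample adds to the same gradient buffer, and the buffer is cleared only at the start of the next mini-batch.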

Elementwise Multiplication gradients: A * B = C

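  • del C_ij / del A_ij = B_ij, and C_ij does not depend on any other entry of A. For a scalar loss L, the chain rule gives del L / del A_ij = (del L / del C_ij) * B_ij, i.e. del L / del A = (del L / del C) * B elementwise. When del L / del C is all ones, this reduces to del L / del A = B.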
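
The rule above can be checked numerically. The following is a small sketch in plain Python with nested lists standing in for matrices. The function names and the choice of loss (the sum of all entries of C, so the upstream gradient is all ones) are assumptions made for this example only.

```python
def elementwise_mul(A, B):
    """Forward pass: C_ij = A_ij * B_ij."""
    return [[a * b for a, b in zip(row_a, row_b)] for row_a, row_b in zip(A, B)]


def elementwise_mul_backward(grad_C, B):
    """Backward pass for A: dL/dA_ij = dL/dC_ij * B_ij."""
    return elementwise_mul(grad_C, B)


def loss(A, B):
    """Assumed scalar loss: the sum of all entries of C = A * B."""
    return sum(sum(row) for row in elementwise_mul(A, B))


A = [[1.0, 2.0], [3.0, 4.0]]
B = [[5.0, 6.0], [7.0, 8.0]]

# With L = sum(C), the upstream gradient dL/dC is all ones.
grad_C = [[1.0, 1.0], [1.0, 1.0]]
grad_A = elementwise_mul_backward(grad_C, B)

# Finite-difference check: nudge one entry of A at a time.
eps = 1e-6
numeric_grad_A = []
for i in range(len(A)):
    row = []
    for j in range(len(A[0])):
        A_plus = [r[:] for r in A]  # copy so A itself is untouched
        A_plus[i][j] += eps
        row.append((loss(A_plus, B) - loss(A, B)) / eps)
    numeric_grad_A.append(row)

print(grad_A)
print(numeric_grad_A)
```

With an all-ones upstream gradient the analytic result equals B, and the finite-difference estimate should agree with it up to a small numerical error.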