The calculus and linear algebra behind every neural net.
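To make that concrete, here is a minimal sketch of the chain rule at work on a tiny one-hidden-layer network (all shapes and values below are made up for illustration), with a finite-difference check that the hand-derived gradient is right:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(3)           # input vector (hypothetical size)
W1 = rng.standard_normal((4, 3))     # hidden-layer weights
w2 = rng.standard_normal(4)          # output weights

def forward(W1):
    h = np.maximum(W1 @ x, 0.0)      # ReLU hidden layer
    return w2 @ h                    # scalar output

# Chain rule by hand: dy/dW1[i, j] = w2[i] * relu'(z[i]) * x[j]
z = W1 @ x
grad = np.outer(w2 * (z > 0), x)

# Finite-difference check on a single entry
eps = 1e-6
W1_bumped = W1.copy()
W1_bumped[0, 0] += eps
numeric = (forward(W1_bumped) - forward(W1)) / eps
print(grad[0, 0], numeric)           # the two should agree to ~6 digits
```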
The workhorse optimizer — derive, implement, and visualize it.
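Assuming the workhorse in question is plain gradient descent, a minimal sketch on a one-dimensional quadratic shows the whole algorithm; the learning rate of 0.1 is an arbitrary choice:

```python
# Gradient descent on f(x) = (x - 3)^2, whose gradient is 2*(x - 3).
lr = 0.1        # learning rate (step size)
x = 0.0         # starting point
for step in range(25):
    grad = 2.0 * (x - 3.0)   # analytic gradient of f at x
    x -= lr * grad           # step opposite the gradient
    if step % 5 == 0:
        print(f"step {step:2d}  x = {x:.5f}")
print("converged near x =", round(x, 4))  # the minimum is at x = 3
```

Each iterate contracts toward the minimum by a constant factor, which is exactly the behavior a convergence plot would visualize.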
Activation functions, their derivatives, and why ReLU won.
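A sketch of three common activations' derivatives (using the usual subgradient convention of 0 at the ReLU kink); the printout hints at why ReLU won: sigmoid and tanh gradients vanish for large inputs, while ReLU's stays exactly 1 on the positive side:

```python
import math

def d_sigmoid(x):
    s = 1.0 / (1.0 + math.exp(-x))
    return s * (1.0 - s)            # peaks at 0.25, vanishes for large |x|

def d_tanh(x):
    return 1.0 - math.tanh(x) ** 2  # peaks at 1, also vanishes

def d_relu(x):
    return 1.0 if x > 0 else 0.0    # constant 1 on the positive side

for x in (0.0, 2.0, 10.0):
    print(f"x={x:5.1f}  sigmoid'={d_sigmoid(x):.2e}  "
          f"tanh'={d_tanh(x):.2e}  relu'={d_relu(x):.0f}")
```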
Turn raw logits into normalized probabilities.
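Presumably this refers to softmax; here is a minimal, numerically stable sketch (subtracting the max prevents exp from overflowing on large logits and cancels in the ratio):

```python
import numpy as np

def softmax(logits):
    # Shift by the max for numerical stability; the shift
    # cancels when the exponentials are normalized.
    z = logits - np.max(logits)
    e = np.exp(z)
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.1])
p = softmax(logits)
print(p)           # e.g. [0.659 0.242 0.099]
print(p.sum())     # a valid distribution: sums to 1.0
```

Note that softmax outputs sum to 1 but are not automatically well calibrated; post-hoc techniques such as temperature scaling are typically used for that.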
The canonical classification loss — from KL divergence down.
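The derivation, compressed into a numeric check with made-up distributions: KL(p‖q) = H(p, q) − H(p), so with the target p fixed, minimizing cross-entropy minimizes the KL divergence:

```python
import numpy as np

p = np.array([0.7, 0.2, 0.1])   # target distribution (made up)
q = np.array([0.5, 0.3, 0.2])   # model distribution (made up)

cross_entropy = -(p * np.log(q)).sum()      # H(p, q)
entropy       = -(p * np.log(p)).sum()      # H(p)
kl            =  (p * np.log(p / q)).sum()  # KL(p || q)

# KL(p||q) = H(p,q) - H(p): same minimizer in q, since H(p) is constant
print(np.isclose(kl, cross_entropy - entropy))  # True
```

With a one-hot target, H(p) = 0 and the two quantities coincide exactly.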
Predictions as matrix multiplications.
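A sketch with arbitrary shapes: stack the inputs as the rows of X, and a whole batch of linear predictions collapses into a single matrix product:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((5, 3))   # 5 examples, 3 features each
w = rng.standard_normal(3)        # one weight per feature
b = 0.5                           # bias

y_hat = X @ w + b                 # all 5 predictions in one matmul
print(y_hat.shape)                # (5,)

# Same result as predicting one example at a time
assert np.allclose(y_hat, [row @ w + b for row in X])
```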
Closed-form vs iterative — when each wins.
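A sketch comparing the two on least squares with synthetic data (the hyperparameters below are arbitrary): the closed-form normal equations give the exact answer in one solve but scale poorly with feature count, while gradient descent walks to the same weights in many cheap steps:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((100, 3))
w_true = np.array([1.0, -2.0, 0.5])
y = X @ w_true + 0.01 * rng.standard_normal(100)

# Closed form: solve the normal equations X^T X w = X^T y
w_closed = np.linalg.solve(X.T @ X, X.T @ y)

# Iterative: gradient descent on mean squared error
w = np.zeros(3)
lr = 0.1
for _ in range(500):
    grad = 2.0 / len(y) * X.T @ (X @ w - y)
    w -= lr * grad

print(w_closed)
print(w)          # both land near w_true = [1, -2, 0.5]
```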