The loop, the diagnostics, the first real model
Forward, loss, backward, step — the four-line core.
Loss curves, grad norms, and the art of debugging.
Find and fix silently-dying neurons.
Ship a working MNIST model end-to-end.