Perspectives on the science of deep learning
Three essays on building theory that matters.
Three essays on building theory that matters.
Deep linear networks are simple enough to study analytically but rich enough to exhibit key phenomena of neural network training.
What is the origin of neural scaling laws? What do they tell us about the structure of data? What are the limits of interpretability?