Seminar • Machine Learning • Backpropagation Beyond the Gradient
Please note: This seminar will take place in DC 2585.
Felix Dangel, Postdoctoral Researcher
Vector Institute for Artificial Intelligence
Popular deep learning frameworks prioritize computing the average mini-batch gradient. Yet other quantities, such as the variance of the mini-batch gradient or many approximations to the Hessian, can be computed efficiently and in the same backward pass as the gradient. These quantities are of great interest to researchers and practitioners, but implementing them is often burdensome or inefficient.
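As a rough illustration of the idea, the sketch below accumulates the mean and the element-wise variance of per-sample gradients in a single loop over the mini-batch. It is a toy under stated assumptions, not the speaker's method or any framework's API: the model is a hypothetical linear model with squared loss, and the function name `grad_mean_and_variance` and the data are made up for this example.

```python
def grad_mean_and_variance(w, xs, ys):
    """Mean and element-wise variance of per-sample gradients.

    Assumes a toy linear model with per-sample loss 0.5 * (w . x - y)**2,
    whose gradient with respect to w is (w . x - y) * x.
    """
    n, d = len(xs), len(w)
    first = [0.0] * d   # running sum of per-sample gradients
    second = [0.0] * d  # running sum of squared per-sample gradients
    for x, y in zip(xs, ys):
        # Error signal that a backward pass would propagate for this sample.
        residual = sum(wj * xj for wj, xj in zip(w, x)) - y
        for j in range(d):
            g = residual * x[j]  # j-th entry of this sample's gradient
            first[j] += g
            second[j] += g * g
    mean = [s / n for s in first]
    # Variance = second moment minus squared mean, per parameter.
    var = [s2 / n - m * m for s2, m in zip(second, mean)]
    return mean, var


# Hypothetical mini-batch of four samples with two features each.
w = [1.0, -1.0]
xs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 1.0]]
ys = [0.0, 0.0, 0.0, 0.0]
mean, var = grad_mean_and_variance(w, xs, ys)
```

The point of the sketch is that the second moment reuses the same per-sample quantities as the mean, so the variance comes at little extra cost once those quantities are available.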