23. Accelerating Gradient Descent (Use Momentum)

MIT Matrix Methods in Data Analysis, Signal Processing, and Machine Learning, Spring 2018 Instructor: Gilbert Strang View the complete course: YouTube Playlist: In this lecture, Professor Strang explains both momentum-based gradient descent and Nesterov’s accelerated gradient descent. License: Creative Commons BY-NC-SA More information at More courses at
Back to Top