Seminar on Stochastics and PDEs: Stefan Perko

Event information

Event date
-
Event type
Public lectures, seminars and round tables
Event language
English
Event accessibility
Event space is accessible for all
Event payment
Free of charge
Event location category
Mattilanniemi

Title: Towards diffusion approximations for stochastic gradient descent without replacement
 

Abstract: Stochastic gradient descent without replacement (SGDo) is predominantly used to train machine learning models in practice. However, the mathematical theory of this algorithm remains underexplored compared to its "with replacement" and "one-pass" counterparts. We propose a stochastic, continuous-time approximation to SGDo based on a family of stochastic differential equations driven by a stochastic process we call an epoched Brownian motion, which encapsulates the behavior of reusing the same data points in subsequent epochs. We investigate this diffusion approximation by considering an application of SGDo to linear regression. Explicit convergence results are derived for constant learning rates and a sequence of learning rates satisfying the Robbins-Monro conditions. Finally, the validity of continuous-time dynamics are further substantiated by numerical experiments.

Add to calendar