High-dimensional Statistical Modeling Team Seminar (Talk by Mr. Ziyin Liu, University of Tokyo)

2021/8/11 18:17

要旨

Title: Stochastic Gradient Descent with Multiplicative Noise

Abstract:
Stochastic gradient descent (SGD) is the main optimization algorithm behind the success of deep learning. Recently, it is shown that the stochastic noise in SGD is multiplicative, i.e., the strength of the noise crucially depends on the model parameter. In this talk, we show that the dynamics of SGD can be very surprising and unintuitive when the noise is multiplicative. For example, we show that (1) SGD may converge to a local maximum; (2) SGD may escape a saddle point arbitrarily slowly; (3) SGD may prefer sharp minima over the flat ones; and (4) AMSGrad may converge to a local maximum. If time allows, we also present some recent results that shed light on how SGD works under the multiplicative noise. This presentation is mainly based on the following three works of the speaker.
[1] https://arxiv.org/abs/2107.11774
[2] https://arxiv.org/abs/2105.09557
[3] https://arxiv.org/abs/2012.03636

Bio:
Liu Ziyin. http://cat.phys.s.u-tokyo.ac.jp/~zliu/

詳細情報

日時	2021/09/21(火) 16:00 - 17:00
URL	https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/125701

日曜日	月曜日	火曜日	水曜日	木曜日	金曜日	土曜日
		1日のイベントページへのリンク	2日のイベントページへのリンク	3日のイベントページへのリンク	4日のイベントページへのリンク	5日
6日	7日	8日	9日のイベントページへのリンク	10日のイベントページへのリンク	11日	12日
13日	14日	15日のイベントページへのリンク	16日のイベントページへのリンク	17日のイベントページへのリンク	18日	19日
20日	21日	22日	23日	24日	25日	26日
27日	28日	29日	30日	31日

革新知能統合研究センター

イベント