DC Field | Value | Language |
---|---|---|
dc.contributor.author | Seong, Sihyeon | ko |
dc.contributor.author | Lee, Yekang | ko |
dc.contributor.author | Kee, Youngwook | ko |
dc.contributor.author | Han, Dongyoon | ko |
dc.contributor.author | Kim, Junmo | ko |
dc.date.accessioned | 2018-12-20T02:05:33Z | - |
dc.date.available | 2018-12-20T02:05:33Z | - |
dc.date.created | 2018-11-29 | - |
dc.date.created | 2018-11-29 | - |
dc.date.created | 2018-11-29 | - |
dc.date.issued | 2018-08-09 | - |
dc.identifier.citation | 34th Conference on Uncertainty in Artificial Intelligence (UAI), pp.1020 - 1030 | - |
dc.identifier.uri | http://hdl.handle.net/10203/247381 | - |
dc.description.abstract | Whereas optimizing deep neural networks using stochastic gradient descent has shown great performances in practice, the rule for setting step size (i.e. learning rate) of gradient descent is not well studied. Although it appears that some intriguing learning rate rules such as ADAM (Kingma and Ba, 2014) have since been developed, they concentrated on improving convergence, not on improving generalization capabilities. Recently, the improved generalization property of the flat minima was revisited, and this research guides us towards promising solutions to many current optimization problems. In this paper, we analyze the flatness of loss surfaces through the lens of robustness to input perturbations and advocate that gradient descent should be guided to reach flatter region of loss surfaces to achieve generalization. Finally, we suggest a learning rate rule for escaping sharp regions of loss surfaces, and we demonstrate the capacity of our approach by performing numerous experiments. | - |
dc.language | English | - |
dc.publisher | Association for Uncertainty in Artificial Intelligence (AUAI) | - |
dc.title | Towards Flatter Loss Surface via Nonmonotonic Learning Rate Scheduling | - |
dc.type | Conference | - |
dc.identifier.wosid | 000493119200100 | - |
dc.identifier.scopusid | 2-s2.0-85059402235 | - |
dc.type.rims | CONF | - |
dc.citation.beginningpage | 1020 | - |
dc.citation.endingpage | 1030 | - |
dc.citation.publicationname | 34th Conference on Uncertainty in Artificial Intelligence (UAI) | - |
dc.identifier.conferencecountry | US | - |
dc.identifier.conferencelocation | Monterey, California | - |
dc.contributor.localauthor | Kim, Junmo | - |
dc.contributor.nonIdAuthor | Kee, Youngwook | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.