DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Moon, Il-Chul | - |
dc.contributor.advisor | 문일철 | - |
dc.contributor.author | Joo, Weonyoung | - |
dc.date.accessioned | 2021-05-12T19:42:42Z | - |
dc.date.available | 2021-05-12T19:42:42Z | - |
dc.date.issued | 2020 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=924242&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/284294 | - |
dc.description | 학위논문(박사) - 한국과학기술원 : 산업및시스템공학과, 2020.8,[vii, 64 p. :] | - |
dc.description.abstract | Estimating the gradients of stochastic nodes is a crucial research question in the deep generative modeling community, since it allows model parameters to be optimized by gradient descent. This dissertation discusses two types of pathwise gradient estimators: one for the Dirichlet distribution, and one for generic discrete distributions. In the first work, we propose the Dirichlet Variational Autoencoder (DirVAE), which uses a Dirichlet prior. To infer the parameters of DirVAE, we develop a pathwise gradient estimator by approximating the inverse cumulative distribution function of the Gamma distribution, which is a component of the Dirichlet distribution. This approximation under the new prior prompted an investigation of component collapsing, and DirVAE revealed that component collapsing originates from two sources: decoder weight collapsing and latent value collapsing. By resolving the component collapsing problem with the Dirichlet prior, we show that DirVAE produces disentangled latent representations, which leads to significant performance gains. Compared to the continuous case, the gradient estimation problem becomes more complex when the stochastic nodes are discrete, because pathwise derivative techniques cannot be applied directly; gradient estimation then requires either score function methods or a continuous relaxation of the discrete random variables. In the second work, we propose a generalized version of the Gumbel-Softmax estimator based on continuous relaxation, which can relax a broader class of discrete probability distributions than current practice. Specifically, we utilize the truncation of discrete random variables and the Gumbel-Softmax trick with a linear transformation for the relaxation. The proposed approach enables the relaxed discrete random variables to be reparameterized and to backpropagate through large-scale stochastic neural networks. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Deep Generative Model; Variational Autoencoder; Pathwise Gradient Estimator; Reparameterization Trick; Representation Learning | - |
dc.subject | 딥 생성 모델; 변분 오토인코더; 경로별 경사 추정자; 재매개화 기교; 표현 학습 | - |
dc.title | Pathwise gradient estimators for various probability distributions in deep generative models | - |
dc.title.alternative | 딥 생성 모델에서의 다양한 확률 분포에 대한 경로별 경사 추정자 | - |
dc.type | Thesis(Ph.D) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 : 산업및시스템공학과 | - |
dc.contributor.alternativeauthor | 주원영 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
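The abstract's second contribution builds on the standard Gumbel-Softmax trick, which makes samples from a categorical distribution differentiable with respect to the distribution's logits. As a minimal sketch of that underlying trick (not the dissertation's generalized estimator, and using NumPy rather than the author's implementation), the following shows how a relaxed one-hot sample is produced from noise drawn outside the computation graph:

```python
import numpy as np

def sample_gumbel_softmax(logits, temperature, rng):
    """Draw a continuously relaxed one-hot sample via the Gumbel-Softmax trick.

    g_i = -log(-log(u_i)) with u_i ~ Uniform(0, 1), then
    y = softmax((logits + g) / temperature).
    As temperature -> 0, y approaches a one-hot draw from
    Categorical(softmax(logits)); for positive temperature the sample is
    a smooth function of the logits, so gradients can flow through it.
    """
    u = rng.uniform(low=1e-10, high=1.0, size=np.shape(logits))
    gumbel = -np.log(-np.log(u))          # Gumbel(0, 1) noise
    z = (np.asarray(logits) + gumbel) / temperature
    z = z - z.max()                       # subtract max for numerical stability
    y = np.exp(z)
    return y / y.sum()

rng = np.random.default_rng(0)
logits = np.log(np.array([0.2, 0.3, 0.5]))  # hypothetical 3-way categorical
soft = sample_gumbel_softmax(logits, temperature=0.5, rng=rng)
# soft lies on the probability simplex; the randomness enters only through
# the Gumbel noise, so the map logits -> soft is deterministic and
# differentiable (the pathwise / reparameterization property).
```

The abstract's proposed estimator extends this idea with truncation and a linear transformation so that distributions beyond the plain categorical case can be relaxed in the same pathwise manner.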