DSpace at KOASAS: Learning-based image synthesis with disentangled representations

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Ph.D.(박사논문)

Learning-based image synthesis with disentangled representations분해 표현을 이용한 학습 기반 이미지 합성

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 182
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	Dae-Shik Kim	-
dc.contributor.advisor	김대식	-
dc.contributor.advisor	Soo-Young Lee	-
dc.contributor.advisor	이수영	-
dc.contributor.author	Kim, Bo-Kyeong	-
dc.date.accessioned	2021-05-12T19:42:22Z	-
dc.date.available	2021-05-12T19:42:22Z	-
dc.date.issued	2020	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=915168&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/284277	-
dc.description	학위논문(박사) - 한국과학기술원 : 전기및전자공학부, 2020.2,[vi, 97 p. :]	-
dc.description.abstract	A disentangled representation separates the explanatory generative factors of data within the representation, offering desirable properties such as interpretability and controllability. Recent methods for unsupervised disentanglement learning show their promise on simple data but often yield unsatisfactory results on real-world complex data. This issue can be alleviated by incorporating human prior knowledge or additional learning objectives into the disentangling process, which is explored in this dissertation. We propose two disentanglement learning methods with (1) shape supervision and (2) category supervision and employ them for image synthesis. For virtual clothing try-on (VTO) applications, the first method synthesizes clothing segments via disentangling their underlying factors (i.e., shape and style). An encoder separates style features from shape features that are defined as the foreground masks of segments. A generator combines these features to produce clothing segments, which are further superimposed on person images for try-on. Moreover, we propose an evaluation metric to assess how well the generator synthesizes styles. Unlike recent VTO works with full-image synthesis, our disentangling strategy enables segment-level synthesis and yields several benefits including accurate style expression and easy data collection. Experiments on fashion-parsing datasets and a VTO benchmark show the generation of high-quality clothing segments and the superiority of our method over existing synthesis methods. Additionally, we compare our method with neural style transfer and visualize the different concepts of style.For controllable image synthesis, the second method separates the generative factors of images (i.e., content and style) into two latent vectors in a variational autoencoder. Under class supervision with partially available labels, one vector captures content factors relevant to the classification. The other vector captures style factors related to the remaining variation. This separation is boosted by a learning objective to encourage statistical independence between the vectors, called vector independence. We reveal that (i) this independence term exists in decomposing the evidence lower bound with two latent vectors, and (ii) penalizing this term along with the total correlation leads to good disentanglement learning. Experiments on MNIST and Fashion-MNIST datasets demonstrate the effectiveness of our method for improving image classification and synthesis. Furthermore, experiments on dSprites dataset quantitatively show the relation between vector independence and disentanglement. We believe that this research contributes to the advancement of learning disentangled representations and improving controllability of machine learning methods.	-
dc.language	eng	-
dc.publisher	한국과학기술원	-
dc.subject	disentanglement learning▼adisentangled representations▼aimage synthesis▼aneural network▼avariational autoencoder▼asemi-supervised learning▼avector independence▼avirtual try-on	-
dc.subject	분해 표현 학습▼a분해 표현▼a이미지 합성▼a신경회로망▼a변분 오토인코더▼a준 지도 학습▼a벡터 독립성▼a가상 옷입히기	-
dc.title	Learning-based image synthesis with disentangled representations	-
dc.title.alternative	분해 표현을 이용한 학습 기반 이미지 합성	-
dc.type	Thesis(Ph.D)	-
dc.identifier.CNRN	325007	-
dc.description.department	한국과학기술원 :전기및전자공학부,	-
dc.contributor.alternativeauthor	김보경	-

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Learning-based image synthesis with disentangled representations분해 표현을 이용한 학습 기반 이미지 합성

KOASAS

Communities & Collections