DSpace at KOASAS: Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis

DSpace at KOASAS

RIMS Collection RIMS Conference Papers

Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis

Cited 184 time in

Cited 181 time in

Hit : 233
Download : 0

Export

Hong, Seunghoon researcher / Yang, Dingdong / Choi, Jongwook / Lee, Honglak

We propose a novel hierarchical approach for text-to-image synthesis by inferring semantic layout. Instead of learning a direct mapping from text to image, our algorithm decomposes the generation process into multiple steps, in which it first constructs a semantic layout from the text by the layout generator and converts the layout to an image by the image generator. The proposed layout generator progressively constructs a semantic layout in a coarse-to-fine manner by generating object bounding boxes and refining each box by estimating object shapes inside the box. The image generator synthesizes an image conditioned on the inferred semantic layout, which provides a useful semantic structure of an image matching with the text description. Our model not only generates semantically more meaningful images, but also allows automatic annotation of generated images and user-controlled generation process by modifying the generated scene layout. We demonstrate the capability of the proposed model on challenging MS-COCO dataset and show that the model can substantially improve the image quality, interpretability of output and semantic alignment to input text over existing approaches.

Publisher: IEEE Computer Society

Issue Date: 2018-06-18

Language: English

Citation: 31st Meeting of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018, pp.7986 - 7994

DOI: 10.1109/CVPR.2018.00833

URI: http://hdl.handle.net/10203/269576

Appears in Collection: CS-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 184 items in WoS	Click to see citing articles in

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis

This item is cited by other documents in WoS

KOASAS

Communities & Collections