DSpace at KOASAS: Hide-and-Tell: Learning to Bridge Photo Streams for Visual Storytelling

Hide-and-Tell: Learning to Bridge Photo Streams for Visual Storytelling

Authors: Jung, Yunjae / Kim, Dahun / Woo, Sanghyun / Kim, Kyungsu / Kim, Sungjin / Kweon, In-So

Visual storytelling is the task of creating a short story based on a photo stream. Unlike existing visual captioning, storytelling aims to contain not only factual descriptions, but also human-like narration and semantics. However, the VIST dataset provides only a small, fixed number of photos per story. Therefore, the main challenge of visual storytelling is to fill in the visual gap between photos with a narrative and imaginative story. In this paper, we propose to explicitly learn to imagine a storyline that bridges the visual gap. During training, one or more photos are randomly omitted from the input stack, and we train the network to produce a full, plausible story even with the missing photo(s). Furthermore, we propose a hide-and-tell model for visual storytelling, which is designed to learn non-local relations across the photo stream and to refine and improve conventional RNN-based models. In experiments, we show that both our hide-and-tell scheme and the network design are effective at storytelling, and that our model outperforms previous state-of-the-art methods on automatic metrics. Finally, we qualitatively show the learned ability to interpolate a storyline over visual gaps.
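The "hide" step described in the abstract (randomly omitting one or more photos from the input stack during training) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `hide_photos`, the `num_hidden` parameter, the use of plain Python lists for per-photo feature vectors, and the choice to zero out a hidden photo's features (rather than, say, dropping it from the sequence) are all assumptions made for illustration only.

```python
import random


def hide_photos(features, num_hidden=1, rng=None):
    """Return a masked copy of a photo stream plus the hidden indices.

    features   : list of per-photo feature vectors (lists of floats).
    num_hidden : how many photos to hide (hypothetical parameter).
    rng        : optional random.Random instance for reproducibility.

    Assumption: a "hidden" photo is represented by an all-zero vector.
    The abstract only says photos are omitted; it does not specify how.
    """
    if rng is None:
        rng = random.Random()
    if not 0 <= num_hidden < len(features):
        raise ValueError("num_hidden must leave at least one photo visible")

    # Pick distinct positions in the stream to hide.
    hidden = sorted(rng.sample(range(len(features)), num_hidden))

    # Copy the stream so the caller's data is left untouched.
    masked = [list(f) for f in features]
    for i in hidden:
        masked[i] = [0.0] * len(masked[i])
    return masked, hidden


# Example: a five-photo stream with toy 2-dimensional features.
stream = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0], [9.0, 1.0]]
masked, hidden = hide_photos(stream, num_hidden=2, rng=random.Random(0))
```

In a training loop under these assumptions, the masked stream would be fed to the story generator while the loss is still computed against the full ground-truth story, so the model is pushed to infer the content of the hidden positions from the surrounding photos.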

Publisher: Association for the Advancement of Artificial Intelligence

Issue Date: 2020-02

Language: English

Citation: 34th AAAI Conference on Artificial Intelligence, AAAI 2020, pp. 11213-11220

ISSN: 2159-5399

URI: http://hdl.handle.net/10203/278542

Appears in Collection: EE-Conference Papers
