DSpace at KOASAS: Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning

Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning

Kim, Dong-Jin / Choi, Jinsoo / Oh, Tae-Hyun / Kweon, In-So

Our goal in this work is to train an image captioning model that generates denser and more informative captions. We introduce "relational captioning," a novel image captioning task that aims to generate multiple captions with respect to relational information between objects in an image. Relational captioning is a framework that is advantageous in both diversity and amount of information, leading to image understanding based on relationships. Part-of-speech (POS, i.e., subject-object-predicate categories) tags can be assigned to every English word. We leverage the POS as a prior to guide the correct sequence of words in a caption. To this end, we propose a multi-task triple-stream network (MTTSNet), which consists of three recurrent units for the respective POS and jointly performs POS prediction and captioning. We demonstrate more diverse and richer representations generated by the proposed model against several baselines and competing methods.

Publisher: IEEE Conference on Computer Vision and Pattern Recognition

Issue Date: 2019-06-19

Language: English

Citation: IEEE Conference on Computer Vision and Pattern Recognition, pp.6264 - 6273

DOI: 10.1109/CVPR.2019.00643

URI: http://hdl.handle.net/10203/268690

Appears in Collection: EE-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

This item is cited by 51 documents in WoS.

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
