DC Field | Value | Language |
---|---|---|
dc.contributor.author | Han, Seung-Ho | ko |
dc.contributor.author | Kwon, Min-Su | ko |
dc.contributor.author | Choi, Ho-Jin | ko |
dc.date.accessioned | 2020-10-08T00:55:05Z | - |
dc.date.available | 2020-10-08T00:55:05Z | - |
dc.date.created | 2020-09-21 | - |
dc.date.created | 2020-09-21 | - |
dc.date.created | 2020-09-21 | - |
dc.date.issued | 2020-07 | - |
dc.identifier.citation | JOURNAL OF ENGINEERING-JOE, v.2020, no.13, pp.589 - 594 | - |
dc.identifier.issn | 2051-3305 | - |
dc.identifier.uri | http://hdl.handle.net/10203/276483 | - |
dc.description.abstract | This article presents an eXplainable AI (XAI) approach to image captioning. Recently, deep learning techniques have been intensively used to this task with relatively good performance. Due to the 'black-box' paradigm of deep learning, however, existing approaches are unable to provide clues to explain the reasons why specific words have been selected when generating captions for given images, hence leading to generate absurd captions occasionally. To overcome this problem, this article proposes an explainable image captioning model, which provides a visual link between the region of an object (or a concept) in the given image and the particular word (or phrase) in the generated sentence. The model has been evaluated with two datasets, MSCOCO and Flickr30K, and both quantitative and qualitative results are presented to show the effectiveness of the proposed model. | - |
dc.language | English | - |
dc.publisher | INST ENGINEERING TECHNOLOGY-IET | - |
dc.title | EXplainable AI (XAI) approach to image captioning | - |
dc.type | Article | - |
dc.type.rims | ART | - |
dc.citation.volume | 2020 | - |
dc.citation.issue | 13 | - |
dc.citation.beginningpage | 589 | - |
dc.citation.endingpage | 594 | - |
dc.citation.publicationname | JOURNAL OF ENGINEERING-JOE | - |
dc.identifier.doi | 10.1049/joe.2019.1217 | - |
dc.contributor.localauthor | Choi, Ho-Jin | - |
dc.contributor.nonIdAuthor | Kwon, Min-Su | - |
dc.description.isOpenAccess | Y | - |
dc.type.journalArticle | Article; Proceedings Paper | - |
dc.subject.keywordAuthor | learning (artificial intelligence) | - |
dc.subject.keywordAuthor | natural language processing | - |
dc.subject.keywordAuthor | text analysis | - |
dc.subject.keywordAuthor | neural nets | - |
dc.subject.keywordAuthor | image processing | - |
dc.subject.keywordAuthor | computer vision | - |
dc.subject.keywordAuthor | XAI | - |
dc.subject.keywordAuthor | eXplainable AI approach | - |
dc.subject.keywordAuthor | deep learning techniques | - |
dc.subject.keywordAuthor | black-box paradigm | - |
dc.subject.keywordAuthor | explainable image captioning model | - |
dc.subject.keywordAuthor | absurd caption generation | - |
dc.subject.keywordAuthor | visual link | - |
dc.subject.keywordAuthor | MSCOCO dataset | - |
dc.subject.keywordAuthor | Flickr30K dataset | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.