eXplainable AI (XAI) approach to image captioning

DC Field | Value | Language
dc.contributor.author | Han, Seung-Ho | ko
dc.contributor.author | Kwon, Min-Su | ko
dc.contributor.author | Choi, Ho-Jin | ko
dc.date.accessioned | 2020-10-08T00:55:05Z | -
dc.date.available | 2020-10-08T00:55:05Z | -
dc.date.created | 2020-09-21 | -
dc.date.issued | 2020-07 | -
dc.identifier.citation | JOURNAL OF ENGINEERING-JOE, v.2020, no.13, pp.589 - 594 | -
dc.identifier.issn | 2051-3305 | -
dc.identifier.uri | http://hdl.handle.net/10203/276483 | -
dc.description.abstract | This article presents an eXplainable AI (XAI) approach to image captioning. Recently, deep learning techniques have been applied intensively to this task with relatively good performance. Due to the 'black-box' paradigm of deep learning, however, existing approaches are unable to provide clues that explain why specific words were selected when generating captions for given images, which occasionally leads to absurd captions. To overcome this problem, this article proposes an explainable image captioning model, which provides a visual link between the region of an object (or a concept) in the given image and the particular word (or phrase) in the generated sentence. The model has been evaluated on two datasets, MSCOCO and Flickr30K, and both quantitative and qualitative results are presented to show the effectiveness of the proposed model. | -
dc.language | English | -
dc.publisher | INST ENGINEERING TECHNOLOGY-IET | -
dc.title | eXplainable AI (XAI) approach to image captioning | -
dc.type | Article | -
dc.type.rims | ART | -
dc.citation.volume | 2020 | -
dc.citation.issue | 13 | -
dc.citation.beginningpage | 589 | -
dc.citation.endingpage | 594 | -
dc.citation.publicationname | JOURNAL OF ENGINEERING-JOE | -
dc.identifier.doi | 10.1049/joe.2019.1217 | -
dc.contributor.localauthor | Choi, Ho-Jin | -
dc.contributor.nonIdAuthor | Kwon, Min-Su | -
dc.description.isOpenAccess | Y | -
dc.type.journalArticle | Article; Proceedings Paper | -
dc.subject.keywordAuthor | learning (artificial intelligence) | -
dc.subject.keywordAuthor | natural language processing | -
dc.subject.keywordAuthor | text analysis | -
dc.subject.keywordAuthor | neural nets | -
dc.subject.keywordAuthor | image processing | -
dc.subject.keywordAuthor | computer vision | -
dc.subject.keywordAuthor | XAI | -
dc.subject.keywordAuthor | eXplainable AI approach | -
dc.subject.keywordAuthor | deep learning techniques | -
dc.subject.keywordAuthor | black-box paradigm | -
dc.subject.keywordAuthor | explainable image captioning model | -
dc.subject.keywordAuthor | absurd caption generation | -
dc.subject.keywordAuthor | visual link | -
dc.subject.keywordAuthor | MSCOCO dataset | -
dc.subject.keywordAuthor | Flickr30K dataset | -
