Does it Really Generalize Well on Unseen Data? Systematic Evaluation of Relational Triple Extraction Methods

Cited 1 time in Web of Science; cited 0 times in Scopus.
DC Field | Value | Language
dc.contributor.author | Lee, Juhyuk | ko
dc.contributor.author | Lee, Min-Joong | ko
dc.contributor.author | Yang, June Yong | ko
dc.contributor.author | Yang, Eunho | ko
dc.date.accessioned | 2022-12-06T01:00:45Z | -
dc.date.available | 2022-12-06T01:00:45Z | -
dc.date.created | 2022-12-04 | -
dc.date.issued | 2022-07-12 | -
dc.identifier.citation | 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022, pp.3849 - 3858 | -
dc.identifier.uri | http://hdl.handle.net/10203/301719 | -
dc.description.abstract | The ability to extract entities and their relations from unstructured text is essential for the automated maintenance of large-scale knowledge graphs. To keep a knowledge graph up-to-date, an extractor needs not only the ability to recall the triples it encountered during training, but also the ability to extract new triples from contexts it has never seen before. In this paper, we show that although existing extraction models can easily memorize and recall already seen triples, they do not generalize effectively to unseen triples. This alarming observation was previously unknown due to the composition of the test sets of the go-to benchmark datasets, which turn out to contain only 2% unseen data, rendering them incapable of measuring generalization performance. To measure generalization performance separately from memorization performance, we emphasize unseen data by rearranging datasets, sifting out training instances, or augmenting test sets. In addition, we present a simple yet effective augmentation technique to promote the generalization of existing extraction models, and experimentally confirm that the proposed method can significantly increase their generalization performance. | -
dc.language | English | -
dc.publisher | Association for Computational Linguistics | -
dc.title | Does it Really Generalize Well on Unseen Data? Systematic Evaluation of Relational Triple Extraction Methods | -
dc.type | Conference | -
dc.identifier.wosid | 000859869503073 | -
dc.identifier.scopusid | 2-s2.0-85138385914 | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 3849 | -
dc.citation.endingpage | 3858 | -
dc.citation.publicationname | 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022 | -
dc.identifier.conferencecountry | US | -
dc.identifier.conferencelocation | Seattle | -
dc.identifier.doi | 10.18653/v1/2022.naacl-main.282 | -
dc.contributor.localauthor | Yang, Eunho | -
dc.contributor.nonIdAuthor | Lee, Juhyuk | -
dc.contributor.nonIdAuthor | Lee, Min-Joong | -
Appears in Collection
AI-Conference Papers (학술대회논문 / Conference Papers)
Files in This Item
There are no files associated with this item.