Relation extraction from imbalanced data by independently estimating linguistic expressions언어적 표현의 독립적인 확률 추정을 통한 불균형 데이터로부터의 관계추출

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 475
  • Download : 0
Supervised Relation Extraction (RE) often comes with imbalanced datasets. Although advanced classifiers are achieving great performance on well-prepared datasets, it is hard to expect equivalent results on severely skewed data; underrepresented relation classes can be neglected. Data imbalance problem is more severe when a dataset contains many relations because most relations are in the long tail. This paper proposes an RE algorithm to learn from a dataset with imbalance. The algorithm independently estimates a probability (sample precision) of a pattern of dependency graph to express a certain relation. For efficiency, lattices are expanded with reasonable stopping conditions to collect dependency patterns. Since probabilities are independently estimated, it can be more resistant against the data imbalance problem. This paper includes the system description and an experiment to compare resistance with a standard classifier.
Advisors
Choi, Key Sunresearcher최기선researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2017
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2017.8,[iii, 20 p. :]

Keywords

Relation Extraction; Natural Language Processing; 관계추출; 자연어처리

URI
http://hdl.handle.net/10203/243455
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=718735&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0