Toward advice mining: advanced conditional random fields for extracting advice-revealing text units조언 관련 텍스트 추출을 위한 진전된 조건부 랜덤 필드 방법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 447
  • Download : 0
Web forums often contain explicit key learnings gleaned from people`s experiences since they are platforms for personal communications on sharing information with others. One of the key learnings contained in Web forums is often expressed in the form of advice. In this paper, we address the problem of advice-revealing text unit (ATU) extraction from online forums due to its usefulness in travel domain. We represent an advice as a two-tuple comprising an advice-revealing sentence and its context sentences. To extract the advice-revealing sentences, we propose to define the problem as a sequence labeling problem, using three different types of features: syntactic, contextual, and semantic features. We also improve the performance using Skip-Chain CRF in which our sentence generalization method is employed to construct the skip-edges. To extract the context sentences, we propose to use 2D-CRF model, which gives the best performance compared to traditional machine learning models. Finally, we present an integrated solution to extract advice-revealing sentences and their respective context sentences at the same time using our proposed models, i.e., Multiple Linear CRF (ML-CRF) and 2 Dimensional CRF Plus (2D-CRF+). The experiment results show that ML-CRF performs better than any other models for extracting advice-revealing sentences and context sentences.
Advisors
Myaeng, Sung-Hyonresearcher맹성현
Description
한국과학기술원 : 전산학과,
Publisher
한국과학기술원
Issue Date
2013
Identifier
567075/325007  / 020114594
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학과, 2013.8, [ v, 40 p. ]

Keywords

conditional random field; text mining; 조언 관련 텍스트 추출; 조건부 랜덤 필드; advice mining; 텍스트 추출

URI
http://hdl.handle.net/10203/196878
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=567075&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0