Relation information extraction using a comprehensive representation scheme: applications to oncology포괄적 표현법을 활용한 관계 정보 추출: 종양학에의 응용

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 448
  • Download : 0
Information extraction (IE) is a task of identifying relevant information from input text and producing structured data as output. While explicit expressions describing the target information are the basis for the development of IE systems, in-depth analysis of the input text becomes necessary when the information is conveyed implicitly in the text. In this dissertation, we address a specialized IE method for gene-cancer relations conveyed implicitly in biomedical text. Automatic identification of gene-cancer relations from a large volume of biomedical text is an important task for cancer research, since changes in genes are known to be the main cause of oncogenesis. In particular, it is essential to understand how a gene affects a cancer and to classify genes into oncogenes (genes that cause cancers), tumor suppressor genes (genes that protect cells from cancers) and biomarkers (genes that indicate normal or cancerous states), since such classification facilitates the process of treatment and diagnosis method development. However, despite the high volume of information on such gene classes that is conveyed implicitly with detailed descriptions about gene and cancer properties, there is not yet an IE system that is targeted at such implicit information. In this dissertation, we claim that in order to classify genes into candidates of oncogenes, tumor suppressor genes and biomarkers, gene-cancer relations described in biomedical text must be characterized with 1) how a gene changes; 2) how a cancer changes; and 3) the causality between the gene and the cancer. We propose a comprehensive representation scheme that identifies gene-cancer relations upon the three aspects above and use it for developing an advanced text mining system for oncogenes, tumor suppressor genes and biomarkers. The proposed representation scheme is shown to be adequate enough to describe the set of information that can be identified objectively from biomedical text, giving rise to an ann...
Advisors
Park, Jong-Cheolresearcher박종철
Description
한국과학기술원 : 전산학과,
Publisher
한국과학기술원
Issue Date
2014
Identifier
591849/325007  / 020057498
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전산학과, 2014.8, [ v, 71 p. ]

Keywords

Information Extraction; 바이오마커; 암억제유전자; 암유발유전자; 관계 정보; 유전자; Cancer; Gene; Relation Information; Oncogene; Tumor suppressor gene; Biomarker; 정보 추출; 암

URI
http://hdl.handle.net/10203/197828
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=591849&flag=dissertation
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0