Relation information extraction using a comprehensive representation scheme: applications to oncology = 포괄적 표현법을 활용한 관계 정보 추출: 종양학에의 응용

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 352
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorPark, Jong-Cheol-
dc.contributor.advisor박종철-
dc.contributor.authorLee, Hee-Jin-
dc.contributor.author이희진-
dc.date.accessioned2015-04-23T08:30:39Z-
dc.date.available2015-04-23T08:30:39Z-
dc.date.issued2014-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=591849&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/197828-
dc.description학위논문(박사) - 한국과학기술원 : 전산학과, 2014.8, [ v, 71 p. ]-
dc.description.abstractInformation extraction (IE) is a task of identifying relevant information from input text and producing structured data as output. While explicit expressions describing the target information are the basis for the development of IE systems, in-depth analysis of the input text becomes necessary when the information is conveyed implicitly in the text. In this dissertation, we address a specialized IE method for gene-cancer relations conveyed implicitly in biomedical text. Automatic identification of gene-cancer relations from a large volume of biomedical text is an important task for cancer research, since changes in genes are known to be the main cause of oncogenesis. In particular, it is essential to understand how a gene affects a cancer and to classify genes into oncogenes (genes that cause cancers), tumor suppressor genes (genes that protect cells from cancers) and biomarkers (genes that indicate normal or cancerous states), since such classification facilitates the process of treatment and diagnosis method development. However, despite the high volume of information on such gene classes that is conveyed implicitly with detailed descriptions about gene and cancer properties, there is not yet an IE system that is targeted at such implicit information. In this dissertation, we claim that in order to classify genes into candidates of oncogenes, tumor suppressor genes and biomarkers, gene-cancer relations described in biomedical text must be characterized with 1) how a gene changes; 2) how a cancer changes; and 3) the causality between the gene and the cancer. We propose a comprehensive representation scheme that identifies gene-cancer relations upon the three aspects above and use it for developing an advanced text mining system for oncogenes, tumor suppressor genes and biomarkers. The proposed representation scheme is shown to be adequate enough to describe the set of information that can be identified objectively from biomedical text, giving rise to an ann...eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectInformation Extraction-
dc.subject바이오마커-
dc.subject암억제유전자-
dc.subject암유발유전자-
dc.subject관계 정보-
dc.subject유전자-
dc.subjectCancer-
dc.subjectGene-
dc.subjectRelation Information-
dc.subjectOncogene-
dc.subjectTumor suppressor gene-
dc.subjectBiomarker-
dc.subject정보 추출-
dc.subject-
dc.titleRelation information extraction using a comprehensive representation scheme: applications to oncology = 포괄적 표현법을 활용한 관계 정보 추출: 종양학에의 응용-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN591849/325007 -
dc.description.department한국과학기술원 : 전산학과, -
dc.identifier.uid020057498-
dc.contributor.localauthorPark, Jong-Cheol-
dc.contributor.localauthor박종철-
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0