Data mining for characterizing protein-protein interaction interfaces = 데이터 마이닝 기법을 적용한 단백질 분자 사이의 상호작용 특성에 대한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 449
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorLee, Kwang-H.-
dc.contributor.advisor이광형-
dc.contributor.advisorKim, Dong-Sup-
dc.contributor.advisor김동섭-
dc.contributor.authorCho, Kyu-il-
dc.contributor.author조규일-
dc.date.accessioned2011-12-12T07:25:32Z-
dc.date.available2011-12-12T07:25:32Z-
dc.date.issued2007-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=268685&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/27058-
dc.description학위논문(박사) - 한국과학기술원 : 바이오및뇌공학과, 2007. 8, [ xviii, 112 p. ]-
dc.description.abstractThe main objective of this thesis is to suggest a new perspective to effectively describe protein-protein interaction interfaces in terms of data mining. Two systematic approaches are applied to answer the question of what features are effective in representing the interaction interfaces. One is proceeded in the aspect of specificity, and the other is performed in the aspect of stability, which are two critical factors determining the binding association. Firstly, specificity of molecular interactions is examined in the context of protein functions. This is the first approach to analyze interaction interfaces at the molecular interaction level in the context of protein functions. We perform systematic analysis at the molecular interaction level using classification and feature subset selection technique prevalent in the field of pattern recognition. To represent the physicochemical properties of protein-protein interfaces, we design 18 molecular interaction types using canonical and non-canonical interactions. Then, we construct input vectors using the frequency of each interaction type in protein-protein interfaces. The 131 interfaces of transient protein-protein heterocomplexes in PDB is analyzed : 33 protease-inhibitors, 52 antibody-antigens, and 46 signaling proteins including 4 cyclin dependent kinase and 26 G-protein. Using kNN classification and feature subset selection techniques, we clearly show that there are specific interaction types according to their functional categories, and such interaction types are conserved through the common binding mechanism, rather than through the sequence or structure conservation. The $C^\alpha-H\cdots O=C$ shows binding specificity for protease-inhibitor complexes, while cation-anion interaction is predominant in signaling complexes. In the case of antibody-antigen complexes, the sign is somewhat ambiguous. From the evolutionary perspective, while protease-inhibitors and signaling proteins have optimized their int...eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subject데이터 마이닝-
dc.subject단백질-
dc.subject상호작용면-
dc.subject특이성-
dc.subject분자수준-
dc.subject고밀도 구조-
dc.subject단량체-
dc.subject핫스팟 아미노산-
dc.subject통계분석-
dc.subject특징추출-
dc.subject특징분류-
dc.subject주변량 분석-
dc.subjectData mining-
dc.subjectprotein-
dc.subjectinterface-
dc.subjectspecificity-
dc.subjectmolecular interaction-
dc.subjectdensely structured organization-
dc.subjectunbounded state-
dc.subjecthot spot residues-
dc.subjectMann-Whitney-
dc.subjectclassification-
dc.subjectfeature selection-
dc.subjectSVMs-
dc.subjectkNN-
dc.subjectSVD-
dc.subjectPCA-
dc.subjectDecision tree-
dc.subject데이터 마이닝-
dc.subject단백질-
dc.subject상호작용면-
dc.subject특이성-
dc.subject분자수준-
dc.subject고밀도 구조-
dc.subject단량체-
dc.subject핫스팟 아미노산-
dc.subject통계분석-
dc.subject특징추출-
dc.subject특징분류-
dc.subject주변량 분석-
dc.subjectData mining-
dc.subjectprotein-
dc.subjectinterface-
dc.subjectspecificity-
dc.subjectmolecular interaction-
dc.subjectdensely structured organization-
dc.subjectunbounded state-
dc.subjecthot spot residues-
dc.subjectMann-Whitney-
dc.subjectclassification-
dc.subjectfeature selection-
dc.subjectSVMs-
dc.subjectkNN-
dc.subjectSVD-
dc.subjectPCA-
dc.subjectDecision tree-
dc.titleData mining for characterizing protein-protein interaction interfaces = 데이터 마이닝 기법을 적용한 단백질 분자 사이의 상호작용 특성에 대한 연구-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN268685/325007 -
dc.description.department한국과학기술원 : 바이오및뇌공학과, -
dc.identifier.uid020025881-
dc.contributor.localauthorLee, Kwang-H.-
dc.contributor.localauthor이광형-
dc.contributor.localauthorKim, Dong-Sup-
dc.contributor.localauthor김동섭-
Appears in Collection
BiS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0