DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 주영석 | - |
dc.contributor.advisor | Ju, Young Seok | - |
dc.contributor.advisor | 한동수 | - |
dc.contributor.author | Jung, Young-mok | - |
dc.contributor.author | 정영목 | - |
dc.date.accessioned | 2024-08-08T19:31:31Z | - |
dc.date.available | 2024-08-08T19:31:31Z | - |
dc.date.issued | 2024 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1100029&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/322129 | - |
dc.description | 학위논문(박사) - 한국과학기술원 : 전기및전자공학부, 2024.2,[vi, 69 p. :] | - |
dc.description.abstract | As DNA sequencing technologies advance, the need for precise yet cost-effective genome analysis pipelines becomes increasingly vital. This dissertation unveils novel methodologies leveraging artificial intelligence (AI) and machine learning (ML) to enhance the two most critical steps in the genome analysis pipeline: read alignment and variant calling. Initially, we present BWA-MEME, an ML-augmented read alignment software. By employing learned indices, this software enhances the exact match search during the seeding phase—addressing a significant bottleneck in short-read alignment. Subsequently, we address challenges inherent to deep learning-based variant callers. These challenges encompass their reliance on vast labeled datasets and their susceptibility to diverse error profiles presented by different sequencing techniques. We devise a semi-supervised training approach that not only utilizes unlabeled data to learn error profiles but also incorporates a domain adaptation technique to minimize discrepancies arising from diverse error profiles. Together, these methods carve out novel pathways in read alignment and variant calling, underscoring the transformative potential of AI and ML within the genome analysis pipeline. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | 인공지능▼a머신러닝▼a고성능 컴퓨팅▼a유전체▼a바이오인포매틱스 | - |
dc.subject | Artificial intelligence▼aMachine learning▼aHigh-performance computing▼aGenomics▼aBioinformatics | - |
dc.title | Enhancing genome analysis pipeline with AI and ML | - |
dc.title.alternative | 인공지능/머신러닝을 이용한 유전체 분석 파이프라인 향상에 관한 연구 | - |
dc.type | Thesis(Ph.D) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 :전기및전자공학부, | - |
dc.contributor.alternativeauthor | Han, Dongsu | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.