Enhancing genome analysis pipeline with AI and ML인공지능/머신러닝을 이용한 유전체 분석 파이프라인 향상에 관한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 2
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisor주영석-
dc.contributor.advisorJu, Young Seok-
dc.contributor.advisor한동수-
dc.contributor.authorJung, Young-mok-
dc.contributor.author정영목-
dc.date.accessioned2024-08-08T19:31:31Z-
dc.date.available2024-08-08T19:31:31Z-
dc.date.issued2024-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1100029&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/322129-
dc.description학위논문(박사) - 한국과학기술원 : 전기및전자공학부, 2024.2,[vi, 69 p. :]-
dc.description.abstractAs DNA sequencing technologies advance, the need for precise yet cost-effective genome analysis pipelines becomes increasingly vital. This dissertation unveils novel methodologies leveraging artificial intelligence (AI) and machine learning (ML) to enhance the two most critical steps in the genome analysis pipeline: read alignment and variant calling. Initially, we present BWA-MEME, an ML-augmented read alignment software. By employing learned indices, this software enhances the exact match search during the seeding phase—addressing a significant bottleneck in short-read alignment. Subsequently, we address challenges inherent to deep learning-based variant callers. These challenges encompass their reliance on vast labeled datasets and their susceptibility to diverse error profiles presented by different sequencing techniques. We devise a semi-supervised training approach that not only utilizes unlabeled data to learn error profiles but also incorporates a domain adaptation technique to minimize discrepancies arising from diverse error profiles. Together, these methods carve out novel pathways in read alignment and variant calling, underscoring the transformative potential of AI and ML within the genome analysis pipeline.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subject인공지능▼a머신러닝▼a고성능 컴퓨팅▼a유전체▼a바이오인포매틱스-
dc.subjectArtificial intelligence▼aMachine learning▼aHigh-performance computing▼aGenomics▼aBioinformatics-
dc.titleEnhancing genome analysis pipeline with AI and ML-
dc.title.alternative인공지능/머신러닝을 이용한 유전체 분석 파이프라인 향상에 관한 연구-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :전기및전자공학부,-
dc.contributor.alternativeauthorHan, Dongsu-
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0