DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 유창동 | - |
dc.contributor.author | Eom, SooHwan | - |
dc.contributor.author | 엄수환 | - |
dc.date.accessioned | 2024-07-30T19:31:35Z | - |
dc.date.available | 2024-07-30T19:31:35Z | - |
dc.date.issued | 2024 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097204&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/321632 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2024.2,[iv, 44 p. :] | - |
dc.description.abstract | This dissertation focuses on Connectionist Temporal Classification (CTC), a fundamental sequence-to-sequence learning method that leverages dynamic programming for mapping input to output sequences. While CTC has played a pivotal role in sequence learning tasks such as automatic speech recognition (ASR) and optical character recognition (OCR), it is hindered by a persistent challenge—its tendency to generate overly narrow output predictions. To mitigate this challenge EnCTC incorporated an entropy maximization-based regularization term alongside the CTC loss. While EnCTC demonstrated its effectiveness in optical character recognition, it introduced a constant weighting factor for the regularization term during training, which could enforce unnecessary ambiguity even for correct predictions in the later stages of training and affect the overall performance. To address this issue, we present Adaptive Maximum Entropy Regularization (AdaMER), a novel approach that dynamically adjusts the impact of entropy regularization throughout the training process. This adjustment is achieved through the use of a gradient-based learnable parameter that serves as the regularization weighting factor. Our experiments, conducted on the LibriSpeech corpus and various OCR benchmark real-world datasets, provide empirical evidence of the efficacy of AdaMER in addressing the challenges associated with CTC-based sequence learning, ultimately improving model performance. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Artificial Intelligence▼aDeep Learning▼aAutomatic Speech Recognition▼aOptical Character Recognition▼aConnectionist Temporal Classification | - |
dc.subject | 인공 지능▼a심층 학습▼a음성 인식▼a문자 인식▼a연결주의적 시간 분류 | - |
dc.title | Adaptive maximum entropy regularization for connectionist temporal classification | - |
dc.title.alternative | 연결주의적 시간 분류의 개선을 위한 적응형 최대 엔트로피 정규화 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 :전기및전자공학부, | - |
dc.contributor.alternativeauthor | Yoo, Chang D | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.