Adaptive maximum entropy regularization for connectionist temporal classification
(연결주의적 시간 분류의 개선을 위한 적응형 최대 엔트로피 정규화)

DC Field: Value (Language)
dc.contributor.advisor: 유창동 (Yoo, Chang D)
dc.contributor.author: Eom, SooHwan
dc.contributor.author: 엄수환
dc.date.accessioned: 2024-07-30T19:31:35Z
dc.date.available: 2024-07-30T19:31:35Z
dc.date.issued: 2024
dc.identifier.uri: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097204&flag=dissertation (en_US)
dc.identifier.uri: http://hdl.handle.net/10203/321632
dc.description: Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST) : School of Electrical Engineering, 2024.2, [iv, 44 p.]
dc.description.abstract: This dissertation focuses on Connectionist Temporal Classification (CTC), a fundamental sequence-to-sequence learning method that uses dynamic programming to map input sequences to output sequences. CTC has played a pivotal role in sequence learning tasks such as automatic speech recognition (ASR) and optical character recognition (OCR), but it is hindered by a persistent challenge: its tendency to generate overly narrow output predictions. To mitigate this, EnCTC added an entropy-maximization regularization term to the CTC loss. Although EnCTC proved effective in optical character recognition, it uses a constant weighting factor for the regularization term during training, which can enforce unnecessary ambiguity even on correct predictions in the later stages of training and degrade overall performance. To address this issue, we present Adaptive Maximum Entropy Regularization (AdaMER), a novel approach that dynamically adjusts the impact of entropy regularization throughout training. The adjustment is achieved with a learnable parameter, updated by gradients, that serves as the regularization weighting factor. Our experiments on the LibriSpeech corpus and several real-world OCR benchmark datasets provide empirical evidence that AdaMER addresses these challenges in CTC-based sequence learning and improves model performance.
dc.language: eng
dc.publisher: Korea Advanced Institute of Science and Technology (KAIST)
dc.subject: Artificial Intelligence; Deep Learning; Automatic Speech Recognition; Optical Character Recognition; Connectionist Temporal Classification
dc.subject (translated from Korean): Artificial intelligence; deep learning; speech recognition; character recognition; connectionist temporal classification
dc.title: Adaptive maximum entropy regularization for connectionist temporal classification
dc.title.alternative: 연결주의적 시간 분류의 개선을 위한 적응형 최대 엔트로피 정규화 (Adaptive maximum entropy regularization for improving connectionist temporal classification)
dc.type: Thesis(Master)
dc.identifier.CNRN: 325007
dc.description.department: Korea Advanced Institute of Science and Technology (KAIST) : School of Electrical Engineering
dc.contributor.alternativeauthor: Yoo, Chang D
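
Note on the abstract: it describes adding an entropy term to the CTC loss with a weighting factor. As a rough sketch only, assuming the usual form of an entropy-regularized objective (the symbols \beta, H, \pi, l and x are notation introduced here for illustration and do not appear in this record), the fixed-weight EnCTC-style loss could be written as:

```latex
% Sketch under stated assumptions: x = input sequence, l = target label sequence,
% \pi = an alignment path consistent with l, \beta = regularization weight.
\mathcal{L}_{\mathrm{EnCTC}}(x, l)
  = \mathcal{L}_{\mathrm{CTC}}(x, l)
  - \beta \, H\bigl(p(\pi \mid l, x)\bigr),
\qquad \beta > 0 \ \text{held constant during training}
```

Here H is the entropy of the distribution over alignment paths, so subtracting it rewards less peaked alignments. According to the abstract, AdaMER replaces the constant \beta with a learnable parameter adjusted during training; the record does not state the update rule, so it is not sketched here.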
Appears in Collection
EE-Theses_Master (석사논문, Master's theses)
Files in This Item
There are no files associated with this item.
