DSpace at KOASAS: Adaptive maximum entropy regularization for connectionist temporal classification

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Master(석사논문)

Adaptive maximum entropy regularization for connectionist temporal classification연결주의적 시간 분류의 개선을 위한 적응형 최대 엔트로피 정규화

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 3
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	유창동	-
dc.contributor.author	Eom, SooHwan	-
dc.contributor.author	엄수환	-
dc.date.accessioned	2024-07-30T19:31:35Z	-
dc.date.available	2024-07-30T19:31:35Z	-
dc.date.issued	2024	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097204&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/321632	-
dc.description	학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2024.2,[iv, 44 p. :]	-
dc.description.abstract	This dissertation focuses on Connectionist Temporal Classification (CTC), a fundamental sequence-to-sequence learning method that leverages dynamic programming for mapping input to output sequences. While CTC has played a pivotal role in sequence learning tasks such as automatic speech recognition (ASR) and optical character recognition (OCR), it is hindered by a persistent challenge—its tendency to generate overly narrow output predictions. To mitigate this challenge EnCTC incorporated an entropy maximization-based regularization term alongside the CTC loss. While EnCTC demonstrated its effectiveness in optical character recognition, it introduced a constant weighting factor for the regularization term during training, which could enforce unnecessary ambiguity even for correct predictions in the later stages of training and affect the overall performance. To address this issue, we present Adaptive Maximum Entropy Regularization (AdaMER), a novel approach that dynamically adjusts the impact of entropy regularization throughout the training process. This adjustment is achieved through the use of a gradient-based learnable parameter that serves as the regularization weighting factor. Our experiments, conducted on the LibriSpeech corpus and various OCR benchmark real-world datasets, provide empirical evidence of the efficacy of AdaMER in addressing the challenges associated with CTC-based sequence learning, ultimately improving model performance.	-
dc.language	eng	-
dc.publisher	한국과학기술원	-
dc.subject	Artificial Intelligence▼aDeep Learning▼aAutomatic Speech Recognition▼aOptical Character Recognition▼aConnectionist Temporal Classification	-
dc.subject	인공 지능▼a심층 학습▼a음성 인식▼a문자 인식▼a연결주의적 시간 분류	-
dc.title	Adaptive maximum entropy regularization for connectionist temporal classification	-
dc.title.alternative	연결주의적 시간 분류의 개선을 위한 적응형 최대 엔트로피 정규화	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	325007	-
dc.description.department	한국과학기술원 :전기및전자공학부,	-
dc.contributor.alternativeauthor	Yoo, Chang D	-

Appears in Collection: EE-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Adaptive maximum entropy regularization for connectionist temporal classification연결주의적 시간 분류의 개선을 위한 적응형 최대 엔트로피 정규화

KOASAS

Communities & Collections