DSpace at KOASAS: A study on Korean connected digit recognition and short-term cepstral mean normalization

DSpace at KOASAS

College of Engineering(공과대학)KAIST-ICC School of Engineering-Theses_Master(공학부 석사논문)

A study on Korean connected digit recognition and short-term cepstral mean normalization한국어 연속 숫자 음성 인식과 단구간 켑스트럼 평균 정규화에 관한 연구

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 661
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	Hahn, Min-Soo	-
dc.contributor.advisor	한민수	-
dc.contributor.author	Kim, Sang-Jin	-
dc.contributor.author	김상진	-
dc.date.accessioned	2011-12-28T02:54:56Z	-
dc.date.available	2011-12-28T02:54:56Z	-
dc.date.issued	2002	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392127&flag=dissertation	-
dc.identifier.uri	http://hdl.handle.net/10203/54775	-
dc.description	학위논문(석사) - 한국정보통신대학원대학교 : 공학부, 2002, [ xi, 97 p. ]	-
dc.description.abstract	Although many researchers have studied about digit recognition, it is still away from commercial applications in Korea. It is well known that Korean digit recognition is more difficult than English digit recognition, even worse in continuous digits. In this paper, I studied about various techniques to improve the recognition, especially one of the environmental compensation preprocessing methods, called the cepstral mean normalization, with some acoustic-phonetic models. I found that the recognition results varied depending on the windows size for the cepstral mean normalization, and not always the long-term cepstral mean normalization produces the best results. This can be interpreted as if we use the short-term cepstral mean normalization technique with a proper window size for Korean digit recognition, we can get the better results than the conventional cepstral mean normalization. The reason could be the variation of the phone length caused by the short-term cepstral mean normalization, and this variation is believed to improve the recognition rate. Monophone, triphone, whole-word, tri-word, and phonological-rule- considered digit models in Korean pronunciation, are tested in various numbers of states and mixtures. Mel-frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP) cepstral coefficients are extracted as the feature vectors. Long-term and short-term cepstral mean normalization/ subtraction(CMN/CMS) processing, and relative spectral (RASTA) processing is used for the channel noise compensation. Kalman filtering is applied for additive noise reduction. Linear discriminant analysis (LDA) transformation for the digit recognition is also tested in the end.	eng
dc.language	eng	-
dc.publisher	한국정보통신대학원대학교	-
dc.subject	Short-Term Cepstral	-
dc.subject	Connected Digit Recognition	-
dc.subject	인식 시스템	-
dc.subject	연속 숫자 음성 인식	-
dc.subject	ST-CMN	-
dc.title	A study on Korean connected digit recognition and short-term cepstral mean normalization	-
dc.title.alternative	한국어 연속 숫자 음성 인식과 단구간 켑스트럼 평균 정규화에 관한 연구	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	392127/225023	-
dc.description.department	한국정보통신대학원대학교 : 공학부,	-
dc.identifier.uid	020003853	-
dc.contributor.localauthor	Hahn, Min-Soo	-
dc.contributor.localauthor	한민수	-

Appears in Collection: School of Engineering-Theses_Master(공학부 석사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

A study on Korean connected digit recognition and short-term cepstral mean normalization한국어 연속 숫자 음성 인식과 단구간 켑스트럼 평균 정규화에 관한 연구

KOASAS

Communities & Collections