DSpace at KOASAS: Performance improvement of speech recognition using segmental information in speech signal

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Ph.D.(박사논문)

Performance improvement of speech recognition using segmental information in speech signal음성 신호의 부분 정보를 이용한 음성인식 성능 향상

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 376
Download : 0

Export

Kim, Hoi-Rin / 김회린

In this dissertation, we propose several methods to improve recognition accuracy of a hidden Markov model(HMM)-based speech recognition system by using segmental information of speech signal. As the segmental information, we use the HMM state segments which possess common stochastic characteristics of speech signal. Using the segmental information, we propose a modified corrective training algorithm which could improve the discrimination ability of HMMs. Then a new HMM parameter estimation algorithm and a new post-processor are proposed to reduce training and recognition time as well as to improve recognition accuracy. In order to obtain benchmark performances of the proposed algorithms, we implemented two baseline speech recognition systems based on phoneme-like units: one is a speaker-dependent system for 100 phonetically-balanced Korean words and the other is a speaker-independent system for 75 phonetically-balanced Korean words. First, we present a modified corrective training algorithm using HMM state segment information. The modified corrective training method corrects the HMM parameters using the segmental k-means algorithm instead of the forward-backward algorithm used in the conventional corrective training method. It is motivated from the fact that the segmental k-means algorithm has more emphasis on the model state segment information. Applying this method to the speaker-dependent baseline system, we observe that the proposed method results in higher recognition accuracy than the conventional method. That is, the phoneme and word, recognition accuracies in the conventional method are 72.5% and 89%, respectively, and those in the proposed method are 74.9% and 93%, respectively. Also, the proposed method requires much less computation time than the conventional method in training process. Second, a fuzzy segmental k-means(FSKM) algorithm for the HMM parameter re-estimation is proposed. A fuzzy vector quantization(FVQ)-based HMM (FVQ/HMM) scheme requir...

Advisors: Lee, Hwang-Soo researcher; 이황수 researcher

Description: 한국과학기술원 : 전기및 전자공학과,

Publisher: 한국과학기술원

Issue Date: 1992

Identifier: 59818/325007 / 000855106

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전기및 전자공학과, 1992.2, [ x, 105 p. ]

URI: http://hdl.handle.net/10203/35655

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=59818&flag=dissertation

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Performance improvement of speech recognition using segmental information in speech signal음성 신호의 부분 정보를 이용한 음성인식 성능 향상

KOASAS

Communities & Collections