DSpace at KOASAS: Speech recognition using hybrid time-delay neural network/hidden markov model

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Ph.D.(박사논문)

Speech recognition using hybrid time-delay neural network/hidden markov model혼성 time-delay 신경회로망/hidden markov model을 이용한 음성인식

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 400
Download : 0

Export

Jang, Choon-Seo / 장춘서

In this dissertation, we porpose a hybrid modular time-delay neural network (TDNN)/hidden Markov model(HMM) architecture and a new parameter smoothing method to increase the recognition accuracy of a speech recognition system. In order to obtain the benchmark performances of the porposed methods, we implemented two kinds of baseline speech recognition systems. One is based on HMM and the other is based on TDNN. The database consists of 75 phonetically balanced Korean words, and 44 context-independent phonemes are chosen as the recognition unit. Speaker-independent phonemebased word recognition was done in our experiments. First, a new hybrid modular TDNN/HMM architecture for the speech recognition system is proposed. In this architecture, TDnn and HMM are effectively integrated using the fuzzy mapping concept. Modular construction of TDNN is used in our system to expand the TDNN to handle all phonemes. To deal with the temporal structure of phonemic features, we divide the input layer of our modular TDNN into two states in a time sequence. The first hidden layer of each phoneme subclass network has also a tied-connected window for each state to preserve the time-shift invarance property. This structure allows the networks to capture the temporal structure of phonemic features with two feature detectors. Our system consists of 11 phoneme subclass networks and a vowel/consonant classification network. Each phoneme subclass network is trained separately using the training data from its own set only, and connection weights between layers of subclass networks are retrained. TDNN and HMM are integrated in our system by feeding outpit vectors from the second hidden layer of each phoneme subclass network to HMM. The HMM algorithm is modified to accommodate these outputs. Therefore, our system takes advantage of both TDNN and HMM, and it can treat the temporal structure of phonemic features. Simulation results shows that dividing each layer of subclass networks into two...

Advisors: Un, Chong-Kwan researcher; 은종관 researcher

Description: 한국과학기술원 : 전기 및 전자공학과,

Publisher: 한국과학기술원

Issue Date: 1993

Identifier: 68159/325007 / 000785560

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전기 및 전자공학과, 1993.8, [ v, 104 p. ]

URI: http://hdl.handle.net/10203/36185

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=68159&flag=dissertation

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Speech recognition using hybrid time-delay neural network/hidden markov model혼성 time-delay 신경회로망/hidden markov model을 이용한 음성인식

KOASAS

Communities & Collections