Statistical approaches to acoustic modeling in speech recognition based on discrete HMM이산분포 HMM을 이용한 음성인식에서의 음향학적 모델링을 위한 통계적 접근방법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 342
  • Download : 0
Recently, hidden Markov model (HMM) has become the predominant approach to speech recognition. The performance of an HMM-based recognizer depends on the acoustic modeling techniques adopted to represent speech characteristics. In this dissertation work, we present various methods to improve acoustic modeling capability in a discrete HMM-based speech recognition system. In order to evaluate the recognition performance for each method, we established two kinds of baseline speech recognition systems. One is an isolated word recognition system in which the vocabulary consists of 75 words, and the other is a continuous speech recognition system with a vocabulary of 102 words. First, we study various methods to estimate robust output probability distributions (PD``s) with insufficient training data. We take the approach to interpolate an output PD with some general PD``s which are able to strengthen the robustness with respect to unseen data. For this approach, we propose a statistically reliable deleted interpolation (DI) method which improves the DI technique conventionally used for robust PD interpolation. On the other hand, in order to enhance the robustness of a PD against unseen data, we study two approaches: smoothing and tying. We introduce a new algorithm to smooth a PD where the smoothing matrix is estimated by following the strategy of cross-validation as in DI. Furthermore, the smoothing method called the deleted smoothing (DS) method resembles the probabilistic mapping used for speaker adaptation. As for tying, we derive a number of state clusters based on a clustering tree which achieves a good compromise between robustness and detail of the tied PD``s in specifying speech characteristics. In addition to providing an efficient method for constructing the clustering tree, we propose a measure which accounts for the variability of estimated PD``s under various situations. In isolated word recognition experiments, the proposed methods reduce the error rate...
Advisors
Un, Chong-Kwanresearcher은종관researcher
Description
한국과학기술원 : 전기 및 전자공학과,
Publisher
한국과학기술원
Issue Date
1994
Identifier
69666/325007 / 000885044
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전기 및 전자공학과, 1994.2, [ vi, 130 p. ]

URI
http://hdl.handle.net/10203/36238
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=69666&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0