DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kil, Rhee-Man | - |
dc.contributor.advisor | 길이만 | - |
dc.contributor.author | Kim, Su-Youn | - |
dc.contributor.author | 김수연 | - |
dc.date.accessioned | 2011-12-14T04:54:08Z | - |
dc.date.available | 2011-12-14T04:54:08Z | - |
dc.date.issued | 2002 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=173583&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/42044 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 응용수학전공, 2002.2, [ [iv], 47 p. ] | - |
dc.description.abstract | Research in automatic speech recognition (ASR) has been investigated for almost four decades. Especially, the methodology of hidden Markov models (HMMs) proposed in early 1980s, and artificial neural networks (ANNs) introduced in the late 1980s, have become major tools for speech recognition. The HMMs are widely used due to a natural and highly reliable way of recognizing speech for a wide range of applications. However the models have some limitations from the strong constraints: 1) the non-discriminative training/decoding criterion such as the maximum likelihood measure, 2) arbitrary assumptions on the parametric form of probability distribution, and 3) high sensitivity to environmental conditions due to the lack of generalization. On the other hand, connectionist architectures have recently been recognized as an alternative tool for pattern classification and speech recognition problems. Their main properties are their discriminant power and capability to learn and represent implicit knowledge. However, neural networks to date are still not well-suited for dealing with time-varying input patterns and segmentation of sequential inputs. In this sense, an idea of combining HMM and ANNs within a single, novel model, broadly known as hybrid HMM/ANN, has been suggested. This thesis suggest a work in this direction, are new method of combining HMM and radial basis function network (RBFN), one of regression models in ANNs. The suggested model is trained to estimate the posterior emission probabilities of HMM states for the given an acoustic observation sequence. For the training of this model, a two-phase discriminative training technique is used to optimize the RBF parameters. In the first phase of training, the hybrid system is trained by the RLS algorithm, optimizing a mean squared error (MSE) function for the desired and actual posterior emission probabilities. This results in an approximation of posterior emission probabilities of HMM states. In the second phas... | eng |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | hybrid HMM/RBFN | - |
dc.subject | 결합 HMM/RBFN 시스템 | - |
dc.title | (A) hybrid HMM/RBFN system for automatic speech recognition | - |
dc.title.alternative | 음성인식을 위한 결합 HMM/RBFN system | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 173583/325007 | - |
dc.description.department | 한국과학기술원 : 응용수학전공, | - |
dc.identifier.uid | 020003095 | - |
dc.contributor.localauthor | Kil, Rhee-Man | - |
dc.contributor.localauthor | 길이만 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.