DSpace at KOASAS: A fast speaker identification method using HMMs and phonetic GMMS

DSpace at KOASAS

College of Engineering(공과대학)KAIST-ICC School of Engineering-Theses_Master(공학부 석사논문)

A fast speaker identification method using HMMs and phonetic GMMSHMM과 음소별 GMM을 결합한 고속 회자식별 방법

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 1409
Download : 0

Export

Kwon, Suk-Bong / 권석봉

This thesis proposes a fast text-independent speaker identification method using phonetic GMMs. The individual Gaussian component of a GMM can accurately represent acoustic characteristics of a speaker, so the GMM is effective to make a speaker model in text-independent condition. In the text- dependent speaker identification, input speech content for identification is determined a priori. In this application, using the hidden Markov model (HMM) as a speaker model shows better accuracy, since the HMM can model the temporal structure of the input speech as well as the speaker identity. When we build a speaker GMM for text-independent speaker identification, sufficient training data are required to estimate the GMM parameters precisely. On the other hand, the HMM-based text-independent speaker model doesn``t demand so many training data in building the speaker HMMs. In order to combine the advantages of the GMMs and the HMMs in the text-independent speaker identification, we propose a system architecture using phonetic GMMs. The speaker identification using phonetic GMMs uses three different types of models: speaker-independent phone HMM, baseline speaker GMM, phonetic speaker GMM. The HMM is used to get the segmental information of phones the baseline GMM is used to obtain the N-best speakers from all registered speakers, and the phonetic GMM is finally used to find a person who speaks to the system. From the experiments, as the number of mixtures of the baseline GMM is increased to 320, we obtained an identification accuracy similar to that of the phonetic GMM with 14 mixtures for 45 phones, but the time elapsed to identify the speaker was longer five times than that of the phonetic GMM. Hence the phonetic GMMs can save the elapsed time, but the number of parameters is much greater than that of the baseline GMM because of using three mode types. This problem can be overcome more or less by tying phones into some classes. This is based on the fact that the lik...

Advisors: Kim, Hoi-Rin researcher; 김회린 researcher

Description: 한국정보통신대학원대학교 : 공학부,

Publisher: 한국정보통신대학교

Issue Date: 2004

Identifier: 392461/225023 / 020024132

Language: eng

Description: 학위논문(석사) - 한국정보통신대학원대학교 : 공학부, 2004, [ vii, 44 p. ]

Keywords: HMMs and Phonetic GMMS

URI: http://hdl.handle.net/10203/55324

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392461&flag=dissertation

Appears in Collection: School of Engineering-Theses_Master(공학부 석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

A fast speaker identification method using HMMs and phonetic GMMSHMM과 음소별 GMM을 결합한 고속 회자식별 방법

KOASAS

Communities & Collections