DSpace at KOASAS: Text-independent speaker recognition system robust to noisy environments

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Ph.D.(박사논문)

Text-independent speaker recognition system robust to noisy environments소음 환경에 강인한 문장 독립형 화자인식 시스템

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 375
Download : 0

Export

Kyung, Youn-Jeong / 경연정

In this dissertation work, three methods are proposed to improve the performance of speaker recognition systems in noisy environments such as car noise and white Gaussian noise. To construct the automatic speaker recognition (ASR) system robust to environmental noise, we consider both features and system modeling methods. First, we propose to use prosodic features which represent micro prosody of utterances for speaker recognition. In the case of the background noise, prosodic features and speaking style do not change in contrast with spectral features. The spectral features degrade in noisy environments but the prosodic features are robust. We use the micro prosody which is modeled by segmental pitch contour. Therefore, the codebook is constructed from the segmental pitch contours. Second, the bootstrap and aggregating vector quantization (VQ) model is proposed. In training procedure, new training sets are made from the original training set by bootstrapping. One codebook is formed from each new training set. Each VQ model from the new training set is used for speaker recognition. Finally, the speaker is identified by aggregating the results of all VQ models. We investigate the unstability of VQ model to apply the bootstrap and aggregating method. Although the bagging VQ model improves the recognition rates significantly, it requires larger memory than the conventional VQ model. Therefore, we propose the probability codebook design method for reducing the additional memory by bagging VQ model. This method uses only one universal codebook for all speakers. Finally, we propose the independent components analysis (ICA) mixture model for ASR. The first step of the algorithm is to extract the basis vectors from each speaker. The second step is to compute the probability for each ICA class given test data. The third step is to decide the speaker who has the largest probability of ICA. To improve the recognition rates, we assign the number of basis vectors used for e...

Advisors: Lee, Hwang-Soo researcher; 이황수 researcher

Description: 한국과학기술원 : 전기및전자공학전공,

Publisher: 한국과학기술원

Issue Date: 2000

Identifier: 157628/325007 / 000949519

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전기및전자공학전공, 2000.2, [ 100 p. ]

Keywords: ICA; Prosody; VQ model; Speaker recognition; Bootstrap; 벡터 양자화; 부트스트랩; 독립성분분석; 운율; 화자인식

URI: http://hdl.handle.net/10203/35834

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=157628&flag=dissertation

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Text-independent speaker recognition system robust to noisy environments소음 환경에 강인한 문장 독립형 화자인식 시스템

KOASAS

Communities & Collections