Feature vector classification based on likelihood ratio for speaker identification우도 비 계산 기반 특징 벡터 선택을 통한 화자 식별

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 587
  • Download : 0
This paper describes a new feature vector classification method for speaker identification. Speaker identification system selects the speaker who has the highest likelihood when the test utterance is given. Speaker identification system is composed of two distinct phase, training phase and test phase. In training phase, feature vectors are extracted from every speaker’s training data. Then, speaker models are constructed from each speaker’s feature vectors. In test phase, feature vectors are extracted from the test utterance. Then, the speaker model which has the maximum likelihood with the given feature vectors is selected. In general, similar feature vectors are included in different speakers’ training set because of acoustically similar features between speakers, background silence and environment noise. These similar feature vectors cause the overlap of speaker models which contribute to decision errors. As the more speakers are enrolled, the overlapped regions become bigger. Hence it is important to reduce the effect of the overlapped regions. Recently, a feature vector selection method was proposed to mitigate overlap effect. In this system, they classified feature vectors from training data into two categories, non-overlapped and overlapped. Using these separated feature vectors, they constructed non-overlapped and overlapped speaker models for each speaker respectively. In test phase, they only used feature vectors from test utterance which have the maximum likelihood with non-overlapped speaker models. By using this method the system can use only robust feature vectors and it can have better accuracy when speaker models are overlapped. However, a drawback of the previous method is that they didn’t consider the source causing the overlap. If there are more overlapped feature vectors than non-overlapped ones and most of them are caused by acoustic similarity between speakers, the system accuracy will be lowered than conventional method. In this paper...
Advisors
Oh, Yung-Hwanresearcher오영환researcher
Description
한국과학기술원 : 전산학전공,
Publisher
한국과학기술원
Issue Date
2008
Identifier
297259/325007  / 020063346
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학전공, 2008.2, [ iv, 34 p. ]

Keywords

Speaker identification; Feature vector; Likelihood ratio; Overlap region; Speaker model; 화자 식별; 특징 벡터; 우도 비; 중첩 구간; 화자 모델; Speaker identification; Feature vector; Likelihood ratio; Overlap region; Speaker model; 화자 식별; 특징 벡터; 우도 비; 중첩 구간; 화자 모델

URI
http://hdl.handle.net/10203/34814
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=297259&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0