Feature vector classification based on likelihood ratio for speaker identification우도 비 계산 기반 특징 벡터 선택을 통한 화자 식별

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 590
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorOh, Yung-Hwan-
dc.contributor.advisor오영환-
dc.contributor.authorYoon, Sang-Min-
dc.contributor.author윤상민-
dc.date.accessioned2011-12-13T06:07:21Z-
dc.date.available2011-12-13T06:07:21Z-
dc.date.issued2008-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=297259&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/34814-
dc.description학위논문(석사) - 한국과학기술원 : 전산학전공, 2008.2, [ iv, 34 p. ]-
dc.description.abstractThis paper describes a new feature vector classification method for speaker identification. Speaker identification system selects the speaker who has the highest likelihood when the test utterance is given. Speaker identification system is composed of two distinct phase, training phase and test phase. In training phase, feature vectors are extracted from every speaker’s training data. Then, speaker models are constructed from each speaker’s feature vectors. In test phase, feature vectors are extracted from the test utterance. Then, the speaker model which has the maximum likelihood with the given feature vectors is selected. In general, similar feature vectors are included in different speakers’ training set because of acoustically similar features between speakers, background silence and environment noise. These similar feature vectors cause the overlap of speaker models which contribute to decision errors. As the more speakers are enrolled, the overlapped regions become bigger. Hence it is important to reduce the effect of the overlapped regions. Recently, a feature vector selection method was proposed to mitigate overlap effect. In this system, they classified feature vectors from training data into two categories, non-overlapped and overlapped. Using these separated feature vectors, they constructed non-overlapped and overlapped speaker models for each speaker respectively. In test phase, they only used feature vectors from test utterance which have the maximum likelihood with non-overlapped speaker models. By using this method the system can use only robust feature vectors and it can have better accuracy when speaker models are overlapped. However, a drawback of the previous method is that they didn’t consider the source causing the overlap. If there are more overlapped feature vectors than non-overlapped ones and most of them are caused by acoustic similarity between speakers, the system accuracy will be lowered than conventional method. In this paper...eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectSpeaker identification-
dc.subjectFeature vector-
dc.subjectLikelihood ratio-
dc.subjectOverlap region-
dc.subjectSpeaker model-
dc.subject화자 식별-
dc.subject특징 벡터-
dc.subject우도 비-
dc.subject중첩 구간-
dc.subject화자 모델-
dc.subjectSpeaker identification-
dc.subjectFeature vector-
dc.subjectLikelihood ratio-
dc.subjectOverlap region-
dc.subjectSpeaker model-
dc.subject화자 식별-
dc.subject특징 벡터-
dc.subject우도 비-
dc.subject중첩 구간-
dc.subject화자 모델-
dc.titleFeature vector classification based on likelihood ratio for speaker identification-
dc.title.alternative우도 비 계산 기반 특징 벡터 선택을 통한 화자 식별-
dc.typeThesis(Master)-
dc.identifier.CNRN297259/325007 -
dc.description.department한국과학기술원 : 전산학전공, -
dc.identifier.uid020063346-
dc.contributor.localauthorOh, Yung-Hwan-
dc.contributor.localauthor오영환-
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0