Improved speaker recognition with recombination of multi-band features in noisy environments멀티밴드 특징의 재결합을 이용한 잡음환경에서의 화자인식

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 820
  • Download : 0
In this dissertation, we introduce a sub-band feature recombination method and also propose sub-band weighting and sub-band selection. For a more effective feature recombination, the proposed sub-band feature recombination over-comes the ineffective likelihood computation of the conventional feature recombination by using the assumption that each sub-band is independent. In this sub-band feature recombination, sub-band likelihood scores are computed by marginalization from the speaker models given multi-band feature vectors, and it is shown that the performance of this method is shown to be better than the conventional feature recombination for speaker recognition in noisy conditions. For making the sub-band feature recombination more robust to noise, the sub-band weighting and the sub-band selection are proposed. The sub-band weighting coefficients are computed based on the sub-band signal-to-noise ratio which is one of the simple and powerful criteria for sub-band reliability. In the case of applying this sub-band weighting to the sub-band feature recombination, the combination method produces average error reductions of 24.79% and 22.97% over the conventional feature recombination for speaker identification, and also for speaker verification, the average error reduction rates (ERRs) are 29.00% and 24.98% on TIMIT and NTIMIT database. Another method, the sub-band selection, which is based on likelihood scores, is proposed. The likelihood scores can also become a measure of the sub-band reliability. In the case of combining the sub-band selection with the sub-band feature recombination, the error rates are reduced by 25.63% and 9.93% for speaker identification, and for speaker verification, the average ERRs are 30.58% and 25.26% on TIMIT and NTIMIT database, respectively. In previous researches, the relative autocorrelation sequence mel-frequency cepstral coefficient (RAS-MFCC) was proposed as one of the successful features for speaker recognition in noisy env...
Advisors
Kim, Hoi-Rinresearcher김회린researcher
Description
한국정보통신대학교 : 공학부,
Publisher
한국정보통신대학교
Issue Date
2008
Identifier
393025/225023 / 020035901
Language
eng
Description

학위논문(박사) - 한국정보통신대학교 : 공학부, 2008.8, [ xvi, 122 p. ]

Keywords

Multi-Streaming Approach; Sub-Band Reliability; Sub-Band Feature Recombination; Hybrid Feature Representation; 하이브리드 특징벡터 표현; 다중 스트리밍 방법; 서브밴드 신뢰도; 서브밴드 특징벡터 재결합

URI
http://hdl.handle.net/10203/54616
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=393025&flag=dissertation
Appears in Collection
School of Engineering-Theses_Ph.D(공학부 박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0