GMM based speaker identificaition utilizing pitch information and weighted filter bank analysis피치 정보 및 DWFBA를 이용한 GMM 기반의 화자 식별

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 502
  • Download : 0
The two major factors affecting speaker identification performance are the degradations introduced by noisy communication channels and mismatch between the training and the testing data properties. During the last several years, Gaussian Mixture Models (GMMs) have become very popular in speaker identification systems and have proven to perform very well for clean wideband speech. However, in noisy environments or for noisy band-limited telephone speech, the performance degrades considerably. It is also well known that speaker’s voice always changes over time because of the varying factors such as verbal usage, vocal tract, mood, and health. In this paper, to cope with the mismatches, we proposed the use of prosodic features such as the mean pitch value in voiced intervals while the weighted filter bank analysis (WFBA) is adopted to increase the discriminating capability of mel frequency cepstral coefficients (MFCCs) for speaker identification. In addition, this thesis includes an exhaustive study on several environments and their combinations in order to produce the most robust speaker identification results. The DWFBA method shows 2.77%~4.65% error reduction rate, added pitch information utilization method produces 21.62%~45.39% error reduction rate and combined DWFBA and pitch information utilizing method produces 31.35%~45.39% error reduction rate comparing to the baseline Gaussian Mixture Model.
Advisors
Hahn, Min-Sooresearcher한민수researcher
Description
한국정보통신대학교 : 공학부,
Publisher
한국정보통신대학교
Issue Date
2004
Identifier
392344/225023 / 020024049
Language
eng
Description

학위논문(석사) - 한국정보통신대학교 : 공학부, 2004, [ vii, 48 p. ]

Keywords

GMM; Weighted filter bank analysis

URI
http://hdl.handle.net/10203/55260
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392344&flag=dissertation
Appears in Collection
School of Engineering-Theses_Master(공학부 석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0