Robust speech recognition under noisy environments using Lombard effect compensation and dynamic characteristic롬바드 효과의 보정과 동적특성을 이용한 잡음환경에 강인한 음성인식

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 828
  • Download : 0
Although speech recognition systems in artificially constrained conditions have already reached high levels of performance, they degrade dramatically when they are applied in the real world, particularly in noisy environments. In noisy environments human speech productions are influenced by noises (Lombard effect) and speech signals are contaminated in ways which affect the performance of speech recognition systems. This thesis describes a Lombard effect compensation and noise suppression method of improving speech recognition performance in noisy environments. First of all, to become familiar with the characteristics of speech affected by the Lombard effect, several features of Lombard speech were investigated. We found that vocal intensity was varied by the Lombard effect, and that the Lombard effect had a statistically significant influence on energy below 500Hz low-band spectral tilt, and the center of gravity determining the spectral structure of speech. Since speech production variations due to the Lombard effect depends on the intensity of the Lombard effect, we formulated a quantitative measure of the Lombard effect level so as to model it more explicitly. Statistical tests were used to discover those features that represent the Lombard effect. A speech degradation model is proposed in order to characterize the distortions of speech in noise and under the Lombard effect. Variations in formant location, formant bandwidth, pitch, spectral tilt, and energy in each frequency band under the Lombard effect are represented by frequency warping and amplitude scaling of each frequency band. Another Lombard effect, the variation of vocal intensity is modeled by a multiplication term depending on the energy of the input speech. Noise contamination is represented by an additive term in the frequency domain. The distortions of noisy Lombard speech are then canceled out according to the speech degradation model. First, spectral subtraction is used to suppress the...
Advisors
Oh, Yung-Hwanresearcher오영환researcher
Description
한국과학기술원 : 전산학과,
Publisher
한국과학기술원
Issue Date
1998
Identifier
143502/325007 / 000935359
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전산학과, 1998.8, [ 3, 93 p. ]

Keywords

Noise suppression; Speech recongnition; Lombard effect compensation; 롬바드 효과의 보정; 잡음 제거; 음성인식

URI
http://hdl.handle.net/10203/33115
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=143502&flag=dissertation
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0