DSpace at KOASAS: Robust speech recognition under noisy environments using Lombard effect compensation and dynamic characteristic

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Theses_Ph.D.(박사논문)

Robust speech recognition under noisy environments using Lombard effect compensation and dynamic characteristic롬바드 효과의 보정과 동적특성을 이용한 잡음환경에 강인한 음성인식

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 828
Download : 0

Export

Chi, Sang-Mun / 지상문

Although speech recognition systems in artificially constrained conditions have already reached high levels of performance, they degrade dramatically when they are applied in the real world, particularly in noisy environments. In noisy environments human speech productions are influenced by noises (Lombard effect) and speech signals are contaminated in ways which affect the performance of speech recognition systems. This thesis describes a Lombard effect compensation and noise suppression method of improving speech recognition performance in noisy environments. First of all, to become familiar with the characteristics of speech affected by the Lombard effect, several features of Lombard speech were investigated. We found that vocal intensity was varied by the Lombard effect, and that the Lombard effect had a statistically significant influence on energy below 500Hz low-band spectral tilt, and the center of gravity determining the spectral structure of speech. Since speech production variations due to the Lombard effect depends on the intensity of the Lombard effect, we formulated a quantitative measure of the Lombard effect level so as to model it more explicitly. Statistical tests were used to discover those features that represent the Lombard effect. A speech degradation model is proposed in order to characterize the distortions of speech in noise and under the Lombard effect. Variations in formant location, formant bandwidth, pitch, spectral tilt, and energy in each frequency band under the Lombard effect are represented by frequency warping and amplitude scaling of each frequency band. Another Lombard effect, the variation of vocal intensity is modeled by a multiplication term depending on the energy of the input speech. Noise contamination is represented by an additive term in the frequency domain. The distortions of noisy Lombard speech are then canceled out according to the speech degradation model. First, spectral subtraction is used to suppress the...

Advisors: Oh, Yung-Hwan researcher; 오영환 researcher

Description: 한국과학기술원 : 전산학과,

Publisher: 한국과학기술원

Issue Date: 1998

Identifier: 143502/325007 / 000935359

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전산학과, 1998.8, [ 3, 93 p. ]

Keywords: Noise suppression; Speech recongnition; Lombard effect compensation; 롬바드 효과의 보정; 잡음 제거; 음성인식

URI: http://hdl.handle.net/10203/33115

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=143502&flag=dissertation

Appears in Collection: CS-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Robust speech recognition under noisy environments using Lombard effect compensation and dynamic characteristic롬바드 효과의 보정과 동적특성을 이용한 잡음환경에 강인한 음성인식

KOASAS

Communities & Collections