Compensating acoustic mismatch using class-based histogram equalization for robust speech recognition

Cited 4 time in webofscience Cited 6 time in scopus
  • Hit : 273
  • Download : 718
A new class-based histogram equalization method is proposed for robust speech recognition. The proposed method aims at not only compensating for an acoustic mismatch between training and test environments but also reducing the two fundamental limitations of the conventional histogram equalization method, the discrepancy between the phonetic distributions of training and test speech data, and the nonmonotonic transformation caused by the acoustic mismatch. The algorithm employs multiple class-specific reference and test cumulative distribution functions, classifies noisy test features into their corresponding classes, and equalizes the features by using their corresponding class reference and test distributions. The minimum mean-square error log-spectral amplitude (MMSE-LSA)-based speech enhancement is added just prior to the baseline feature extraction to reduce the corruption by additive noise. The experiments on the Aurora2 database proved the effectiveness of the proposed method by reducing relative errors by 62% over the mel-cepstral-based features and by 23% over the conventional histogram equalization method, respectively.
Publisher
SPRINGER
Issue Date
2007-03
Language
English
Article Type
Article
Keywords

SPECTRAL AMPLITUDE ESTIMATOR; SOFT DECISION; ENHANCEMENT; NOISE; TRANSFORMATIONS; TIME

Citation

EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, v.2007

ISSN
1687-6172
DOI
10.1155/2007/67870
URI
http://hdl.handle.net/10203/23722
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
000245274600001.pdf(1.51 MB)Download
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 4 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0