Speech recognition using quantized LSP parameters and their transformations in digital communication

Cited 14 time in webofscience Cited 0 time in scopus
  • Hit : 479
  • Download : 0
In digital communication networks, speech recognition systems conventionally first reconstruct speech and then extract feature parameters. In this paper, we consider a useful approach of incorporating speech coding parameters into the speech recognizer. Most speech coders employed in digital communication networks use line spectrum pairs (LSPs) as spectral parameters. We introduce two ways to improve the recognition performance of the LSP-based speech recognizer. One is to devise weighted distance measures of LSPs and the other is to transform LSPs into a new feature set, named pseudo-cepstrum (PCEP). The speaker-independent connected-digit recognition experiments based on the discrete hidden Markov model showed that the weighted distance measures provide better recognition accuracy than unweighted ones do. Additionally, a mel-scale PCEP gives an even better performance than the weighted distance measures do. To clarify the performance improvement of the proposed methods, a significance test is introduced. As a result, the proposed methods achieved higher performances in recognition accuracy, compared with the conventional methods employing mel-frequency cepstral coefficients. (C) 2000 Elsevier Science B.V. All rights reserved.
Publisher
ELSEVIER SCIENCE BV
Issue Date
2000-04
Language
English
Article Type
Article
Citation

SPEECH COMMUNICATION, v.30, no.4, pp.223 - 233

ISSN
0167-6393
URI
http://hdl.handle.net/10203/76824
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 14 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0