음소인식 오류에 강인한 N-gram 기반 음성 문서 검색N-gram Based Robust Spoken Document Retrievals for Phoneme Recognition Errors

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 805
  • Download : 22
In spoken document retrievals (SDR), subword (typically phonemes) indexing term is used to avoid the out-of-vocabulary (OOV) problem. It makes the indexing and retrieval process independent from any vocabulary. It also requires a small corpus to train the acoustic model. However, subword indexing term approach has a major drawback. It shows higher word error rates than the large vocabulary continuous speech recognition (LVCSR) system. In this paper, we propose an probabilistic slot detection and n-gram based string matching method for phone based spoken document retrievals to overcome high error rates of phone recognizer. Experimental results have shown 9.25% relative improvement in the mean average precision (mAP) with 1.7 times speed up in comparison with the baseline system.
Publisher
대한음성학회
Issue Date
2008-09
Language
Korean
Citation

말소리, v.1, no.67, pp.149 - 166

ISSN
1226-1173
URI
http://hdl.handle.net/10203/17647
Appears in Collection
CS-Journal Papers(저널논문)
Files in This Item

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0