Voice Activity Detection Using an Adaptive Context Attention Model

Cited 22 time in webofscience Cited 0 time in scopus
  • Hit : 549
  • Download : 0
Voice activity detection (VAD) classifies incoming signal segments into speech or background noise; its performance is crucial in various speech-related applications. Although speech-signal context is a relevant VAD asset, its usefulness varies in unpredictable noise environments. Therefore, its usage should be adaptively adjustable to the noise type. This letter improves the use of context information by using an adaptive context attention model (ACAM) with a novel training strategy for effective attention, which weights the most crucial parts of the context for proper classification. Experiments in real-world scenarios demonstrate that the proposed ACAM-based VAD outperforms the other baseline VAD methods.
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Issue Date
2018-08
Language
English
Article Type
Article
Keywords

SPEECH ACTIVITY DETECTION; NEURAL-NETWORKS; INFORMATION; FILTER; NOISE

Citation

IEEE SIGNAL PROCESSING LETTERS, v.25, no.8, pp.1181 - 1185

ISSN
1070-9908
DOI
10.1109/LSP.2018.2811740
URI
http://hdl.handle.net/10203/244655
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 22 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0