Voice Activity Detection Using an Adaptive Context Attention Model

Cited 59 time in webofscience Cited 0 time in scopus
  • Hit : 942
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKim, Juntaeko
dc.contributor.authorHahn, Minsooko
dc.date.accessioned2018-08-20T06:48:22Z-
dc.date.available2018-08-20T06:48:22Z-
dc.date.created2018-07-22-
dc.date.created2018-07-22-
dc.date.issued2018-08-
dc.identifier.citationIEEE SIGNAL PROCESSING LETTERS, v.25, no.8, pp.1181 - 1185-
dc.identifier.issn1070-9908-
dc.identifier.urihttp://hdl.handle.net/10203/244655-
dc.description.abstractVoice activity detection (VAD) classifies incoming signal segments into speech or background noise; its performance is crucial in various speech-related applications. Although speech-signal context is a relevant VAD asset, its usefulness varies in unpredictable noise environments. Therefore, its usage should be adaptively adjustable to the noise type. This letter improves the use of context information by using an adaptive context attention model (ACAM) with a novel training strategy for effective attention, which weights the most crucial parts of the context for proper classification. Experiments in real-world scenarios demonstrate that the proposed ACAM-based VAD outperforms the other baseline VAD methods.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.subjectSPEECH ACTIVITY DETECTION-
dc.subjectNEURAL-NETWORKS-
dc.subjectINFORMATION-
dc.subjectFILTER-
dc.subjectNOISE-
dc.titleVoice Activity Detection Using an Adaptive Context Attention Model-
dc.typeArticle-
dc.identifier.wosid000437417100005-
dc.identifier.scopusid2-s2.0-85043376020-
dc.type.rimsART-
dc.citation.volume25-
dc.citation.issue8-
dc.citation.beginningpage1181-
dc.citation.endingpage1185-
dc.citation.publicationnameIEEE SIGNAL PROCESSING LETTERS-
dc.identifier.doi10.1109/LSP.2018.2811740-
dc.contributor.localauthorHahn, Minsoo-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorDeep neural network (DNN)-
dc.subject.keywordAuthorvoice activity detection (VAD)-
dc.subject.keywordPlusSPEECH ACTIVITY DETECTION-
dc.subject.keywordPlusNEURAL-NETWORKS-
dc.subject.keywordPlusINFORMATION-
dc.subject.keywordPlusFILTER-
dc.subject.keywordPlusNOISE-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 59 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0