AASIST: AUDIO ANTI-SPOOFING USING INTEGRATED SPECTRO-TEMPORAL GRAPH ATTENTION NETWORKS

Cited 59 time in webofscience Cited 0 time in scopus
  • Hit : 448
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorJung, Jee-Weonko
dc.contributor.authorHeo, Hee-Sooko
dc.contributor.authorTak, Hemlatako
dc.contributor.authorShim, Hye-Jinko
dc.contributor.authorChung, Joon Sonko
dc.contributor.authorLee, Bong-Jinko
dc.contributor.authorYu, Ha-Jinko
dc.contributor.authorEvans, Nicholasko
dc.date.accessioned2022-11-17T06:01:44Z-
dc.date.available2022-11-17T06:01:44Z-
dc.date.created2022-09-27-
dc.date.created2022-09-27-
dc.date.issued2022-05-
dc.identifier.citation47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022, pp.2405 - 2409-
dc.identifier.issn1520-6149-
dc.identifier.urihttp://hdl.handle.net/10203/299796-
dc.description.abstractArtefacts that differentiate spoofed from bona-fide utterances can reside in specific temporal or spectral intervals. Their reliable detection usually depends upon computationally demanding ensemble systems where each subsystem is tuned to some specific artefacts. We seek to develop an efficient, single system that can detect a broad range of different spoofing attacks without score-level ensembles. We propose a novel heterogeneous stacking graph attention layer that models artefacts spanning heterogeneous temporal and spectral intervals with a heterogeneous attention mechanism and a stack node. With a new max graph operation that involves a competitive mechanism and a new readout scheme, our approach, named AASIST, outperforms the current state-of-the-art by 20% relative. Even a lightweight variant, AASIST-L, with only 85k parameters, outperforms all competing systems.-
dc.languageEnglish-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleAASIST: AUDIO ANTI-SPOOFING USING INTEGRATED SPECTRO-TEMPORAL GRAPH ATTENTION NETWORKS-
dc.typeConference-
dc.identifier.wosid000864187906131-
dc.identifier.scopusid2-s2.0-85128259138-
dc.type.rimsCONF-
dc.citation.beginningpage2405-
dc.citation.endingpage2409-
dc.citation.publicationname47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022-
dc.identifier.conferencecountryUS-
dc.identifier.conferencelocationVirtual, Online-
dc.identifier.doi10.1109/ICASSP43922.2022.9747766-
dc.contributor.localauthorChung, Joon Son-
dc.contributor.nonIdAuthorJung, Jee-Weon-
dc.contributor.nonIdAuthorHeo, Hee-Soo-
dc.contributor.nonIdAuthorTak, Hemlata-
dc.contributor.nonIdAuthorShim, Hye-Jin-
dc.contributor.nonIdAuthorLee, Bong-Jin-
dc.contributor.nonIdAuthorYu, Ha-Jin-
dc.contributor.nonIdAuthorEvans, Nicholas-
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 59 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0