DSpace at KOASAS: CROSS ATTENTIVE POOLING FOR SPEAKER VERIFICATION

DSpace at KOASAS

RIMS Collection RIMS Conference Papers

CROSS ATTENTIVE POOLING FOR SPEAKER VERIFICATION

Cited 6 time in

Cited 0 time in

Hit : 94
Download : 0

Export

Kye, Seong Min researcher / Kwon, Yoohwan / Chung, Joon Son

The goal of this paper is text-independent speaker verification where utterances come from `in the wild' videos and may contain irrelevant signal. While speaker verification is naturally a pair-wise problem, existing methods to produce the speaker embeddings are instance-wise. In this paper, we propose Cross Attentive Pooling (CAP) that utilises the context information across the referencequery pair to generate utterance-level embeddings that contain the most discriminative information for the pair-wise matching problem. Experiments are performed on the VoxCeleb dataset in which our method outperforms comparable pooling strategies.

Publisher: IEEE

Issue Date: 2021-01

Language: English

Citation: IEEE Spoken Language Technology Workshop (SLT), pp.294 - 300

ISSN: 2639-5479

DOI: 10.1109/SLT48900.2021.9383565

URI: http://hdl.handle.net/10203/288414

Appears in Collection: RIMS Conference Papers

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 6 items in WoS	Click to see citing articles in

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

CROSS ATTENTIVE POOLING FOR SPEAKER VERIFICATION

This item is cited by other documents in WoS

KOASAS

Communities & Collections