Group-based speaker embeddings for text-independent speaker verification 문장 독립 화자 검증을 위한 그룹기반 화자 임베딩

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 210
  • Download : 268
Recently, deep speaker embedding approach has been widely used in text-independent speaker verification, which shows better performance than the traditional i-vector approach. In this work, to improve the deep speaker embedding approach, we propose a novel method called group-based speaker embedding which incorporates group information. We cluster all speakers of the training data into a predefined number of groups in an unsupervised manner, so that a fixed-length group embedding represents the corresponding group. A Group Decision Network (GDN) produces a group weight, and an aggregated group embedding is generated from the weighted sum of the group embeddings and the group weights. Finally, we generate a group-based embedding by adding the aggregated group embedding to the deep speaker embedding. In this way, a speaker embedding can reduce the search space of the speaker identity by incorporating group information, and thereby can flexibly represent a significant number of speakers. We conducted experiments using the VoxCeleb1 database to show that our proposed approach can improve the previous approaches.
Publisher
ACOUSTICAL SOC KOREA
Issue Date
2021-09
Language
Korean
Article Type
Article
Citation

JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, v.40, no.5, pp.496 - 502

ISSN
1225-4428
DOI
10.7776/ASK.2021.40.5.496
URI
http://hdl.handle.net/10203/288484
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
122081.pdf(629.35 kB)Download

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0