Temporal Dynamic Convolutional Neural Network for Text-Independent Speaker Verification and Phonemic Analysis

Cited 11 times in Web of Science; cited 0 times in Scopus.
In the field of text-independent speaker recognition, dynamic models that adapt along the time axis have been proposed to account for the phoneme-varying characteristics of speech. However, a detailed analysis of how such dynamic models behave across phonemes has been lacking. In this paper, we propose the temporal dynamic CNN (TDY-CNN), which accounts for the temporal variation of phonemes by applying kernels that adapt optimally to each time bin. Each adaptive kernel is formed as a weighted sum of trained basis kernels. We then analyze how the adaptive kernels respond to different phonemes at various layers. TDY-ResNet-38(×0.5) with six basis kernels improved the equal error rate (EER), a measure of speaker verification performance, by 17.3% compared to the baseline model ResNet-38(×0.5). In addition, we show that the adaptive kernels depend on phoneme groups and are more phoneme-specific at early layers. The temporal dynamic model adapts itself to phonemes without phoneme information being given explicitly during training, and the results show the necessity of considering phoneme variation within utterances for more accurate and robust text-independent speaker verification. © 2022 IEEE
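The core idea described in the abstract — replacing one fixed convolution kernel with a per-time-bin kernel formed as a weighted sum of trained basis kernels — can be illustrated with a minimal, self-contained sketch. This is a hypothetical 1-D simplification for exposition, not the paper's actual TDY-CNN implementation: the function names, the softmax attention over basis kernels, and the toy shapes are all assumptions for illustration.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def temporal_dynamic_conv1d(signal, basis_kernels, attention_logits):
    """Toy temporal dynamic convolution (illustrative, not the paper's code).

    signal           : list of T floats (one feature channel over time)
    basis_kernels    : K lists, each of odd length L (the trained bases)
    attention_logits : T lists of K floats, scoring each basis per time bin
    Returns a list of T floats, zero-padded at the borders.
    """
    K = len(basis_kernels)
    L = len(basis_kernels[0])
    pad = L // 2
    padded = [0.0] * pad + list(signal) + [0.0] * pad
    out = []
    for t in range(len(signal)):
        # The adaptive kernel for time bin t: a weighted sum of the
        # basis kernels, with weights produced per time bin.
        w = softmax(attention_logits[t])
        kernel = [sum(w[k] * basis_kernels[k][i] for k in range(K))
                  for i in range(L)]
        # Ordinary convolution at this single position, using that kernel.
        out.append(sum(kernel[i] * padded[t + i] for i in range(L)))
    return out
```

In a trained model the attention logits would themselves be predicted from the input features, so different time bins (and hence different phonemes) end up selecting different kernel mixtures; here they are supplied directly to keep the sketch runnable.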
Publisher
IEEE
Issue Date
2022-05-23
Language
English
Citation

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6742–6746

ISSN
1520-6149
DOI
10.1109/icassp43922.2022.9747421
URI
http://hdl.handle.net/10203/298717
Appears in Collection
ME-Conference Papers (Conference Papers)
Files in This Item
There are no files associated with this item.
