DSpace at KOASAS: Robust speaker recognition based on filtering in autocorrelation domain and sub-band feature recombination

DC Field	Value	Language
dc.contributor.author	Kim, Sungtak	ko
dc.contributor.author	Ji, Miyoung	ko
dc.contributor.author	Kim, HoiRin	ko
dc.date.accessioned	2010-12-03T08:31:17Z	-
dc.date.available	2010-12-03T08:31:17Z	-
dc.date.created	2012-02-06	-
dc.date.issued	2010-05	-
dc.identifier.citation	PATTERN RECOGNITION LETTERS, v.31, no.7, pp.593 - 599	-
dc.identifier.issn	0167-8655	-
dc.identifier.uri	http://hdl.handle.net/10203/20697	-
dc.description.abstract	This paper presents a new method to improve features derived from filtering in autocorrelation domain, which are called relative autocorrelation sequence mel-frequency cepstral coefficients (RAS-MFCCs), one of the successful features in autocorrelation domain for noise-robust speaker recognition. The RAS-MFCCs are derived by applying temporal filtering to autocorrelation sequences under the assumption that corrupting noise is stationary. However, the use of only the filtered sequences could cause performance degradation due to the use of restricted information, and the assumption that noise is stationary might result in leaving non-stationary noise components in filtered autocorrelation sequences in real environments. To compensate for the restricted information, we propose a multi-streaming feature extraction that uses autocorrelation sequences as well as temporally filtered autocorrelation sequences for feature extraction. Furthermore, a hybrid feature representation, in which the multi-streaming feature extraction and the sub-band feature recombination are combined, is proposed to reduce the noise effects of autocorrelation sequences and the residual-noise effects of temporally filtered autocorrelation sequences. To evaluate the effectiveness of the proposed hybrid speaker recognition system in noisy conditions, we use the TIMIT database and the NTIMIT database. Experiments on the TIMIT database prove the effectiveness of the proposed hybrid method by reducing errors up to 26% and 14% over the conventional RAS-MFCCs in speaker identification and verification, respectively. On the NTIMIT database, the proposed hybrid feature representation provides error reduction of 24% and 18% over the conventional RAS-MFCCs for speaker identification and verification. (C) 2009 Elsevier B.V. All rights reserved.	-
dc.language	English	-
dc.language.iso	en_US	en
dc.publisher	ELSEVIER SCIENCE BV	-
dc.subject	SCORE NORMALIZATION	-
dc.subject	IDENTIFICATION	-
dc.subject	SPEECH	-
dc.subject	MODELS	-
dc.title	Robust speaker recognition based on filtering in autocorrelation domain and sub-band feature recombination	-
dc.type	Article	-
dc.identifier.wosid	000276700500008	-
dc.identifier.scopusid	2-s2.0-77949271463	-
dc.type.rims	ART	-
dc.citation.volume	31	-
dc.citation.issue	7	-
dc.citation.beginningpage	593	-
dc.citation.endingpage	599	-
dc.citation.publicationname	PATTERN RECOGNITION LETTERS	-
dc.embargo.liftdate	9999-12-31	-
dc.embargo.terms	9999-12-31	-
dc.contributor.localauthor	Kim, HoiRin	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Relative autocorrelation sequence	-
dc.subject.keywordAuthor	mel-frequency cepstral coefficients	-
dc.subject.keywordAuthor	Temporal filtering	-
dc.subject.keywordAuthor	Multi-streaming feature extraction	-
dc.subject.keywordAuthor	Hybrid feature representation	-
dc.subject.keywordAuthor	Sub-band feature recombination	-
dc.subject.keywordPlus	SCORE NORMALIZATION	-
dc.subject.keywordPlus	IDENTIFICATION	-
dc.subject.keywordPlus	SPEECH	-
dc.subject.keywordPlus	MODELS	-

Appears in Collection: EE-Journal Papers(저널논문)

This item is cited by 5 other documents in WoS.

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.