PRTFNet: HRTF Individualization for Accurate Spectral Cues Using a Compact PRTF

DC Field: Value (Language)
dc.contributor.author: Ko, Byeong-Yun (ko)
dc.contributor.author: Lee, Gyeong-Tae (ko)
dc.contributor.author: Nam, Hyeonuk (ko)
dc.contributor.author: Park, Yong-Hwa (ko)
dc.date.accessioned: 2023-10-04T06:02:59Z
dc.date.available: 2023-10-04T06:02:59Z
dc.date.created: 2023-10-04
dc.date.issued: 2023-08
dc.identifier.citation: IEEE ACCESS, v.11, pp.96119 - 96130
dc.identifier.issn: 2169-3536
dc.identifier.uri: http://hdl.handle.net/10203/312973
dc.description.abstract: Spatial audio rendering relies on accurate localization perception, which requires individual head-related transfer functions (HRTFs). Previous deep neural network (DNN) methods for predicting HRTF magnitude spectra from pinna images used the HRTF log-magnitude as the network output during training. However, HRTFs also encompass the acoustical characteristics of the head and torso, making it challenging to reconstruct the spectral cues necessary for elevation localization. To tackle this issue, we propose PRTFNet, which reconstructs the individual spectral cues in HRTFs by mitigating the influence of the head and torso. PRTFNet is an end-to-end convolutional neural network (CNN) that uses as its output a compact pinna-related transfer function (PRTF), which removes the sound reflections of the head and torso from the head-related impulse response (HRIR). Additionally, we introduce HRTF phase personalization, a technique that takes the phase spectra of an HRTF selected from a database and scales the phase by the ratio of the target listener's head width to that of the selected HRTF's subject. We evaluated the proposed HRTF individualization methods on the HUTUBS dataset, and the results demonstrate that PRTFNet is highly effective in reconstructing the first and second spectral cues. In terms of log spectral distortion (LSD) and effective LSD (LSDE), PRTFNet outperforms a previous deep learning-based model. Furthermore, multiplying the selected phase by the head-width ratio reduces the root mean square error (RMSE) of the interaural time difference (ITD) by 0.003 ms.
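The abstract describes two concrete operations: scaling a selected HRTF's phase by a head-width ratio, and evaluating reconstruction with log spectral distortion. The following is a minimal NumPy sketch of both; the function names, the choice to operate on the unwrapped phase, and the epsilon guard are illustrative assumptions, not the authors' published implementation.

```python
import numpy as np

def personalize_phase(selected_hrtf, target_head_width, selected_head_width):
    """Illustrative sketch: scale the (unwrapped) phase of an HRTF selected from a
    database by the ratio of the target listener's head width to the head width of
    the selected HRTF's subject, keeping the magnitude spectrum unchanged."""
    magnitude = np.abs(selected_hrtf)
    phase = np.unwrap(np.angle(selected_hrtf))  # assumption: scaling applied to unwrapped phase
    scaled_phase = phase * (target_head_width / selected_head_width)
    return magnitude * np.exp(1j * scaled_phase)

def log_spectral_distortion(h_ref, h_est, eps=1e-12):
    """Standard log spectral distortion (dB) between a reference and an estimated
    HRTF spectrum; eps avoids taking the log of zero."""
    diff_db = 20.0 * np.log10((np.abs(h_ref) + eps) / (np.abs(h_est) + eps))
    return np.sqrt(np.mean(diff_db ** 2))
```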
dc.language: English
dc.publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
dc.title: PRTFNet: HRTF Individualization for Accurate Spectral Cues Using a Compact PRTF
dc.type: Article
dc.identifier.wosid: 001067569700001
dc.identifier.scopusid: 2-s2.0-85168689812
dc.type.rims: ART
dc.citation.volume: 11
dc.citation.beginningpage: 96119
dc.citation.endingpage: 96130
dc.citation.publicationname: IEEE ACCESS
dc.identifier.doi: 10.1109/ACCESS.2023.3308143
dc.contributor.localauthor: Park, Yong-Hwa
dc.description.isOpenAccess: N
dc.type.journalArticle: Article
dc.subject.keywordAuthor: Head-related transfer functions
dc.subject.keywordAuthor: individualization
dc.subject.keywordAuthor: pinna-related transfer functions
dc.subject.keywordAuthor: spectral cues
dc.subject.keywordAuthor: spatial hearing
dc.subject.keywordPlus: SPATIAL-AUDIO
dc.subject.keywordPlus: SOUND LOCALIZATION
dc.subject.keywordPlus: PARAMETRIC MODEL
dc.subject.keywordPlus: HEAD
dc.subject.keywordPlus: FREQUENCY
dc.subject.keywordPlus: SENSITIVITY
dc.subject.keywordPlus: REGRESSION
dc.subject.keywordPlus: PEAK
Appears in Collection
ME-Journal Papers (Journal Papers)
Files in This Item
There are no files associated with this item.
