Spatial hearing algorithms based on binaural zero-crossings : sound source localization, segregation, and dereverberation영교차점에 기초한 공간 청각 알고리즘 : 음원 국지화, 분리 및 반향제거

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 638
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKil, Rhee-Man-
dc.contributor.advisor길이만-
dc.contributor.authorKim, Young-Ik-
dc.contributor.author김영익-
dc.date.accessioned2011-12-14T04:40:08Z-
dc.date.available2011-12-14T04:40:08Z-
dc.date.issued2007-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=263486&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/41893-
dc.description학위논문(박사) - 한국과학기술원 : 응용수학전공, 2007.2, [ xi, 94 p. ]-
dc.description.abstractThis thesis concerns a new zero-crossing-based binaural model for spatial hearing. Conventional binaural model computes cross-correlations of binaural signals for the estimation of the interaural time difference which is a primary spatial cue. However, the cross-correlation-based binaural processing model requires high computational complexity and suffers from inaccuracies in localizing sound sources especially in a noisy multisource environment. The proposed model extracts two important binaural cues of interaural time difference (ITD) and interaural intensity difference (IID) on the basis of zero-crossing times and interval powers of filtered signal. This fundamental difference on binaural cue extraction gives great flexibility on designing spatial hearing algorithms. Another distinctive feature of our model is to estimate the signal-to-noise ratios (SNRs) of filtered signal using the variances of ITD sample, enabling us to perform noise-robust estimation of ITDs using the estimated SNRs. Using the zero-crossing-based binaural model, we developed three novel algorithms on spatial hearing: localization, segregation, and dereverberation. Localization: On the histogram of ITD samples weighted by the estimated SNRs, multiple sound source directions are localized in noisy environments. In the experiments on noisy multisource environments, the proposed localization algorithm provided more accurate noise robust estimation of sound source directions compared conventional cross-correlation-based method. Segregation: Using the locations of sound sources, we assigned each zero-crossing interval power to one of the sound source to estimate the target-to-interferers power ratio. Then two types of masks, binary and soft, derived from the estimated power ratios for the segregation and missing data recognition tasks. On both the speech segregation and recognition tests, our ratio mask showed superior results to the cross-correlation-ba...eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectzero-crossing-
dc.subjectspatial hearing-
dc.subjectsound source localization-
dc.subject반향제거-
dc.subjectspeech segregation-
dc.subjectdereverberation-
dc.subject공간청각-
dc.subject영교차-
dc.subject음원 국지화-
dc.subject음성 분리-
dc.titleSpatial hearing algorithms based on binaural zero-crossings-
dc.title.alternative영교차점에 기초한 공간 청각 알고리즘 : 음원 국지화, 분리 및 반향제거-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN263486/325007 -
dc.description.department한국과학기술원 : 응용수학전공, -
dc.identifier.uid020035053-
dc.contributor.localauthorKil, Rhee-Man-
dc.contributor.localauthor길이만-
dc.title.subtitlesound source localization, segregation, and dereverberation-
Appears in Collection
MA-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0