Blind source separation exploiting higher-order frequency dependencies

Cited 310 times in Web of Science; cited 301 times in Scopus
| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Kim, T | ko |
| dc.contributor.author | Attias, HT | ko |
| dc.contributor.author | Lee, Soo-Young | ko |
| dc.contributor.author | Lee, TW | ko |
| dc.date.accessioned | 2009-07-23T02:09:16Z | - |
| dc.date.available | 2009-07-23T02:09:16Z | - |
| dc.date.created | 2012-02-06 | - |
| dc.date.issued | 2007-01 | - |
| dc.identifier.citation | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.15, pp.70 - 79 | - |
| dc.identifier.issn | 1558-7916 | - |
| dc.identifier.uri | http://hdl.handle.net/10203/10202 | - |
| dc.description.abstract | Blind source separation (BSS) is a challenging problem in real-world environments where sources are time delayed and convolved. The problem becomes more difficult in very reverberant conditions, with an increasing number of sources, and geometric configurations of the sources such that finding directionality is not sufficient for source separation. In this paper, we propose a new algorithm that exploits higher order frequency dependencies of source signals in order to separate them when they are mixed. In the frequency domain, this formulation assumes that dependencies exist between frequency bins instead of defining independence for each frequency bin. In this manner, we can avoid the well-known frequency permutation problem. To derive the learning algorithm, we define a cost function, which is an extension of mutual information between multivariate random variables. By introducing a source prior that models the inherent frequency dependencies, we obtain a simple form of a multivariate score function. In experiments, we generate simulated data with various kinds of sources in various environments. We evaluate the performances and compare it with other well-known algorithms. The results show the proposed algorithm outperforms the others in most cases. The algorithm is also able to accurately recover six sources with six microphones. In this case, we can obtain about 16-dB signal-to-interference ratio (SIR) improvement. Similar performance is observed in real conference room recordings with three human speakers reading sentences and one loudspeaker playing music. | - |
| dc.language | English | - |
| dc.language.iso | en_US | en |
| dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
| dc.subject | ICA MIXTURE-MODELS | - |
| dc.subject | NATURAL IMAGES | - |
| dc.subject | CLASSIFICATION | - |
| dc.subject | DOMAIN | - |
| dc.title | Blind source separation exploiting higher-order frequency dependencies | - |
| dc.type | Article | - |
| dc.identifier.wosid | 000243286900007 | - |
| dc.identifier.scopusid | 2-s2.0-34247155553 | - |
| dc.type.rims | ART | - |
| dc.citation.volume | 15 | - |
| dc.citation.beginningpage | 70 | - |
| dc.citation.endingpage | 79 | - |
| dc.citation.publicationname | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | - |
| dc.identifier.doi | 10.1109/TASL.2006.872618 | - |
| dc.embargo.liftdate | 9999-12-31 | - |
| dc.embargo.terms | 9999-12-31 | - |
| dc.contributor.localauthor | Lee, Soo-Young | - |
| dc.contributor.nonIdAuthor | Kim, T | - |
| dc.contributor.nonIdAuthor | Attias, HT | - |
| dc.contributor.nonIdAuthor | Lee, TW | - |
| dc.type.journalArticle | Article | - |
| dc.subject.keywordAuthor | blind source separation (BSS) | - |
| dc.subject.keywordAuthor | cocktail party problem | - |
| dc.subject.keywordAuthor | convolutive mixture | - |
| dc.subject.keywordAuthor | frequency domain | - |
| dc.subject.keywordAuthor | higher order dependency | - |
| dc.subject.keywordAuthor | independent component analysis | - |
| dc.subject.keywordAuthor | permutation problem | - |
| dc.subject.keywordPlus | ICA MIXTURE-MODELS | - |
| dc.subject.keywordPlus | NATURAL IMAGES | - |
| dc.subject.keywordPlus | CLASSIFICATION | - |
| dc.subject.keywordPlus | DOMAIN | - |
Appears in Collection
EE-Journal Papers (저널논문; "journal papers")