Bayesian variable selection in clustering high-dimensional data via a mixture of finite mixtures

Cited 2 time in webofscience Cited 0 time in scopus
  • Hit : 331
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorDoo, Woojinko
dc.contributor.authorKim, Heeyoungko
dc.date.accessioned2021-08-10T07:50:43Z-
dc.date.available2021-08-10T07:50:43Z-
dc.date.created2021-04-26-
dc.date.created2021-04-26-
dc.date.issued2021-08-
dc.identifier.citationJOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, v.91, no.12, pp.2551 - 2568-
dc.identifier.issn0094-9655-
dc.identifier.urihttp://hdl.handle.net/10203/287118-
dc.description.abstractWhen clustering high-dimensional data, it is often important to identify variables that discriminate the clusters. Meanwhile, a common issue in clustering is to determine the number of clusters. In this study, we propose a new method that simultaneously performs clustering and variable selection, while inferring the number of clusters from the data. We formulate the clustering problem using a finite mixture model with a symmetric Dirichlet weights prior, while also placing a prior on the number of components. That is, we utilize a mixture of finite mixtures. We handle the variable selection problem by introducing a latent binary vector, which represents the inclusion/exclusion of variables. We update the binary vector for variable selection using a Metropolis algorithm and perform inference on the cluster structure using a split-merge Markov chain Monte Carlo technique. We demonstrate the advantage of our method using simulated and two real DNA microarray datasets.-
dc.languageEnglish-
dc.publisherTAYLOR & FRANCIS LTD-
dc.titleBayesian variable selection in clustering high-dimensional data via a mixture of finite mixtures-
dc.typeArticle-
dc.identifier.wosid000635014000001-
dc.identifier.scopusid2-s2.0-85103409603-
dc.type.rimsART-
dc.citation.volume91-
dc.citation.issue12-
dc.citation.beginningpage2551-
dc.citation.endingpage2568-
dc.citation.publicationnameJOURNAL OF STATISTICAL COMPUTATION AND SIMULATION-
dc.identifier.doi10.1080/00949655.2021.1902526-
dc.contributor.localauthorKim, Heeyoung-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorBayesian inference-
dc.subject.keywordAuthorclustering-
dc.subject.keywordAuthorDNA microarray data-
dc.subject.keywordAuthorfinite mixture model-
dc.subject.keywordAuthorhigh-dimensional data-
dc.subject.keywordAuthorvariable selection-
Appears in Collection
IE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 2 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0