DC Field | Value | Language |
---|---|---|
dc.contributor.author | Poythress, JC | ko |
dc.contributor.author | Park, Cheolwoo | ko |
dc.contributor.author | Ahn, Jeongyoun | ko |
dc.date.accessioned | 2022-10-30T09:00:13Z | - |
dc.date.available | 2022-10-30T09:00:13Z | - |
dc.date.created | 2021-08-31 | - |
dc.date.created | 2021-08-31 | - |
dc.date.created | 2021-08-31 | - |
dc.date.issued | 2022-12 | - |
dc.identifier.citation | JOURNAL OF APPLIED STATISTICS, v.49, no.15, pp.3889 - 3907 | - |
dc.identifier.issn | 0266-4763 | - |
dc.identifier.uri | http://hdl.handle.net/10203/299174 | - |
dc.description.abstract | Many research proposals involve collecting multiple sources of information from a set of common samples, with the goal of performing an integrative analysis describing the associations between sources. We propose a method that characterizes the dominant modes of co-variation between the variables in two datasets while simultaneously performing variable selection. Our method relies on a sparse, low rank approximation of a matrix containing pairwise measures of association between the two sets of variables. We show that the proposed method shares a close connection with another group of methods for integrative data analysis - sparse canonical correlation analysis (CCA). Under some assumptions, the proposed method and sparse CCA aim to select the same subsets of variables. We show through simulation that the proposed method can achieve better variable selection accuracies than two state-of-the-art sparse CCA algorithms. Empirically, we demonstrate through the analysis of DNA methylation and gene expression data that the proposed method selects variables that have as high or higher canonical correlation than the variables selected by sparse CCA methods, which is a rather surprising finding given that objective function of the proposed method does not actually maximize the canonical correlation. | - |
dc.language | English | - |
dc.publisher | TAYLOR & FRANCIS LTD | - |
dc.title | Dimension-wise sparse low-rank approximation of a matrix with application to variable selection in high-dimensional integrative analyses of association | - |
dc.type | Article | - |
dc.identifier.wosid | 000686490200001 | - |
dc.identifier.scopusid | 2-s2.0-85113263635 | - |
dc.type.rims | ART | - |
dc.citation.volume | 49 | - |
dc.citation.issue | 15 | - |
dc.citation.beginningpage | 3889 | - |
dc.citation.endingpage | 3907 | - |
dc.citation.publicationname | JOURNAL OF APPLIED STATISTICS | - |
dc.identifier.doi | 10.1080/02664763.2021.1967892 | - |
dc.contributor.localauthor | Park, Cheolwoo | - |
dc.contributor.localauthor | Ahn, Jeongyoun | - |
dc.contributor.nonIdAuthor | Poythress, JC | - |
dc.description.isOpenAccess | N | - |
dc.type.journalArticle | Article | - |
dc.subject.keywordAuthor | High dimension low sample size | - |
dc.subject.keywordAuthor | multimodal data | - |
dc.subject.keywordAuthor | nuclear norm | - |
dc.subject.keywordAuthor | sparse canonical correlation analysis | - |
dc.subject.keywordPlus | CANONICAL CORRELATION | - |
dc.subject.keywordPlus | MODEL SELECTION | - |
dc.subject.keywordPlus | REGRESSION | - |
dc.subject.keywordPlus | REDUCTION | - |
dc.subject.keywordPlus | JOINT | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.