The high-dimension, low-sample-size geometric representation holds under mild conditions

Cited 98 time in webofscience Cited 0 time in scopus
  • Hit : 136
  • Download : 0
High-dimension, low-small-sample size datasets have different geometrical properties from those of traditional low-dimensional data. In their asymptotic study regarding increasing dimensionality with a fixed sample size, Hall et al. ( 2005) showed that each data vector is approximately located on the vertices of a regular simplex in a high-dimensional space. A perhaps unappealing aspect of their result is the underlying assumption which requires the variables, viewed as a time series, to be almost independent. We establish an equivalent geometric representation under much milder conditions using asymptotic properties of sample covariance matrices. We discuss implications of the results, such as the use of principal component analysis in a high-dimensional space, extension to the case of nonindependent samples and also the binary classification problem.
Publisher
OXFORD UNIV PRESS
Issue Date
2007-08
Language
English
Article Type
Article
Citation

BIOMETRIKA, v.94, no.3, pp.760 - 766

ISSN
0006-3444
DOI
10.1093/biomet/asm050
URI
http://hdl.handle.net/10203/285439
Appears in Collection
IE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 98 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0