Distance-based outlier detection for high dimension, low sample size data

Cited 10 time in webofscience Cited 0 time in scopus
  • Hit : 263
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorAhn, Jeongyounko
dc.contributor.authorLee, Myung Heeko
dc.contributor.authorLee, Jung Aeko
dc.date.accessioned2021-06-02T02:50:10Z-
dc.date.available2021-06-02T02:50:10Z-
dc.date.created2021-06-02-
dc.date.created2021-06-02-
dc.date.issued2019-01-
dc.identifier.citationJOURNAL OF APPLIED STATISTICS, v.46, no.1, pp.13 - 29-
dc.identifier.issn0266-4763-
dc.identifier.urihttp://hdl.handle.net/10203/285422-
dc.description.abstractDespite the popularity of high dimension, low sample size data analysis, there has not been enough attention to the sample integrity issue, in particular, a possibility of outliers in the data. A new outlier detection procedure for data with much larger dimensionality than the sample size is presented. The proposed method is motivated by asymptotic properties of high-dimensional distance measures. Empirical studies suggest that high-dimensional outlier detection is more likely to suffer from a swamping effect rather than a masking effect, thus yields more false positives than false negatives. We compare the proposed approaches with existing methods using simulated data from various population settings. A real data example is presented with a consideration on the implication of found outliers.-
dc.languageEnglish-
dc.publisherTAYLOR & FRANCIS LTD-
dc.titleDistance-based outlier detection for high dimension, low sample size data-
dc.typeArticle-
dc.identifier.wosid000449972800002-
dc.identifier.scopusid2-s2.0-85044385520-
dc.type.rimsART-
dc.citation.volume46-
dc.citation.issue1-
dc.citation.beginningpage13-
dc.citation.endingpage29-
dc.citation.publicationnameJOURNAL OF APPLIED STATISTICS-
dc.identifier.doi10.1080/02664763.2018.1452901-
dc.contributor.localauthorAhn, Jeongyoun-
dc.contributor.nonIdAuthorLee, Myung Hee-
dc.contributor.nonIdAuthorLee, Jung Ae-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorCentroid distance-
dc.subject.keywordAuthorHDLSS-
dc.subject.keywordAuthorhigh-dimensional asymptotics-
dc.subject.keywordAuthormaximal data piling distance-
dc.subject.keywordAuthormultiple outliers-
dc.subject.keywordPlusGEOMETRIC REPRESENTATION-
dc.subject.keywordPlusMODEL-
Appears in Collection
IE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 10 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0