An effective dissimilarity measure for clustering of high-dimensional categorical data

Cited 8 time in webofscience Cited 3 time in scopus
  • Hit : 789
  • Download : 13
DC FieldValueLanguage
dc.contributor.authorLee, Jeonghoonko
dc.contributor.authorLee, Yoon Joonko
dc.date.accessioned2014-09-04T08:42:20Z-
dc.date.available2014-09-04T08:42:20Z-
dc.date.created2014-04-01-
dc.date.created2014-04-01-
dc.date.issued2014-03-
dc.identifier.citationKNOWLEDGE AND INFORMATION SYSTEMS, v.38, no.3, pp.743 - 757-
dc.identifier.issn0219-1377-
dc.identifier.urihttp://hdl.handle.net/10203/190160-
dc.description.abstractClustering is to group similar data and find out hidden information about the characteristics of dataset for the further analysis. The concept of dissimilarity of objects is a decisive factor for good quality of results in clustering. When attributes of data are not just numerical but categorical and high dimensional, it is not simple to discriminate the dissimilarity of objects which have synonymous values or unimportant attributes. We suggest a method to quantify the level of difference between categorical values and to weigh the implicit influence of each attribute on constructing a particular cluster. Our method exploits distributional information of data correlated with each categorical value so that intrinsic relationship of values can be discovered. In addition, it measures significance of each attribute in constructing respective cluster dynamically. Experiments on real datasets show the propriety and effectiveness of the method, which improves the results considerably even with simple clustering algorithms. Our approach does not couple with a clustering algorithm tightly and can also be applied to various algorithms flexibly.-
dc.languageEnglish-
dc.publisherSPRINGER LONDON LTD-
dc.subjectALGORITHM-
dc.titleAn effective dissimilarity measure for clustering of high-dimensional categorical data-
dc.typeArticle-
dc.identifier.wosid000331974100010-
dc.identifier.scopusid2-s2.0-84894661945-
dc.type.rimsART-
dc.citation.volume38-
dc.citation.issue3-
dc.citation.beginningpage743-
dc.citation.endingpage757-
dc.citation.publicationnameKNOWLEDGE AND INFORMATION SYSTEMS-
dc.identifier.doi10.1007/s10115-012-0599-1-
dc.embargo.liftdate9999-12-31-
dc.embargo.terms9999-12-31-
dc.contributor.localauthorLee, Yoon Joon-
dc.contributor.nonIdAuthorLee, Jeonghoon-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorSimilarity-
dc.subject.keywordAuthorDissimilarity-
dc.subject.keywordAuthorClustering-
dc.subject.keywordAuthorCategorical data-
dc.subject.keywordAuthorMulti-valued data-
dc.subject.keywordAuthorHigh-dimensional data-
dc.subject.keywordPlusALGORITHM-
Appears in Collection
CS-Journal Papers(저널논문)
Files in This Item
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 8 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0