DC Field | Value | Language |
---|---|---|
dc.contributor.author | Seo, Junghyuk | ko |
dc.contributor.author | Kim, Myoung Ho | ko |
dc.date.accessioned | 2021-02-22T08:50:06Z | - |
dc.date.available | 2021-02-22T08:50:06Z | - |
dc.date.created | 2020-11-02 | - |
dc.date.issued | 2021-04 | - |
dc.identifier.citation | EXPERT SYSTEMS WITH APPLICATIONS, v.168, pp.114221 | - |
dc.identifier.issn | 0957-4174 | - |
dc.identifier.uri | http://hdl.handle.net/10203/280965 | - |
dc.description.abstract | In recent years, the size of graph data has increased significantly, but most existing graph clustering algorithms do not consider the case where main memory is too small to hold a large graph. Exploring the entire graph for clustering then causes many random disk accesses to data that are not loaded into memory, resulting in excessive disk I/O and thrashing. To address this problem, we propose an I/O-efficient algorithm for structural clustering of a graph, called pm-SCAN. In the proposed method, if memory is insufficient, the input graph is partitioned into several subgraphs, each smaller than memory, and clustering is first performed on each subgraph. The clusters from the subgraphs are then merged based on the connectivity between clusters, so that global results with respect to the original input graph are obtained. pm-SCAN not only scales to very large graphs, i.e., cases where available memory is significantly short, but also produces the same result as the original structural clustering algorithm SCAN. We also propose a cluster maintenance method for large-scale dynamic graphs that change over time. Instead of reclustering the whole graph, we first identify only the small set of nodes whose structural connectivity may change under a given update operation, and then access only those nodes on disk and update their clusters to reduce maintenance costs. This dynamic graph handling mechanism shows significant performance improvement over the existing method and over the baseline that performs clustering from scratch. © 2020 Elsevier Ltd | - |
dc.language | English | - |
dc.publisher | PERGAMON-ELSEVIER SCIENCE LTD | - |
dc.title | I/O Efficient Structural Clustering and Maintenance of Clusters for Large-scale Graphs | - |
dc.type | Article | - |
dc.identifier.wosid | 000615906200009 | - |
dc.identifier.scopusid | 2-s2.0-85096576137 | - |
dc.type.rims | ART | - |
dc.citation.volume | 168 | - |
dc.citation.beginningpage | 114221 | - |
dc.citation.publicationname | EXPERT SYSTEMS WITH APPLICATIONS | - |
dc.identifier.doi | 10.1016/j.eswa.2020.114221 | - |
dc.contributor.localauthor | Kim, Myoung Ho | - |
dc.description.isOpenAccess | N | - |
dc.type.journalArticle | Article | - |
dc.subject.keywordAuthor | Cluster maintenance | - |
dc.subject.keywordAuthor | Dynamic graph | - |
dc.subject.keywordAuthor | Graph | - |
dc.subject.keywordAuthor | I/O-efficient algorithm | - |
dc.subject.keywordAuthor | Structural graph clustering | - |