DSP-CC-: I/O Efficient Parallel Computation of Connected Components in Billion-Scale Networks

Cited 5 time in webofscience Cited 4 time in scopus
  • Hit : 160
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKim, Min-Sooko
dc.contributor.authorLee, Sangyeonko
dc.contributor.authorHan, Wook-Shinko
dc.contributor.authorPark, Himchanko
dc.contributor.authorLee, Jeong-Hoonko
dc.date.accessioned2020-06-02T01:21:21Z-
dc.date.available2020-06-02T01:21:21Z-
dc.date.created2020-05-22-
dc.date.created2020-05-22-
dc.date.created2020-05-22-
dc.date.issued2015-04-
dc.identifier.citationIEEE International Conference on Data Engineering ICDE, pp.2658 - 2671-
dc.identifier.urihttp://hdl.handle.net/10203/274440-
dc.description.abstractComputing connected components is a core operation on graph data. Since billion-scale graphs cannot be resident in memory of a single server, several approaches based on distributed machines have recently been proposed. The representative methods are Hash-To-Min and PowerGraph. Hash-To-Min is the state-of-the art disk-based distributed method which minimizes the number of MapReduce rounds. PowerGraph is the-state-of-the-art in-memory distributed system, which is typically faster than the diskbased distributed one, however, requires a lot of machines for handling billion-scale graphs. In this paper, we propose an I/O efficient parallel algorithm for billion-scale graphs in a single PC. We first propose the Disk-based Sequential access-oriented Parallel processing (DSP) model that exploits sequential disk access in terms of disk I/Os and parallel processing in terms of computation. We then propose an ultra-fast disk-based parallel algorithm for computing connected components, DSP-CC, which largely improves the performance through sequential disk scan and page-level cache-conscious parallel processing. Extensive experimental results show that DSP-CC 1) computes connected components in billion-scale graphs using the limited memory size whereas in-memory algorithms can only support medium-sized graphs with the same memory size, and 2) significantly outperforms all distributed competitors as well as a representative disk-based parallel method.-
dc.languageEnglish-
dc.publisherIEEE-
dc.titleDSP-CC-: I/O Efficient Parallel Computation of Connected Components in Billion-Scale Networks-
dc.typeConference-
dc.identifier.wosid000361245300006-
dc.identifier.scopusid2-s2.0-84941554873-
dc.type.rimsCONF-
dc.citation.beginningpage2658-
dc.citation.endingpage2671-
dc.citation.publicationnameIEEE International Conference on Data Engineering ICDE-
dc.identifier.conferencecountryFI-
dc.identifier.conferencelocationhelsinki-
dc.identifier.doi10.1109/TKDE.2015.2419665-
dc.contributor.localauthorKim, Min-Soo-
dc.contributor.nonIdAuthorLee, Sangyeon-
dc.contributor.nonIdAuthorHan, Wook-Shin-
dc.contributor.nonIdAuthorPark, Himchan-
dc.contributor.nonIdAuthorLee, Jeong-Hoon-
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 5 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0