Scalable Data-Driven PageRank: Algorithms, System Issues, and Lessons Learned

Cited 26 time in webofscience Cited 26 time in scopus
  • Hit : 215
  • Download : 0
Large-scale network and graph analysis has received considerable attention recently. Graph mining techniques often involve an iterative algorithm, which can be implemented in a variety of ways. Using PageRank as a model problem, we look at three algorithm design axes: work activation, data access pattern, and scheduling. We investigate the impact of different algorithm design choices. Using these design axes, we design and test a variety of PageRank implementations finding that data-driven, push-based algorithms are able to achieve more than 28x the performance of standard PageRank implementations (e.g., those in GraphLab). The design choices affect both single-threaded performance as well as parallel scalability. The implementation lessons not only guide efficient implementations of many graph mining algorithms, but also provide a framework for designing new scalable algorithms.
Publisher
SPRINGER-VERLAG BERLIN
Issue Date
2015-07
Language
English
Citation

21st International Conference on Parallel and Distributed Computing (Euro-Par), pp.438 - 450

ISSN
0302-9743
DOI
10.1007/978-3-662-48096-0_34
URI
http://hdl.handle.net/10203/275461
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 26 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0