Efficient task-mapping of parallel applications using a space-filling curve

Improving the communication performance of parallel programs is an important but difficult problem in a large-scale distributed-memory cluster. Efforts to improve parallel scalability often face severe hurdles in managing communication overheads. This paper proposes a framework for space-filling curve (SFC)-based task remapping for communication-intensive parallel applications. An SFC-based mapping, when applied to the task-mapping of parallel applications, preserves locality in terms of communications and produces a less fragmented task-mapping, reducing communication overheads. The framework also provides tools for performance analysis to see if the proposed task-mapping is appropriate for a given application running on a target system. It further develops a binary classifier as a predictor to decide whether or not to apply the proposed mapping before run-time. We evaluate the framework with three communication-intensive applications in Cartesian coordinates: a P3DFFT solver and a Channel code, both using a 2D domain decomposition model, and a Poisson solver using 3D domain decomposition. The evaluation is conducted on a large-scale cluster system with a fat-tree topology and up to 1,024 compute nodes. The proposed task-mapping achieves an overall performance improvement ranging from ∼30% to ∼66% over the baseline approach, depending on the workload. Also, when used in combination with the binary classifier-based predictor, it achieves expected performance gains of 4% to 8%.
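To make the locality argument concrete, the following is a minimal sketch of how a space-filling curve can reduce fragmentation in a task-to-node mapping. It is not the paper's implementation: the abstract does not say which curve is used, so a Hilbert curve is assumed here, and the names `hilbert_d2xy`, `node_map_hilbert`, `node_map_row_major`, and `cross_node_edges` are hypothetical. It also assumes a square 2D task grid whose side is a power of two, a non-periodic nearest-neighbor communication pattern, and nodes filled with a fixed number of consecutive tasks.

```python
def hilbert_d2xy(n, d):
    """Convert index d along a Hilbert curve to (x, y) on an n x n grid.

    n must be a power of two. This is the standard iterative conversion.
    """
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        if ry == 0:
            # Rotate/flip the quadrant so sub-curves connect end to end.
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def node_map_hilbert(n, tasks_per_node):
    """Assign grid cells to nodes by walking the Hilbert curve (assumed SFC)."""
    return {hilbert_d2xy(n, d): d // tasks_per_node for d in range(n * n)}


def node_map_row_major(n, tasks_per_node):
    """Baseline for comparison: assign cells to nodes in row-major order."""
    return {(x, y): (y * n + x) // tasks_per_node
            for y in range(n) for x in range(n)}


def cross_node_edges(node_of, n):
    """Count nearest-neighbor task pairs that are placed on different nodes."""
    count = 0
    for x in range(n):
        for y in range(n):
            for dx, dy in ((1, 0), (0, 1)):
                nx, ny = x + dx, y + dy
                if nx < n and ny < n and node_of[(x, y)] != node_of[(nx, ny)]:
                    count += 1
    return count


if __name__ == "__main__":
    n, tasks_per_node = 8, 16  # illustrative sizes, not taken from the paper
    print("row-major:", cross_node_edges(node_map_row_major(n, tasks_per_node), n))
    print("hilbert:  ", cross_node_edges(node_map_hilbert(n, tasks_per_node), n))
```

In this toy setting the curve-based placement packs each node with a compact square block of tasks rather than long strips, so fewer neighbor pairs straddle a node boundary. Whether that translates into a speedup on a real fat-tree system depends on the application and the network, which is what the paper's analysis tools and classifier are meant to assess.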
Publisher
Institute of Electrical and Electronics Engineers Inc.
Issue Date
2022-10
Language
English
Citation

31st International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, pp.384 - 397

DOI
10.1145/3559009.3569657
URI
http://hdl.handle.net/10203/312570
Appears in Collection
CS-Conference Papers