A NUCA Substrate for Flexible CMP Cache Sharing

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 345
  • Download : 492
We propose an organization for the on-chip memory sys- tem of a chip multiprocessor, in which 16 processors share a 16MB pool of 256 L2 cache banks. The L2 cache is or- ganized as a non-uniform cache architecture (NUCA) array with a switched network embedded in it for high perfor- mance. We show that this organization can support the spectrum of degrees of sharing: unshared, in which each processor has a private portion of the cache, thus reduc- ing hit latency, completely shared, in which every processor shares the entire cache, thus minimizing misses, and every point in between. We find the optimal degree of sharing for a number of cache bank mapping policies, and also evaluate a per-application cache partitioning strategy. We conclude that a static NUCA organization with sharing degrees of two or four work best across a suite of commercial and scientific parallel workloads. We also demonstrate that migratory, dy- namic NUCA approaches improve performance significantly for a subset of the workloads at the cost of increased power consumption and complexity, especially as per-application cache partitioning strategies are applied.
Description
전산학과
Issue Date
2005-06-21
Language
English
Keywords

chip-multiprocessor; cache sharing; non-uniform cache architecture

Citation

International Conference on Supercomputing v. no.1045-9219

ISSN
1045-9219
URI
http://hdl.handle.net/10203/22128
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
nuca_cmp_ics05.pdf(158.52 kB)Download

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0