mNPUsim: Evaluating the Effect of Sharing Resources with Multi-Core NPUs

DC Field | Value | Language
dc.contributor.author | Hwang, Soojin | ko
dc.contributor.author | Lee, Sunho | ko
dc.contributor.author | Kim, Jungwoo | ko
dc.contributor.author | Kim, Hongbean | ko
dc.contributor.author | Huh, Jaehyuk | ko
dc.date.accessioned | 2023-11-13T03:01:29Z | -
dc.date.available | 2023-11-13T03:01:29Z | -
dc.date.created | 2023-11-11 | -
dc.date.issued | 2023-10-03 | -
dc.identifier.citation | IEEE International Symposium on Workload Characterization | -
dc.identifier.uri | http://hdl.handle.net/10203/314506 | -
dc.description.abstract | Multi-core neural processing units (NPUs) have emerged to scale the computation capability of NPUs to efficiently support diverse machine learning tasks. In such multi-core NPUs, workloads in different cores can affect other co-runners by incurring contentions on shared resources such as external memory bandwidth and memory management unit (MMU) for address translation. However, many recent NPU studies use a single-core NPU framework without considering dynamic effect by the shared resources. For this study, we develop a new multi-core NPU simulator to assess the effect of resource sharing accurately. Using the simulator, this paper reports the sharing behaviors of multi-core NPUs with respect to overall throughput and performance variance caused by co-runners. The evaluation of the dual and quad core NPUs shows that sharing MMU and memory bandwidth in general is beneficial for throughput, with minor degradation in fairness. The evaluation also shows that page table walkers in MMU are one of the critical shareable resources. Due to the bursty nature of NPU memory accesses, sharing of walker bandwidth across multiple cores can significantly improve the performance. The study extends the evaluation of scalability with multi-core NPUs, investigating the effect of mapping heterogeneous models to multiple NPUs. | -
dc.language | English | -
dc.publisher | IEEE | -
dc.title | mNPUsim: Evaluating the Effect of Sharing Resources with Multi-Core NPUs | -
dc.type | Conference | -
dc.type.rims | CONF | -
dc.citation.publicationname | IEEE International Symposium on Workload Characterization | -
dc.identifier.conferencecountry | BE | -
dc.identifier.conferencelocation | Ghent | -
dc.identifier.doi | 10.1109/IISWC59245.2023.00018 | -
dc.contributor.localauthor | Huh, Jaehyuk | -
dc.contributor.nonIdAuthor | Kim, Jungwoo | -
dc.contributor.nonIdAuthor | Kim, Hongbean | -
Appears in Collection
CS-Conference Papers(학술회의논문)
