NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units

Cited 23 time in webofscience Cited 12 time in scopus
  • Hit : 232
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorHyun, Bongjoonko
dc.contributor.authorKwon, Youngeunko
dc.contributor.authorChoi, Yujeongko
dc.contributor.authorKim, John Dongjunko
dc.contributor.authorRhu, Minsooko
dc.date.accessioned2020-09-18T04:17:17Z-
dc.date.available2020-09-18T04:17:17Z-
dc.date.created2020-08-12-
dc.date.created2020-08-12-
dc.date.issued2020-03-20-
dc.identifier.citationThe 25th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-25), pp.1109 - 1124-
dc.identifier.urihttp://hdl.handle.net/10203/276229-
dc.description.abstractTo satisfy the compute and memory demands of deep neural networks (DNNs), neural processing units (NPUs) are widely being utilized for accelerating DNNs. Similar to how GPUs have evolved from a slave device into a mainstream processor architecture, it is likely that NPUs will become first-class citizens in this fast-evolving heterogeneous architecture space. This paper makes a case for enabling address translation in NPUs to decouple the virtual and physical memory address space. Through a careful data-driven application characterization study, we root-cause several limitations of prior GPU-centric address translation schemes and propose a memory management unit (MMU) that is tailored for NPUs. Compared to an oracular MMU design point, our proposal incurs only an average 0.06% performance overhead.-
dc.languageEnglish-
dc.publisherACM-
dc.titleNeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units-
dc.typeConference-
dc.identifier.wosid000541369300070-
dc.identifier.scopusid2-s2.0-85082394185-
dc.type.rimsCONF-
dc.citation.beginningpage1109-
dc.citation.endingpage1124-
dc.citation.publicationnameThe 25th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-25)-
dc.identifier.conferencecountrySZ-
dc.identifier.conferencelocationLausanne-
dc.identifier.doi10.1145/3373376.3378494-
dc.contributor.localauthorKim, John Dongjun-
dc.contributor.localauthorRhu, Minsoo-
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 23 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0