Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards

Cited 11 times in Web of Science; cited 0 times in Scopus
  • Hits : 230
  • Downloads : 0
DC Field | Value | Language
dc.contributor.author | Kwon, Youngeun | ko
dc.contributor.author | Rhu, Minsoo | ko
dc.date.accessioned | 2022-11-24T10:01:37Z | -
dc.date.available | 2022-11-24T10:01:37Z | -
dc.date.created | 2022-11-20 | -
dc.date.issued | 2022-06 | -
dc.identifier.citation | 49th IEEE/ACM International Symposium on Computer Architecture, ISCA 2022, pp. 860-873 | -
dc.identifier.issn | 1063-6897 | -
dc.identifier.uri | http://hdl.handle.net/10203/300888 | -
dc.description.abstract | Personalized recommendation models (RecSys) are among the most popular machine learning workloads serviced by hyperscalers. A critical challenge of training RecSys is its high memory capacity requirement, with model sizes reaching hundreds of GBs to TBs. In RecSys, the so-called embedding layers account for the majority of memory usage, so current systems employ a hybrid CPU-GPU design in which the large CPU memory stores the memory-hungry embedding layers. Unfortunately, training embeddings involves several memory-bandwidth-intensive operations that are at odds with the slow CPU memory, causing performance overheads. Prior work proposed caching frequently accessed embeddings inside GPU memory as a means to filter down the embedding-layer traffic to CPU memory, but this paper observes several limitations with such a cache design. In this work, we present a fundamentally different approach to designing embedding caches for RecSys. Our proposed ScratchPipe architecture utilizes unique properties of RecSys training to develop an embedding cache that sees not only past but also "future" cache accesses. ScratchPipe exploits this property to guarantee that the active working set of the embedding layers can "always" be captured inside our proposed cache design, enabling embedding-layer training to be conducted at GPU memory speed. | -
dc.language | English | -
dc.publisher | IEEE/ACM | -
dc.title | Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards | -
dc.type | Conference | -
dc.identifier.wosid | 000852702500060 | -
dc.identifier.scopusid | 2-s2.0-85132797165 | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 860 | -
dc.citation.endingpage | 873 | -
dc.citation.publicationname | 49th IEEE/ACM International Symposium on Computer Architecture, ISCA 2022 | -
dc.identifier.conferencecountry | US | -
dc.identifier.conferencelocation | New York | -
dc.identifier.doi | 10.1145/3470496.3527386 | -
dc.contributor.localauthor | Rhu, Minsoo | -
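
The abstract above describes an embedding cache driven by "future" accesses: because RecSys training batches are queued ahead of time, the embedding rows an upcoming iteration will touch are known before they are needed and can be staged into fast GPU memory. The sketch below is only an illustration of that general idea, not the paper's ScratchPipe implementation; the class and method names (LookaheadEmbeddingCache, prefetch_for, lookup) and the toy data are hypothetical.

# Minimal sketch of a look-ahead embedding cache, under the assumptions stated above.
from collections import OrderedDict

class LookaheadEmbeddingCache:
    """Fixed-capacity cache over a large, slow ('CPU-resident') embedding table."""

    def __init__(self, cpu_table, capacity):
        self.cpu_table = cpu_table   # row id -> embedding vector (slow memory)
        self.capacity = capacity     # number of rows the fast memory can hold
        self.fast = OrderedDict()    # row id -> embedding vector (fast memory)

    def prefetch_for(self, upcoming_batches):
        """Stage rows the upcoming batches will need; drop rows with no future use."""
        needed = {rid for batch in upcoming_batches for rid in batch}
        # Flush rows not referenced in the look-ahead window back to the slow table
        # (rows are copied on prefetch, so this simply keeps both copies consistent).
        for rid in [r for r in self.fast if r not in needed]:
            self.cpu_table[rid] = self.fast.pop(rid)
        # Copy missing rows into fast memory, up to capacity.
        for rid in needed:
            if rid not in self.fast and len(self.fast) < self.capacity:
                self.fast[rid] = self.cpu_table[rid]

    def lookup(self, rid):
        """Serve a row from fast memory if prefetched, else fall back to slow memory."""
        return self.fast.get(rid, self.cpu_table.get(rid))


if __name__ == "__main__":
    # Toy embedding table: 1,000 rows of 4-dim vectors kept as plain lists.
    cpu_table = {rid: [float(rid)] * 4 for rid in range(1000)}
    cache = LookaheadEmbeddingCache(cpu_table, capacity=64)

    # Training input pipeline: batches of embedding row ids, known ahead of time.
    batches = [[1, 2, 3, 500], [2, 3, 7, 42], [7, 42, 999, 1]]

    for step, batch in enumerate(batches):
        cache.prefetch_for(batches[step:step + 2])   # look one batch ahead
        vectors = [cache.lookup(rid) for rid in batch]
        print(f"step {step}: gathered {len(vectors)} embedding rows")

Unlike a purely reactive (e.g., LRU) cache that only learns from past accesses, the look-ahead variant can evict rows it knows will not be reused soon and guarantee that the next batch's working set is resident, which is the contrast the abstract draws.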
Appears in Collection
EE-Conference Papers (Conference Papers)
Files in This Item
There are no files associated with this item.