Workload characterization for GPU partitioning-based machine learning inference server
A study on the characteristic analysis and utilization of machine learning inference servers with GPU partitioning technology applied

DC Field | Value | Language
dc.contributor.advisor | 유민수 | -
dc.contributor.author | Kim, Jiin | -
dc.contributor.author | 김지인 | -
dc.date.accessioned | 2024-08-08T19:30:14Z | -
dc.date.available | 2024-08-08T19:30:14Z | -
dc.date.issued | 2024 | -
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097302&flag=dissertation | en_US
dc.identifier.uri | http://hdl.handle.net/10203/321777 | -
dc.description | Thesis (Master's) - Korea Advanced Institute of Science and Technology: School of Electrical Engineering, 2024.2, [iv, 25 p. :] | -
dc.description.abstract | Maximizing the utilization of graphics processing unit (GPU) resources is a crucial factor in directly reducing a data center’s total cost of ownership (TCO). To address this, GPU partitioning technology has been developed, which divides a single GPU so that multiple workloads can execute simultaneously. However, the characteristics of real-world machine learning inference systems under GPU partitioning have not been actively studied. This thesis characterizes workloads running under GPU partitioning, with the goal of improving the efficiency of machine learning inference systems. Specifically, we focus on resource utilization, throughput, and latency in the context of GPU partitioning. Based on the characterization results, we propose an efficient batching system for GPU partitioning-based machine learning inference to further optimize overall system performance. | -
dc.language | eng | -
dc.publisher | Korea Advanced Institute of Science and Technology | -
dc.subject | Machine learning; GPU partitioning technology; Machine learning inference; Batching (translated from the Korean keywords) | -
dc.subject | Machine learning; GPU partitioning; Inference; Batching | -
dc.title | Workload characterization for GPU partitioning-based machine learning inference server | -
dc.title.alternative | A study on the characteristic analysis and utilization of machine learning inference servers with GPU partitioning technology applied | -
dc.type | Thesis(Master) | -
dc.identifier.CNRN | 325007 | -
dc.description.department | Korea Advanced Institute of Science and Technology: School of Electrical Engineering | -
dc.contributor.alternativeauthor | Rhu, Minsoo | -
Appears in Collection
EE-Theses_Master (Master's Theses)
Files in This Item
There are no files associated with this item.
