(An) FPGA-based preprocessing system for GPU-partitioned machine learning inference server
GPU 분할 기술이 적용된 머신러닝 추론 서버를 위한 FPGA 기반의 전처리 시스템

DC Field | Value | Language
dc.contributor.advisor | 유민수 | -
dc.contributor.author | Yeo, Gwangoo | -
dc.contributor.author | 여관구 | -
dc.date.accessioned | 2024-07-30T19:31:30Z | -
dc.date.available | 2024-07-30T19:31:30Z | -
dc.date.issued | 2024 | -
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097177&flag=dissertation | en_US
dc.identifier.uri | http://hdl.handle.net/10203/321605 | -
dc.description | Master's thesis - Korea Advanced Institute of Science and Technology (KAIST) : School of Electrical Engineering, 2024.2, [iv, 25 p. :] | -
dc.description.abstract | In machine learning inference servers, unlike training servers, inference requests arrive irregularly and must be completed within a limited time. Consequently, inference runs with small batch sizes, which leaves GPU resources underutilized. Recent GPUs provide partitioning technology, which divides a single physical GPU into independent partitions sized to suit each user and so allows GPU resources to be used efficiently. Applying this technology to inference servers increases the processing capacity and resource utilization of the GPUs. However, it also creates a bottleneck in the CPU-side preprocessing stage of inference requests. This thesis analyzes the bottlenecks of the preprocessing stage in a GPU-partitioned machine learning inference server and proposes an FPGA-based hardware design that offloads data preprocessing to increase the overall processing throughput of the ML inference server. | -
dc.language | eng | -
dc.publisher | Korea Advanced Institute of Science and Technology (KAIST) | -
dc.subject | Central processing unit; Graphics processing unit; FPGA; Machine learning inference server; GPU partitioning technique | -
dc.subject | CPU; GPU; FPGA; Inference server; GPU partitioning technique | -
dc.title | (An) FPGA-based preprocessing system for GPU-partitioned machine learning inference server | -
dc.title.alternative | GPU 분할 기술이 적용된 머신러닝 추론 서버를 위한 FPGA 기반의 전처리 시스템 | -
dc.type | Thesis(Master) | -
dc.identifier.CNRN | 325007 | -
dc.description.department | Korea Advanced Institute of Science and Technology (KAIST) : School of Electrical Engineering | -
dc.contributor.alternativeauthor | Rhu, Minsoo | -
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.