GRAF: A Graph Neural Network based Proactive Resource Allocation Framework for SLO-Oriented Microservices

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 219
  • Download : 0
Microservice is an architectural style that has been widely adopted in various latency-sensitive applications. Similar to the monolith, autoscaling has attracted the attention of operators for managing resource utilization of microservices. However, it is still challenging to optimize resources in terms of latency service-level-objective (SLO) without human intervention. In this paper, we present GRAF, a graph neural network-based proactive resource allocation framework for minimizing total CPU resources while satisfying latency SLO. GRAF leverages front-end workload, distributed tracing data, and machine learning approaches to (a) observe/estimate impact of traffic change (b) find optimal resource combinations (c) make proactive resource allocation. Experiments using various open-source benchmarks demonstrate that GRAF successfully targets latency SLO while saving up to 19% of total CPU resources compared to the fine-tuned autoscaler. Moreover, GRAF handles traffic surge with 36% fewer resources while achieving up to 2.6x faster tail latency convergence compared to the Kubernetes autoscaler.
Publisher
Association for Computing Machinery, Inc
Issue Date
2021-12-09
Language
English
Citation

The 17th International Conference on emerging Networking EXperiments and Technologies (CoNEXT 2021), pp.154 - 167

DOI
10.1145/3485983.3494866
URI
http://hdl.handle.net/10203/290855
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0