AlphaR: Learning-Powered Resource Management for Irregular, Dynamic Microservice Graph

doi:10.1109/IPDPS49936.2021.00089

Proceedings ArticleDOI

AlphaR: Learning-Powered Resource Management for Irregular, Dynamic Microservice Graph

- pp 797-806

TLDR

In this paper, a learning-powered resource management system tailored to the microservice environment is proposed, which can improve the mean and p95 response time by up to 80% and 77.5% respectively compared with conventional schemes.

Abstract:

The microservice architecture is a hot trend which proposes to transform the traditional monolith application into massive dynamic and irregular small services. To boost the overall throughput and ensure the guaranteed latency, it is desirable to process massive service requests in parallel with efficient resource sharing in data centers. However, the disaggregation nature of microservice unavoidably upscales the design space of resource management and increases its complexity. In this paper, we propose AlphaR, a learning-powered resource management system tailored to the microservice environment. The basic idea of AlphaR is to generate microservice-specific resource management policies for improving efficiency. Specifically, we take the first step to use bipartite graph as a convenient abstraction for application built with microservices. Based on this, we devise a bipartite feature inference approach named Bi-GNN to extract the temporal characteristics of microservices. Furthermore, we implement a policy network to select appropriate resource allocation choices for maximizing the performance in resource-constrained data centers. AlphaR can improve the mean and p95 response time by up to 80% and 77.5% respectively compared with conventional schemes.

AlphaR: Learning-Powered Resource Management for Irregular, Dynamic Microservice Graph

Citations

Adaptive Resource Efficient Microservice Deployment in Cloud-Edge Continuum

Practical Efficient Microservice Autoscaling with QoS Assurance

Layer-aware Collaborative Microservice Deployment toward Maximal Edge Throughput

A Lightweight Workload-Aware Microservices Autoscaling with QoS Assurance

A Survey on Graph Neural Networks for Microservice-Based Cloud Applications

References

Usability Engineering

Graph Attention Networks

Graph Neural Networks: A Review of Methods and Applications

The tail at scale

Dominant resource fairness: fair allocation of multiple resource types

Related Papers (5)

Joint optimization of service request routing and instance placement in the microservice system

Dyme : Dynamic Microservice Scheduling in Edge Computing Enabled IoT

Multi-resource schedulable unit for adaptive application-driven unified resource management in data centers

AI-Driven Collaborative Resource Allocation for Task Execution in 6G-Enabled Massive IoT

5G network-oriented hierarchical distributed cloud computing system resource optimization scheduling and allocation