TY - JOUR
T1 - PreVA
T2 - Predictive Vertical Autoscaler Using Multi Bi-GRU for Sustainable Cloud-Native Computing
AU - Jeon, Jueun
AU - Jeong, Byeonghui
AU - Jeong, Young Sik
N1 - Publisher Copyright:
© This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
PY - 2024
Y1 - 2024
N2 - Container resource autoscaling techniques efficiently manage container resources configured in a cloud-native computing environment. The vertical autoscaling (VA) technique provides resource elasticity by resizing a container resource in response to a generated load. However, VA techniques demonstrate inefficient scaling performance for workloads with patterns that differ from past ones because they only consider patterns of past resource usage and operate based on reactive mechanisms. Additionally, the service is temporarily disrupted by deleting and recreating containers when resources are resized. Therefore, this study proposes a predictive vertical autoscaler (PreVA) that efficiently utilizes resources and ensures service sustainability under various workload patterns. PreVA extracts temporal features from collected CPU and memory usage metrics and then trains a multi bidirectional gated recurrent unit model to forecast future resource usage with high accuracy. PreVA also utilizes forecasted resource usage to calculate optimal resource sizes for future workloads. Finally, PreVA performs rolling updates to resize resources and ensure service sustainability. PreVA is validated by performing offline simulations in a cloud-native computing environment, with approximately 90% resource utilization for various workloads. Additionally, compared with existing VA techniques, PreVA reduces the number of resource overloads and service disruptions by up to 40 and 409, respectively.
AB - Container resource autoscaling techniques efficiently manage container resources configured in a cloud-native computing environment. The vertical autoscaling (VA) technique provides resource elasticity by resizing a container resource in response to a generated load. However, VA techniques demonstrate inefficient scaling performance for workloads with patterns that differ from past ones because they only consider patterns of past resource usage and operate based on reactive mechanisms. Additionally, the service is temporarily disrupted by deleting and recreating containers when resources are resized. Therefore, this study proposes a predictive vertical autoscaler (PreVA) that efficiently utilizes resources and ensures service sustainability under various workload patterns. PreVA extracts temporal features from collected CPU and memory usage metrics and then trains a multi bidirectional gated recurrent unit model to forecast future resource usage with high accuracy. PreVA also utilizes forecasted resource usage to calculate optimal resource sizes for future workloads. Finally, PreVA performs rolling updates to resize resources and ensure service sustainability. PreVA is validated by performing offline simulations in a cloud-native computing environment, with approximately 90% resource utilization for various workloads. Additionally, compared with existing VA techniques, PreVA reduces the number of resource overloads and service disruptions by up to 40 and 409, respectively.
KW - Cloud Computing
KW - Container Resource Autoscaling
KW - Resource Management
KW - Time-Series Forecasting
UR - http://www.scopus.com/inward/record.url?scp=85202302330&partnerID=8YFLogxK
U2 - 10.22967/HCIS.2024.14.041
DO - 10.22967/HCIS.2024.14.041
M3 - Article
AN - SCOPUS:85202302330
SN - 2192-1962
VL - 14
JO - Human-centric Computing and Information Sciences
JF - Human-centric Computing and Information Sciences
M1 - 41
ER -