Web Application Resource Requirements Estimation Based on the Workload Latent Features
Abstract
Most cloud computing platforms offer reactive resource auto-scaling mechanisms for dealing with variable traffic patterns to deliver the desired QoS properties while keeping low provisioning costs. However, a range of scenarios have not been fully addressed by the current auto-scaling solutions, particularly dealing with a rapid increase in workload and the risk of thrashing due to frequent workload variations. A reactive system is vulnerable in such conditions. Realizing the full potential of auto-scaling still remains challenging particularly due to the need of accurately estimating the application resource requirements for time-varying workload patterns. In this work, we propose and evaluate a novel method using only application access logs to estimate more accurately the hardware resource demands and application response time. In particular, we propose novel workload latent features which we compute by applying unsupervised learning on the access logs. We use these latent features to estimate the application hardware resource requirements and response time for various workload patterns. We evaluate the proposed method using multiple benchmark web applications and compare it with current state-of-the-art. Extensive experimental evaluations show an excellent performance of our proposed workload latent features in estimating response time, CPU, memory, and bandwidth utilization. 2008-2012 IEEE.
Collections
- Computer Science & Engineering [2402 items ]