Google announced the addition of autoscaling to Google Compute Engine. It's now in beta, and is available to everyone.
The autoscaling, which Google describes as intelligent and horizontal, is designed by the same team behind the scaling infrastructure for Google Search and Gmail.
"Autoscaling allows customers to build more cost effective and resilient applications," says software engineer Filip Balejko. "Using Compute Engine Autoscaling, you can ensure that exactly the right number of Compute Engine instances are available at any given time to handle your application’s workload. This saves you money when your application’s usage is low, and ensures your application is responsive when utilization is high."
"The Compute Engine Autoscaler is able to intelligently and dynamically scale the number of instances in response to different load conditions by defining the ideal utilization level of a group of Compute Engine instances," he adds. "This means that when the actual utilization of your service increases or decreases, Autoscaler will detect the change and adjust the number of running instances to match. Autoscaler can respond to a number of different metrics such as CPU load, QPS on a HTTP Load Balancer and metrics defined using the Cloud Monitoring service."
The autoscaler tool chooses the right number of instances, and adapts automatically bsed on how far the current state is from the desired target.
Google demoed the product at Cloud Platform Live.
Image via Google