Auto-Scaling
WHAT IS IT? [Azure Autoscale / Google Autoscaling]
Auto-scaling helps you increase the number of servers in action to ensure your system can handle all the traffic coming its way. There are three distinct autoscaling options on AWS at present:
1 : Amazon EC2 Scaling focuses on EC2—and nothing else.
2 : Application autoscaling is used to control that which isn’t EC2. You can use this when you’re working with DynamoDB, elastic containers, Apache tools, and so on.
3 : AWS Auto Scaling provides a scalable predictive feature and provides a central way to manage scalability.
When you’re configuring an auto-scaling group, you’ll want to think of four parameters: how many servers do you want to maintain uptime, do you want to adjust your server count manually, do you want to schedule when to scale up or down, or would you like it to be based on conditions with your product performance.
WHAT’S THE FUSS?
Let’s say you’ve just launched a new product, and suddenly you have a swarm of users frantically trying to pre-order it through your app or website. If you haven’t configured your auto-scaling correctly—say, that you expected them to show at a different time instead of right at an announcement—you’re about to collapse under the weight of your success🤡
Not only do you have to make the right call on which auto-scaling option you want to use, but you’ll also have to ensure you pick the right route for auto-scaling. Having the wrong configuration or selections here can be quite costly, and you’re probably going to be attempting a differential diagnosis of something right in the middle of some problem with your scaling.