How does Azure VMSS handle existing requests while scaling down or scaling in?

Asked 4/2, 2019 at 16:21 Answered 4/2, 2019 at 20:14

Solved azure azure-vm-scale-set horizontal-scaling

I have a VMSS with an instance count of, say, 3. Let's say I specified that if CPU utilization is <20%, the instance count should be reduced from 3 to 1. Assume that these 3 instances are serving requests and that each request takes 60 seconds to complete.

Assume that at this moment CPU utilization reaches 15%, so the instance count should be reduced by 2. What happens to the existing requests being served by the two instances that are removed? Do these instances hand their ongoing work over to another instance, or is the count not reduced until they complete their ongoing requests?

I have already attached the scale set to an Application Gateway and enabled connection draining so that ongoing requests would not be dropped. But they are still being dropped. Since that fails, I am trying to do something with API Management revisions and versions.

Expectation: When a scale-down/scale-in happens, ongoing requests should not be dropped.

Adviser asked 4/2, 2019 at 16:21 Comment(1)

How did you solve the issue? Can you also tell me how to connect a VMSS with Application Gateway? I am getting the issues mentioned here (reddit.com/r/AZURE/comments/cfjnn6/…) – Olshausen 21/7, 2019 at 1:19

The scale set has no understanding of what is going on inside your VMs or which requests are in flight. When the scale-down threshold is reached, VMs are removed and any requests they were still serving will fail.

You should be using a load balancer in front of your scale set to ensure that traffic is no longer sent to the VMs being shut down. Your application needs to be built to retry requests if they fail due to scale down.
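As a rough illustration of the retry advice, here is a minimal client-side sketch in Python. The names `TransientError` and `call_with_retry` are hypothetical, not part of any Azure SDK, and it assumes the request is idempotent, so sending it again after an instance disappears is safe.

```python
import random
import time


class TransientError(Exception):
    """Hypothetical marker for failures worth retrying (e.g. a dropped connection)."""


def call_with_retry(send, attempts=4, base_delay=0.5, sleep=time.sleep):
    """Call send() until it succeeds or the attempts run out.

    Between tries, wait base_delay * 2**attempt seconds plus random jitter,
    so that many clients do not all retry at the same instant.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return send()
        except TransientError:
            if attempt == attempts - 1:
                raise  # out of retries: surface the last failure
            delay = base_delay * (2 ** attempt)
            sleep(delay + random.uniform(0, delay / 2))
```

Here `send` would wrap the real HTTP call and raise `TransientError` when the connection is reset or the gateway returns a 502/503. With the load balancer no longer routing to the removed VMs, the retried request should land on a surviving instance.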

Rhombohedral answered 4/2, 2019 at 20:14 Comment(3)

Thanks a lot, Mr. Sam, for your valuable comment. Could you please suggest any other workaround or idea for achieving a zero-downtime application, other than a Kubernetes (containerization) implementation? – Adviser 5/2, 2019 at 4:28

Build your application to retry requests if they fail. – Rhombohedral 5/2, 2019 at 8:19

There is a "termination notification" that can be picked up from inside the VM by querying the instance metadata endpoint with curl. It is a new feature: azure.microsoft.com/en-us/blog/… – Practical 8/7, 2020 at 8:7
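A minimal Python sketch of checking for such a notification from inside an instance, using the Scheduled Events endpoint of the instance metadata service. The API version, the helper names, and the instance name `vmss_1` are assumptions for illustration; terminate notifications also have to be enabled on the scale set, so check the current Azure documentation before relying on any of this.

```python
import json
import urllib.request

# Assumption: link-local metadata endpoint and API version; verify against current docs.
SCHEDULED_EVENTS_URL = (
    "http://169.254.169.254/metadata/scheduledevents?api-version=2020-07-01"
)


def fetch_scheduled_events(url=SCHEDULED_EVENTS_URL, timeout=5):
    """Fetch the scheduled events document; only works from inside an Azure VM."""
    request = urllib.request.Request(url, headers={"Metadata": "true"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def pending_terminations(document, instance_name):
    """Return the Terminate events in the document that target this instance."""
    return [
        event
        for event in document.get("Events", [])
        if event.get("EventType") == "Terminate"
        and instance_name in event.get("Resources", [])
    ]
```

An agent on each VM could call `pending_terminations(fetch_scheduled_events(), "vmss_1")` every few seconds and, when the list is non-empty, stop accepting new requests and let in-flight ones finish before the instance is deleted. How the application drains is left out here because it depends on the application.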
