NSX-T
In the event of a network disruption where Service Engines (SEs) get disconnected, the Controller attempts to scale out the Virtual Services (VS) to different Service Engines.
After the scale-out, the Controller initiates a scale-in for the older Service Engine. Before doing so, it checks if cloud programming is complete on all the SEs. However, if cloud programming is not completed for the newly added SE, it can prevent the scale-out from being successful.
If the old SE reconnects within a few seconds and the scale-in never occurs, it can result in a situation where the VIPs are asymmetrically distributed.
Due to a race condition, when an SE disconnects and reconnects with the Controller some time later, the virtual service placed on that SE can end up in a state where it is placed on more SEs than the number of SEs requested configured in the SE Group. In the case of Parent-Child VS it can cause traffic issues if the child VS is disabled/enabled in this state, causing the Parent and Child VS to be placed on a different set of SEs. This can lead to the child FQDN being handled sometimes by the Parent VS and sometimes by the Child VS.
This bug is for both SNI VS and shared vip scenario.
Fix Version: 22.1.4, 22.1.6, 30.1.1, 30.2.1