When upgrading to Resource/Isolation Segment 10.4.0 or 10.4.1 from 10.3.x, two distinct failure modes occur regarding the isolated_router job:
om config-template and applying the configuration via om configure-product. Isolated routers are deleted during an upgrade to Resource/Isolation Segment 10.4.0 or 10.4.1. Reachability to the public Isolation Segment is lost. Cloud platform backends for the load balancer appear empty. Inspection of the update logs reveals the isolated_router instance group being scaled from its original count to zero. E.ginstance_groups:
- name: isolated_router
- instances: 3
+ instances: 0isolated_router to 0 in the 10.3.x environment and staging 10.4.0 or 10.4.1 normally without using config-template.
The 10.4.x release introduces a new boolean tile property, routing_enable_gorouter_vms ("Enable routing VMs"), which controls whether the isolated_router job deploys.
When set to false, a configuration constraint forces the isolated_router instance count to 0, deleting the VMs. When set to true, the constraint lifts and the instance count defaults to 3. A migration script handling the 10.3.x to 10.4.x upgrade path unconditionally sets this new property to true without inspecting the prior isolated_router instance count.
routing_enable_gorouter_vms property lacks a default value in the tile configuration structure. Therefore, the om config-template command outputs the property as absent or false. Applying that configuration via om configure-product overrides the migration script's true value back to false. The constraint then forces isolated_router instances to 0, and Apply Changes deletes all router VMs.routing_enable_gorouter_vms to true regardless of the prior instance count. With the property set to true, the constraint no longer applies, causing the instance count to reset to the tile default of 3.This will be addressed in upcoming patch release. Fix would contain a corrected migration script that conditions the property value on the prior instance count and adds a default true value to the property configuration.
To apply immediate workarounds on10.4.0 or 10.4.1 before upgrading:
Workaround for Failure Mode 1 (Routers Deleted):
Navigate to the Resource Segment tile in Ops Manager.
Locate the Enable routing VMs property within the networking configuration.
Ensure the checkbox is enabled (set to true).
Verify the Resource Config for the Isolated Router reflects the correct number of instances (e.g., 3).
Apply Changes.
Workaround for Failure Mode 2 (Unexpected Routers Created):
Stage the 10.4.0 or 10.4.1 release.
Before running Apply Changes, verify the Enable routing VMs property within the networking configuration.