When attempting to expand an Edge Cluster from SDDC Manager, it fails during the initial checks with the message "Validation for Edge cluster specification failed, please check the table for details."
In the SDDC Manager /var/log/vmware/vcf/domainmanager/domainmanager.log
the following log lines are found:
2024-08-19T18:34:35.727+0000 DEBUG [vcf_dm,############################d160,##f1] [c.v.v.c.n.s.c.c.NsxtManagerHostSwitchOperationsPolicyHandler,dm-exec-4] nsx-policy-uplinkprofile getAllUplinkHostSwitchProfiles
2024-08-19T18:34:35.747+0000 DEBUG [vcf_dm,
############################
d160,##f1] [c.v.v.c.n.s.c.c.ApiConnection,dm-exec-4] Closed ApiClient connection.2024-08-19T18:34:35.749+0000 ERROR [vcf_dm,
############################
d160,##f1] [c.v.v.n.c.v.v.NsxTEdgeClusterValidator,dm-exec-4] Exception during cluster resources check for expanding edge cluster com.vmware.evo.sddc.orchestrator.exceptions.OrchTaskException: Failed to discover resources for NSX edge cluster ########-####-####-####-########98af
at com.vmware.vcf.common.fsm.plugins.nsxt.action.DiscoverEdgeClusterNsxtResourcesAction.execute(DiscoverEdgeClusterNsxtResourcesAction.java:196)
at com.vmware.vcf.nsxtedgeclustermanager.controller.v1.validation.NsxTEdgeClusterValidator.validateAsyncEdgeExpansionSpec(NsxTEdgeClusterValidator.java:1311)
at com.vmware.vcf.nsxtedgeclustermanager.controller.v1.NsxTEdgeClusterController.lambda$triggerExpansionValidationAsync$4(NsxTEdgeClusterController.java:700)
at java.base/java.util.concurrent.CompletableFuture$AsyncRun.run(CompletableFuture.java:1804)
at com.vmware.vcf.common.tracing.TraceRunnable.run(TraceRunnable.java:59)
at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
at java.base/java.lang.Thread.run(Thread.java:833)
Caused by: java.lang.NullPointerException: Cannot invoke "java.lang.Integer.intValue()" because the return value of "com.vmware.vcf.common.nsxt.sdk.client.model.UplinkHostSwitchProfile.getMtu()" is null
at com.vmware.vcf.common.fsm.plugins.nsxt.action.DiscoverEdgeClusterNsxtResourcesAction.formOverlayVlanToUplinkProfileIdMap(DiscoverEdgeClusterNsxtResourcesAction.java:341)
at com.vmware.vcf.common.fsm.plugins.nsxt.action.DiscoverEdgeClusterNsxtResourcesAction.discoverEdgeClusterResources(DiscoverEdgeClusterNsxtResourcesAction.java:232)
at com.vmware.vcf.common.fsm.plugins.nsxt.action.DiscoverEdgeClusterNsxtResourcesAction.execute(DiscoverEdgeClusterNsxtResourcesAction.java:190)
... 7 common frames omitted
SDDC
VMware NSX
In NSX Manager, the Uplink Profile in use by the edge cluster has a blank value. This is acceptable and legal in NSX as a blank value ensures the Uplink Profile accepts the default value of 9000. The error message is caused by SDDC Manager rejecting the null value.
This will be resolved in an upcoming release of SDDC Manager.
Workaround: Via the NSX Manager GUI, configure an MTU value in the Uplink Profile in use by the NSX Edge Cluster specified in the error message.
Guidance for setting the correct MTU size in an NSX environment can be found in Guidance to Set Maximum Transmission Unit.