Error Details:
From HCX Manager -> Administration -> Alerts, the status reports:
“Component Type: VirtualMachine
Message: Appliance (<Fleet_Appliance_Name>) status changed from unknown to down. Overall transport tunnel status is down. Overall encryption tunnel status is down. Service pipeline status is down.
InstanceID: <ID>”
From HCX Manager -> Interconnect -> Service Mesh -> View Appliances. Expand the appliance with status DOWN.
Sometimes, the tunnel may resume connectivity with a status of UP/Degraded. However, if it remains DOWN/Degraded, you can initiate "Run Diagnostics" to pinpoint the exact location of the traffic interruption. To execute Service Mesh Diagnostics, go to Interconnect -> Service Mesh -> Run Diagnostics.
VMware HCX
The "Overall transport tunnel status is down" error indicates an underlay network issue. Specifically, the uplink IP used for transport (UDP port 4500) is unable to communicate between the Fleet Appliances.
To troubleshoot the error related to HCX Service Mesh IX/NE appliances, you can follow these steps:
SSH to HCX Manager, access ccli, list appliances, navigate to logs, and search for errors:
- ssh admin@hcx-manager-ip
- Once connected, access ccli and list appliances ccli list
- Identify the IX/NE appliance (replace # with the specific appliance identifier) go #
- SSH to appliancessh
-Grep for "DOWN" in messagesgrep DOWN /var/log/messages
# Navigate to log directory and search for 'Traceroute' in messages log cd /var/log/ less messages (Find String "Traceroute" you can do this by using / or ?).
Use the following command to test port connectivitycurl -kv telnet://<IP address>:<port number>
Analyze the logs to pinpoint where the traffic flow is obstructed
less command, use N to move to the next occurrence and review entries sequentially.Further troubleshooting based on findings:
For information on the required ports, access VMware Ports and Protocols and Network Diagrams for VMware HCX.