When you use vSphere Lifecycle Manager to remediate a vSAN cluster and a host is placed into maintenance mode for patching, remediation may fail with the following errors:
Remediation failed
vSAN health check 'vSAN: Basic (unicast) connectivity check' reported an issue for cluster [Redacted]. Check the vSAN health.
vSAN health check 'vSAN: MTU check (ping with large packet size)' reported an issue for cluster [Redacted]. Check the vSAN health.
vSAN health check 'vMotion: Basic (unicast) connectivity check' reported an issue for cluster [Redacted]. Check the vSAN health.
vSAN health check 'vMotion: MTU check (ping with large packet size)' reported an issue for cluster [Redacted]. Check the vSAN health.
Health Check for [Redacted] failed.
[Redacted].com - Skipped remediation for this host.
Host [Redacted] was not processed, the reason: 'Health Check for...'
vSAN
The alarm failures are attributed to physical network latency or configuration issues outside of the VMware software stack.
Follow the steps in KB article https://knowledge.broadcom.com/external/article/389049/vsan-skyline-health-reports-errors-vsan.html
To troubleshoot, start by running the vSAN Skyline Health tests in vCenter.
If the problem persists, ping the faulty host(s) from a working host at both 1500 MTU and 9000 MTU. The matching ping payload sizes are 1472 and 8972 bytes, because 28 bytes of each packet are taken by the IP and ICMP headers.
Log in to the ESXi host(s) via PuTTY/SSH to access the CLI.
Test connectivity between the hosts' vSAN vmkernel ports with vmkping.
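The two ping tests can be sketched as the shell snippet below. This is a minimal sketch under stated assumptions, not an official procedure: the interface name vmk1 and the peer address 192.0.2.10 are placeholders and must be replaced with the vSAN-tagged vmkernel interface of the working host and the vSAN vmkernel IP of the faulty host. The payload size is the MTU minus the 28 bytes of IPv4 and ICMP headers, and the don't-fragment flag makes an oversized packet fail instead of being split silently.

```shell
#!/bin/sh
# Placeholders (assumptions): override with values from your environment.
VMK="${VMK:-vmk1}"          # vSAN-tagged vmkernel interface on this host
PEER="${PEER:-192.0.2.10}"  # vSAN vmkernel IP of the host under test

# ICMP payload that exactly fills one packet at the given MTU:
# MTU minus 20-byte IPv4 header minus 8-byte ICMP header.
payload_for_mtu() {
    echo $(( $1 - 28 ))
}

# Build the vmkping command line for one MTU.
# -I selects the vmkernel interface, -s sets the payload size,
# -d sets the don't-fragment bit, -c sets the packet count.
vmkping_cmd() {
    echo "vmkping -I $VMK -s $(payload_for_mtu "$1") -d -c 3 $PEER"
}

# Run the tests only where vmkping exists (that is, on an ESXi host).
if command -v vmkping >/dev/null 2>&1; then
    for mtu in 1500 9000; do
        cmd=$(vmkping_cmd "$mtu")
        echo "Running: $cmd"
        $cmd
    done
fi
```

Read the ping statistics line of each run to judge the result rather than relying on the exit status of the command.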
Example output of failed vmkping tests:
PING xx.xx.xxx.xx (xx.xx.xxx.xx): 1472 data bytes
--- xx.xx.xxx.xx ping statistics ---
3 packets transmitted, 0 packets received, 100% packet loss
PING xx.xx.xxx.xx (xx.xx.xxx.xx): 8972 data bytes
--- xx.xx.xxx.xx ping statistics ---
3 packets transmitted, 0 packets received, 100% packet loss
If either of the above tests fails, this confirms that an issue is preventing network connectivity between the vSAN vmkernel ports of the individual ESXi hosts.
Work with the physical network administrator to check the physical network upstream of the hypervisor and determine the underlying cause of the failing connectivity health checks.
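Which test fails helps narrow the fault: loss at the small packet size points to basic connectivity, while loss only at the large size usually points to an MTU that is not set consistently end to end. The snippet below is an illustrative sketch of that reasoning. The function names no_loss and classify are hypothetical helpers, and it assumes the statistics line has the same format as the example output above.

```shell
#!/bin/sh
# Succeed only if a ping statistics line reports 0% packet loss.
# The leading space keeps "100% packet loss" from matching.
no_loss() {
    case "$1" in
        *" 0% packet loss"*) return 0 ;;
        *) return 1 ;;
    esac
}

# $1 = statistics line from the 1472-byte test
# $2 = statistics line from the 8972-byte test
classify() {
    if ! no_loss "$1"; then
        echo "basic connectivity failure"
    elif ! no_loss "$2"; then
        echo "MTU failure"
    else
        echo "ok"
    fi
}

# Example: small packets pass, large packets are lost.
small="3 packets transmitted, 3 packets received, 0% packet loss"
large="3 packets transmitted, 0 packets received, 100% packet loss"
classify "$small" "$large"   # prints: MTU failure
```

In the failed example output shown earlier, both sizes report 100% packet loss, which this sketch would classify as a basic connectivity failure.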
https://knowledge.broadcom.com/external/article/391812/vsan-skyline-health-reports-errorsvsan-m.html
https://knowledge.broadcom.com/external/article/326954/troubleshooting-vsan-networking.html
https://knowledge.broadcom.com/external/article?articleNumber=379982
https://knowledge.broadcom.com/external/article/389049/vsan-skyline-health-reports-errors-vsan.html