A vSAN cluster reports a "Yellow" health status for the MTU check (ping with large packet size) test. Concurrently, one ESXi host logs indicate intermittent link flapping (Link Down/Up) on physical uplinks, specifically affecting the Broadcom BCM57416 controller.
Similar logs in vobd.log of ESXi host:
[esx.problem.net.dvport.redundancy.lost] Lost uplink redundancy on DVPorts... Physical NIC vmnic# is down.
[vob.net.dvport.uplink.transition.up] Uplink: vmnic# is up.
VMware vSAN 8.x
Hardware: Broadcom BCM57416 NetXtreme-E 10GBASE-T RDMA Ethernet Controller
Driver: bnxtnet version 233.0.156.0
Current Firmware is reported as out-of-date
Target Firmware: 23.31.18.10 (Per VMware Compatibility Guide)
The issue is primarily caused by an outdated firmware version on the Broadcom BCM57416 network controllers. This specific firmware level is known to exhibit stability issues that lead to intermittent physical link flapping (vmnic down/up events). When a link flaps or fails to consistently handle large frames during a flapping event, the vSAN MTU health check (which utilizes 9000-byte pings for Jumbo Frame validation) fails, triggering the "Yellow" alarm.
To resolve this issue, the firmware for all Broadcom BCM57416 adapters must be updated to the version validated on the VMware Compatibility Guide.