esxcli network nic stats get -n vmnic<#>Receive CRC errors: <####> <----- This number increased
VMware NSX
VMware vSphere ESXi
vmnicX) that is dropping packets due to CRC errors.The CRC stands for "Cyclic Redundancy Check". The FCS (Frame Check Sequence) field contains a 4-byte CRC value used for error checking. When a source host assembles a packet, it performs a CRC calculation on all fields in the packet except the Preamble, SFD (Start Frame Delimiter), and FCS using a predetermined algorithm. The source host stores the value in the FCS field and transmits it as part of the packet. When the packet is received by the destination host, it performs a CRC test again by using the same algorithm. If the CRC value calculated at the destination host does not match the value in the FCS field, the destination host discards the packet, considering this as a CRC Error.
To stabilize the BGP sessions and address the host hardware faults, follow the below steps:
Migrate the Edge node: Migrate the Edge node to a different stable ESXi host. This moves the routing workload off the faulty hardware and immediately stabilizes the BGP sessions.
Engage your hardware vendor: Work with your hardware vendor to troubleshoot the physical hardware and the CRC errors occurring on the NIC cards.
Upgrade firmware and drivers: Upgrade the NIC drivers and firmware to the latest compatible combination. Ensure the ESXi host is completely stable before returning it to the cluster for production workloads.