In a VMware vSphere environment, storage connectivity instability caused by Fiber Channel (FC) I/O aborts can lead to severe ESXi host management failures. You may experience the following symptoms:
Review the following logs for evidence of FC I/O aborts:
ESXi 8.x
The issue is caused by constant Fiber Channel (FC) I/O aborts on specific Host Bus Adapter (HBA) adapters. These aborts trigger VMFS heartbeat timeouts, causing the ESXi host to lose connectivity to storage volumes. This ultimately stalls the management agents due to severe I/O threads being stuck waiting for storage responses.
To restore stability and storage connectivity, follow these recovery procedures:
1. Identify Affected HBA Adapters:
Review vmkernel.log and vobd.log for the following indicators:
2. Reset HBA via CLI
Attempt to re-initialize the HBA driver and hardware without a full host reboot. Run the following command via the ESXi CLI for the affected adapter:
Review this command before running it.
localcli storage san fc reset -A <vmhba_ID>
Note: Replace <vmhba_ID> with the affected adapter (e.g., vmhba1).
3. Coordinate SAN Fabric Review: If resets do not stabilize the links, work with your SAN team to:
If resets do not stabilize the links:
4. Perform Host Reboot:
If management agents remain unresponsive after an HBA reset, a hard reboot of the ESXi host is required to clear stuck I/O threads.