A specific condition in the Emulex lpfc driver can lead to complete ESXi host management failure and Virtual Machine (VM) deadlocks. This typically occurs during SCSI device discovery when a storage controller is rejoined to a fabric, causing the driver's worker thread to become overwhelmed and leading to "XRI Starvation."
2026-05-14T14:16:23.953Z In(14) vobd[2097955]: [vmfsCorrelator] [vob.vmfs.heartbeat.timedout] 69e8dfd8-########-####-############ [DATASTOR_NAME]
2026-05-14T14:16:23.958Z Wa(180) vmkwarning: cpu46:2098429)WARNING: lpfc : vmhba4 lpfc_validate_fcp_abort:7541: 3111 Outstanding FCP I/O Abort Request still pending on io_buf 0x45daad3af430, xri x728
2026-05-14T14:16:17.923Z Wa(180) vmkwarning: cpu11:2098175)WARNING: lpfc : vmhba4 lpfc_sli4_eratt_read:8275: 2885 Port Status Event: port status reg 0x81800000, port smphr reg 0xc000, error 1=0x2e004a01, error 2=0x218
2026-05-14T14:18:23.064Z Wa(180) vmkwarning: cpu2:9761735)WARNING: ScsiDeviceIO: 13515: IO stuck on device naa.600507680c8104f8b80000000000037c for more than 120000 seconds
lpfc (Versions prior to 14.4.576.11)The lpfc worker thread processes fabric discovery and I/O completions. When a storage controller is re-added, the driver initiates a SCSI scan. If the REPORT LUN command fails or experiences high latency, the driver calls the ESX API vmk_ScsiScanAndClaimPaths() directly within the worker thread, causing it to hang. This leads to mailbox timeouts, buffer exhaustion (XRI Starvation) and a host-wide deadlock where all I/O is blocked.
The issue is resolved in lpfc driver version 14.4.576.11 and higher. The fix offloads the ESX API call to the pathclaim world, preventing discovery failures from blocking the primary driver service thread.
lpfc driver version running on the ESXi host.If an immediate driver update is not possible: