In the /var/run/log/vmkernel.log file, the following warning messages may be seen:

0xB 44/00 = Aborted Command / Internal Target Failure

YYYY-MM-DDTHH:MM:SSZ cpu##:#######)ALERT: hostd performance has degraded due to high system latency
-----
YYYY-MM-DDTHH:MM:SSZ cpu##:#######)ScsiDeviceIO: ####: Cmd(#x############) #x##, CmdSN #x######## from world ####### to dev "naa.#######" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0xb 0x44 0x0
YYYY-MM-DDTHH:MM:SSZ cpu##:#######)NMP: nmp_ThrottleLogForDevice:####: Cmd #x## (#x############, #######) to dev "naa.#######" on path "vmhba#:##:##:###" Failed:
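
When many ScsiDeviceIO failures are logged, it can help to reduce each line to the device and its status fields. The following is a minimal sketch and not part of the documented procedure: scsi_failures is a hypothetical helper name, and the pattern assumes the ScsiDeviceIO line layout shown above. It reads log text on standard input and prints one summary line per failed command.

```shell
# Hypothetical helper (assumption, not a VMware tool):
# summarize failed ScsiDeviceIO lines read from stdin as
#   <device> H:<host> D:<device> P:<plugin> sense:<key>/<asc>/<ascq>
scsi_failures() {
    # Lines that do not match the assumed layout are silently skipped (-n ... p).
    sed -n 's|.*to dev "\([^"]*\)" failed H:\(0x[0-9a-fA-F]*\) D:\(0x[0-9a-fA-F]*\) P:\(0x[0-9a-fA-F]*\).*sense data: \(0x[0-9a-fA-F]*\) \(0x[0-9a-fA-F]*\) \(0x[0-9a-fA-F]*\).*|\1 H:\2 D:\3 P:\4 sense:\5/\6/\7|p'
}
```

For example, scsi_failures < /var/run/log/vmkernel.log | sort | uniq -c would give a rough count per device and sense code. A sense value of 0xb/0x44/0x0 in the output corresponds to the Aborted Command / Internal Target Failure condition noted above.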
In /var/run/log/vmkwarning.log, messages like "state in doubt; requested fast path state update..." may appear, as well as messages stating that hostd was detected to be non-responsive and/or PDL (permanent device loss):

YYYY-MM-DDTHH:MM:SSZ cpu###:#######)WARNING: nfnic: <#>: fnic_abort_cmd: ####: Abort for cmd tag: #x### in pending state
YYYY-MM-DDTHH:MM:SSZ cpu###:#######)WARNING: NMP: nmp_DeviceRequestFastDeviceProbe:###: NMP device "naa.#######" state in doubt; requested fast path state update...
-----
YYYY-MM-DDTHH:MM:SSZ Al(###) vmkalert: cpu#:#######)ALERT: hostd detected to be non-responsive
-----
YYYY-MM-DDTHH:MM:SSZ Wa(###) vmkwarning: cpu#:#######)WARNING: NMP: nmp_PathDetermineFailure:####: Cmd (#x##) PDL error (0x5/0x25/0x0) - path vmhba#:C#:T#:L# device naa.#### - triggering path failover
YYYY-MM-DDTHH:MM:SSZ Wa(###) vmkwarning: cpu#:#######)WARNING: NMP: nmp_DeviceRetryCommand:###: Device "naa.####": awaiting fast path state update for failover with I/O blocked. No prior reservation exists on the device.

The hostd logs (/var/run/log/hostd.log) show increasing latency messages. This can happen even if the vmkernel logs do not show driver or SCSI I/O messages:

YYYY-MM-DDTHH:MM:SSZ Wa(###) Hostd[#######] [Originator@#### sub=IoTracker] In thread #######, stat("/vmfs/volumes/datastoreUUID/folderName/VMname-sesparse.vmdk") took over 43799 sec.
YYYY-MM-DDTHH:MM:SSZ Wa(###) Hostd[#######] [Originator@#### sub=IoTracker] In thread #######, stat("/vmfs/volumes/datastoreUUID/folderName/VMname-sesparse.vmdk") took over 43809 sec.

These may be accompanied by "ALERT: hostd detected to be non-responsive" messages in the vmkernel logs.

The issue may be caused by storage array performance degradation or a fabric-related issue. To resolve it:
Workarounds:
/etc/init.d/hostd restart
/etc/init.d/vpxa restart

esxcfg-mpath -L | grep naa.################################

localcli storage san fc reset -A vmhba1
localcli storage san fc reset -A vmhba2

/etc/init.d/hostd restart
/etc/init.d/vpxa restart

"state in doubt" conditions.

FC or FCoE), ensure HBA firmware and drivers are up to date and supported for the ESX version.
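
To judge whether "state in doubt" warnings are still being logged after a workaround, one option is to tally them per device. This is a minimal sketch under stated assumptions: state_in_doubt_counts is a hypothetical helper name, and the pattern assumes the nmp_DeviceRequestFastDeviceProbe message format shown in the vmkwarning.log excerpt. It reads log text on standard input and prints each device followed by the number of matching warnings.

```shell
# Hypothetical helper (assumption, not a VMware tool):
# count "state in doubt" warnings per NMP device from log text on stdin.
state_in_doubt_counts() {
    # 1. Keep only the device name from matching lines.
    # 2. Group identical names and count them.
    # 3. Print "<device> <count>" without uniq's leading padding.
    sed -n 's|.*NMP device "\([^"]*\)" state in doubt.*|\1|p' \
        | sort \
        | uniq -c \
        | awk '{ print $2, $1 }'
}
```

For example, running state_in_doubt_counts < /var/run/log/vmkwarning.log before and after the adapter reset gives two tallies to compare. Because the log is cumulative, a count that keeps growing for the same device suggests the condition has not cleared.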