High latency is observed on the virtual machines residing on the ESXi host.
In the /var/log/vmkernel.log file of the ESXi host, entries similar to the following are seen:
####-##-#####:##:##.### <hostname> vmkernel: 116:03:44:19.039 cpu4:4196)<6>qla2xxx 0000:0f:00.0: scsi(6:0:152): Abort command issued -- 1 67a23dcd 2002.
####-##-#####:##:##.### <hostname> vmkernel: 116:03:44:19.039 cpu4:4100)NMP: nmp_CompleteCommandForPath: Command 0x2a (0x4100020e0b00) to NMP device "sym.##################" failed on physical path "vmhba2:C0:T0:L152" H:0x2 D:0x0 P:0x0 Possible sense data: 0x0 0x0 0x0.
####-##-#####:##:##.### <hostname> vmkernel: 116:03:44:19.039 cpu4:4100)WARNING: NMP: nmp_DeviceRequestFastDeviceProbe: NMP device "sym.##################" state in doubt; requested fast path state update...
####-##-#####:##:##.### cpu49:2099091)ScsiDeviceIO: 4115: Cmd(0x45cbdcdf6848) 0x88, CmdSN 0x34d from world 9366736 to dev "naa.600aXXXXXXXXXXXXXXXXXXXXXXXXXXXX" failed H:0x2 D:0x0 P:0x0
####-##-#####:##:##.### cpu169:2099095)WARNING: ScsiDeviceIO: 1513: Device naa.600aXXXXXXXXXXXXXXXXXXXXXXXXXXXX performance has deteriorated. I/O latency increased from average value of 1065 microseconds to 22027 microseconds.
####-##-#####:##:##.### cpu50:8436000)qlnativefc: vmhba3(33:0.0): qlnativefcStatusEntry:1922:(5:45) Dropped frame(s) detected(524288 of 524288 bytes).
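As a rough aid for spotting the latency warning among such entries, the Python sketch below extracts the device name and the two latency figures from a "performance has deteriorated" line. The function name `parse_latency_warning` and the regular expression are illustrative assumptions derived only from the sample entry, not from a documented log format.

```python
import re

# Pattern derived from the sample ScsiDeviceIO warning; this is an assumption,
# not a documented vmkernel.log format.
LATENCY_RE = re.compile(
    r"Device (?P<device>\S+) performance has deteriorated\. "
    r"I/O latency increased from average value of (?P<avg>\d+) microseconds "
    r"to (?P<cur>\d+) microseconds"
)

def parse_latency_warning(line):
    """Return (device, average_us, current_us), or None if the line does not match."""
    m = LATENCY_RE.search(line)
    if m is None:
        return None
    return m.group("device"), int(m.group("avg")), int(m.group("cur"))

sample = (
    "cpu169:2099095)WARNING: ScsiDeviceIO: 1513: Device "
    "naa.600aXXXXXXXXXXXXXXXXXXXXXXXXXXXX performance has deteriorated. "
    "I/O latency increased from average value of 1065 microseconds "
    "to 22027 microseconds."
)
device, avg_us, cur_us = parse_latency_warning(sample)
# Print the device, both latency values, and how many times the latency grew.
print(device, avg_us, cur_us, round(cur_us / avg_us, 1))
```

For the sample entry the latency grew by a factor of roughly 20, which is the kind of jump that matches the high latency seen in the virtual machines.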
VMware vSphere ESXi 7.x
VMware vSphere ESXi 8.x
The QLogic driver issues an abort for a command on scsi(6:0:152), which translates to LUN 152:
<YYYY-MM-DD>T<time> <hostname> vmkernel: 116:03:44:19.039 cpu4:4196)<6>qla2xxx 0000:0f:00.0: scsi(6:0:152): Abort command issued -- 1 67a23dcd 2002
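The translation from the scsi(...) token to a LUN number can be sketched in Python as follows. The helper name `lun_from_scsi_token` is hypothetical. Only the last field is interpreted, because it is the only one the text identifies (as the LUN); the first two fields are matched but deliberately left unnamed.

```python
import re

# Matches the "scsi(a:b:c)" token as it appears in the sample qla2xxx line.
SCSI_TOKEN_RE = re.compile(r"scsi\((\d+):(\d+):(\d+)\)")

def lun_from_scsi_token(line):
    """Return the last field of the first scsi(a:b:c) token as an int, or None."""
    m = SCSI_TOKEN_RE.search(line)
    return int(m.group(3)) if m else None

sample = (
    "vmkernel: 116:03:44:19.039 cpu4:4196)<6>qla2xxx 0000:0f:00.0: "
    "scsi(6:0:152): Abort command issued -- 1 67a23dcd 2002."
)
print(lun_from_scsi_token(sample))  # 152
```

The result agrees with the L152 component of the physical path "vmhba2:C0:T0:L152" reported in the NMP entry for the same event.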
The failed command is 0x2a, which is a 10-byte WRITE command. Our mid-layer returns a Host status of 0x2, which translates to DID_BUS_BUSY. This is a catch-all status produced as a direct result of the QLogic driver aborting the command:
<YYYY-MM-DD>T<time> <hostname> vmkernel: 116:03:44:19.039 cpu4:4100)NMP: nmp_CompleteCommandForPath: Command 0x2a (0x4100020e0b00) to NMP device "sym.##################" failed on physical path "vmhba2:C0:T0:L152" H:0x2 D:0x0 P:0x0 Possible sense data: 0x0 0x0 0x0.
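To make the decoding of this entry concrete, the following Python sketch pulls the opcode, the physical path, and the H/D/P values out of an nmp_CompleteCommandForPath line. The function name `decode_nmp_failure` and the regular expression are assumptions based on the sample line. The lookup tables contain only the two values discussed here (0x2a and host status 0x2), and any other code is returned as raw hex rather than guessed. The H, D, and P fields are treated as host, device, and plugin status respectively.

```python
import re

# Only the values named in the text are mapped; anything else is reported raw.
OPCODES = {0x2A: "WRITE(10)"}
HOST_STATUS = {0x2: "DID_BUS_BUSY"}

# Pattern derived from the sample NMP line; an assumption, not a documented format.
NMP_RE = re.compile(
    r"Command (?P<op>0x[0-9a-fA-F]+) .*?"
    r'failed on physical path "(?P<path>[^"]+)" '
    r"H:(?P<h>0x[0-9a-fA-F]+) D:(?P<d>0x[0-9a-fA-F]+) P:(?P<p>0x[0-9a-fA-F]+)"
)

def decode_nmp_failure(line):
    """Return a dict describing the failed command, or None if the line does not match."""
    m = NMP_RE.search(line)
    if m is None:
        return None
    op = int(m.group("op"), 16)
    host = int(m.group("h"), 16)
    return {
        "opcode": OPCODES.get(op, hex(op)),
        "path": m.group("path"),
        "host_status": HOST_STATUS.get(host, hex(host)),
        "device_status": int(m.group("d"), 16),
        "plugin_status": int(m.group("p"), 16),
    }

sample = (
    'NMP: nmp_CompleteCommandForPath: Command 0x2a (0x4100020e0b00) to NMP device '
    '"sym.##################" failed on physical path "vmhba2:C0:T0:L152" '
    'H:0x2 D:0x0 P:0x0 Possible sense data: 0x0 0x0 0x0.'
)
print(decode_nmp_failure(sample))
```

A device status and plugin status of zero alongside a non-zero host status is consistent with the failure originating at the host adapter level rather than being reported by the storage device itself.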
NMP then sends a TUR (TEST_UNIT_READY) command down the path on which the command did not complete, to ensure that this path is still good to use. A TUR command is normally issued every 300 seconds down each path as part of the path evaluation code (Disk.PathEvalTime). However, in this instance a TUR command is issued immediately because of the failed command:
<YYYY-MM-DD>T<time> <hostname> vmkernel: 116:03:44:19.039 cpu4:4100)WARNING: NMP: nmp_DeviceRequestFastDeviceProbe: NMP device "sym.##################" state in doubt; requested fast path state update...