To resolve this issue, determine the worlds, such as the virtual machine, user-worlds, or system processes, that are accessing the VMFS volume(s) in an APD state using this command:
# localcli storage core device world list
You see an output similar to:
Device World ID Open Count World Name
----------------------------------------------------------------------
naa.xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 2060 1 idle0
You can then forcefully stop all worlds that access the device which is in the APD state and remove the dead paths.
To stop all worlds that access the device in an APD state and to remove the dead paths:
- Run this command to list the virtual machine worlds currently running on the host:
# localcli vm process list
You see an output similar to:
<VM-NAME>:
World ID: ######
Process ID: 0
VMX Cartel ID: ######
UUID: ## ## ## ## ## ## ## ##-## ## ## ## ## ## ## ##
Display Name: <VM-NAME>
Config File: /vmfs/volumes/########-########-####-############/<VM-NAME>/<VM-NAME>.vmx
- Run this command to kill the virtual machine World ID: processes:
# localcli vm process kill --type=force --world-id <World ID>
For example:
# localcli vm process kill --type=force --world-id=12346
- Rescan to remove dead paths using this command:
# localcli storage core adapter rescan -A vmhbaX -t delete
The ESXi host itself may have open handles to the affected VMFS volume which is in the APD state. WorkID 2060 (idle0) is a core system process that is used when a VMFS volume is opened or mounted to an ESXi host. If you attempt to remove dead paths while the ESXi host has open handles, you see this error:
Errors:
Rescan complete, however some dead paths were not removed because they were in use by the system. Please use the 'storage core device world list' command to see the VMkernel worlds still using these paths.
Error while scanning interfaces, unable to continue. Error was Not all VMFS volumes were updated; the error encountered was 'IO was aborted by VMFS via a virt-reset on the device'.
Note: This process cannot be forcefully shutdown or terminated as it is a critical system process. The only option available is to reboot all affected ESXi host(s).