During a host update using Lifecycle Manager (LCM), you see the host become unresponsive while VMs are being vMotioned attempting to enter Maintenance Mode. You may experience the following:
The operation is not allowed in the current state. Host <hostname> cannot enter maintenance mode due to host latch failure.vmkwarning: cpu160:2099710)WARNING: ScsiDeviceIO: 1780: Device naa.600############################# performance has deteriorated. I/O latency increased from average value of 1883 microseconds to 58345 microseconds.
Wa(###) Hostd[#######]: [Originator@#### sub=IoTracker] In thread #######, realpath("/vmfs/volumes/########-########-####-############/<VM directory>/<VM>.vmdk") took over 7 sec.ESXi 8.0
This issue typically occurs when high system latency or storage-related delays impact the responsiveness of the ESXi management service, hostd. As a result, the host becomes unresponsive to management operations while still appearing connected in vCenter.
The resolve the issue, perform the following steps:
Perform a hard reset or power cycle of the physical host if management agents (hostd) are completely unresponsive.
Resume the host update task in Lifecycle Manager.