An error occurred while saving: 110 (Connection timed out). An error occurred while taking a snapshot: Unable to save snapshot file. An error occurred while saving snapshot file "/vmfs/volumes/[Datastore]/[VM-Name]/[VM-Name]-Snapshot.vmsn
/vmfs/volumes/[datastore]/[vm_name]/vmware.log file during the same time the snapshot with memory is taken, we see the below events:-YYYY-MM-DDTHH:MM:SSZ In(05) worker-###### - MainMem: Failed to write memory pages [306560,306687] at 0: Timeout.
YYYY-MM-DDTHH:MM:SSZ In(05) vcpu-0 - MainMem: Failed to write memory pages [439680,439807] at 0: Timeout.
YYYY-MM-DDTHH:MM:SSZ In(05) worker-###### - MainMem: End lazy IO (346752 done, sync = 0, error = 2).
YYYY-MM-DDTHH:MM:SSZ In(05) vmx - MainMem: Completed pending lazy checkpoint save (0).
YYYY-MM-DDTHH:MM:SSZ In(05) vmx - DUMPER: Partial save write failure.
YYYY-MM-DDTHH:MM:SSZ In(05) vmx - SnapshotVMXTakeSnapshotWork: Transition to mode 1.
YYYY-MM-DDTHH:MM:SSZ In(05) vmx - SnapshotVMXTakeSnapshotComplete: Done with snapshot 'VM Snapshot : 2312
YYYY-MM-DDTHH:MM:SSZ In(05) vmx - SnapshotVMXTakeSnapshotComplete: Snapshot 2312 failed: Unable to save snapshot file (13).
YYYY-MM-DDTHH:MM:SSZ In(05) vmx - SnapshotVMXTakeSnapshotComplete: Cleaning up incomplete snapshot 2312.
YYYY-MM-DDTHH:MM:SSZ In(05) vmx - SnapshotVMXTakeSnapshotComplete: Deleting incomplete snapshot 2312.
"state in doubt; requested fast path state update" error from the host's vmkernel logs in the /var/run/log/vmkernel.log file path :-YYYY-MM-DDTHH:MM:SSZ Wa(180) vmkwarning: cpu23:2098278)WARNING: NMP: nmp_DeviceStartLoop:790: NMP Device "eui.#######################" is blocked. Not starting I/O from device.
YYYY-MM-DDTHH:MM:SSZ Wa(180) vmkwarning: cpu77:2098271)WARNING: NMP: nmp_DeviceRequestFastDeviceProbe:235: NMP device "eui.####################" state in doubt; requested fast path state update
During the same time the snapshot gets failed, we see the "state in doubt; requested fast path state update" error from the vmkernel logs.
Host Bus Adapter (HBA) driver aborts a command because the command took longer than the timeout period of 5 seconds to complete. An operation can take longer than the timeout period because of several reasons including:
Engage your storage vendor to check if the storage is getting overloaded if it is observed that the device utilization is hitting 100%.
If storage utilization is under control and IO aborts are observed, engage your fabric vendor and ensure that the fabric is healthy
"state in doubt; requested fast path state update" error in vmkernel.log