"Configuration file cannot be found"
Most of crashed Windows VMs report BSOD and following error observed in vmware.log of crashed virtual machine:
YYYY-MM-DDTHH:MM:SS.XXXZ Wa(03) vcpu-5 - WinBSOD: Synthetic MSR[0x40000100] 0x7aYYYY-MM-DDTHH:MM:SS.XXXZ Wa(03)+ vcpu-5 -YYYY-MM-DDTHH:MM:SS.XXXZ Wa(03) vcpu-5 - WinBSOD: Synthetic MSR[0x40000101] 0xffffe44c84ee8878YYYY-MM-DDTHH:MM:SS.XXXZ Wa(03)+ vcpu-5 -YYYY-MM-DDTHH:MM:SS.XXXZ Wa(03) vcpu-5 - WinBSOD: Synthetic MSR[0x40000102] 0xffffffffc0000185YYYY-MM-DDTHH:MM:SS.XXXZ Wa(03)+ vcpu-5 -YYYY-MM-DDTHH:MM:SS.XXXZ Wa(03) vcpu-5 - WinBSOD: Synthetic MSR[0x40000103] 0x20009f6cbbe0YYYY-MM-DDTHH:MM:SS.XXXZ Wa(03)+ vcpu-5 -YYYY-MM-DDTHH:MM:SS.XXXZ Wa(03) vcpu-5 - WinBSOD: Synthetic MSR[0x40000104] 0xffff9909dd10f000
YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2102143]: [Originator@6876 sub=Libs] DictionaryLoad: Cannot open file "/vmfs/volumes/vsan:################-################/########-####-####-####-#############/################.vmx": Input/output error.
YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2111502]: [Originator@6876 sub=Libs] OBJLIB-LIB: ObjLib_GetNameSpaceObjectUniqueIdFromPath : failed to obtain canonical path for "/vmfs/volumes/vsan:################-################/########-####-####-####-#############/################.vmx": Input/output error. (5)
VM namespace object shows up as not found and reports heartbeat timeout:
(from /var/run/log/hostd.log)
YYYY-MM-DDTHH:MM:SS.XXXZ Wa(164) Hostd[2102141]: [Originator@6876 sub=Hostsvc.VmkVprobSource] Can't find datastore '########-####-####-####-#############'
YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2102141]: [Originator@6876 sub=Hostsvc.VmkVprobSource] VmkVprobSource::Post event: (vim.event.EventEx) {YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> key = 127,YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> chainId = 0,YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> createdTime = "1970-01-01T00:00:00Z",YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> userName = "",YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> host = (vim.event.HostEventArgument) {YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> name = "ESXi-host.domain.com",YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> host = 'vim.HostSystem:ha-host'YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> },YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> eventTypeId = "esx.problem.vmfs.heartbeat.timedout",YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> arguments = (vmodl.KeyAnyValue) [YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> (vmodl.KeyAnyValue) {YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> key = "1",YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> value = "########-####-####-####-#############"YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> },YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> (vmodl.KeyAnyValue) {YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> key = "2",YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> value = "########-####-####-####-#############'"YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> }YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> ],YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> objectId = "ha-host",YYYY-MM-DDTHH:MM:SS.XXXZ In(166) Hostd[2101673]: --> objectType = "vim.HostSystem",
YYYY-MM-DDTHH:MM:SS.XXXZ No(29) clomd[12612580]: [Originator@6876] CLOM_AddToQueuedObjectList: Queueing workitem for ########-####-####-####-############# type: REPAIR, priority: 19 with delay 0YYYY-MM-DDTHH:MM:SS.XXXZ No(29) clomd[12612580]: [Originator@6876] CLOMCleanup_ConfigNeedsCleanup: Object ########-####-####-####-############# needsCleanup: 0, needsDeltaCleanup: 0, needsRepairCleanup: 0, needsConsolidateCleanup: 0, needsTransientCleanup: 0, needsRefinalize: 0, inCrawler: 0YYYY-MM-DDTHH:MM:SS.XXXZ No(29) clomd[12612580]: [Originator@6876] CLOM_PostWorkItem: Posted a work item opID:1804388030 for ########-####-####-####-############# group: 00000000-0000-0000-0000-000000000000 Type: REPAIR delay 0 (Success)YYYY-MM-DDTHH:MM:SS.XXXZ No(29) clomd[12612580]: [Originator@6876 opID=1804388030] CLOMReconfigure: Reconfiguring ########-####-####-####-############# workItem type REPAIRYYYY-MM-DDTHH:MM:SS.XXXZ Er(27) clomd[12612580]: [Originator@6876 opID=1804388030] CLOMReplacementPreWorkRepair: Repair needed. 1 absent/degraded data components for ########-####-####-####-#############VM Namespace UUID reports heartbeat timeout:
(from /var/run/log/vobd.log)
YYYY-MM-DDTHH:MM:SS.XXXZ In(14) vobd[2097811]: [vmfsCorrelator] 10682910301000us: [vob.vmfs.heartbeat.timedout] ########-####-####-####-############# ########-####-####-####-#############YYYY-MM-DDTHH:MM:SS.XXXZ In(14) vobd[2097811]: [vmfsCorrelator] 10683827650869us: [esx.problem.vmfs.heartbeat.timedout] ########-####-####-####-############# ########-####-####-####-#############
VMware vSAN 8.x
YYYY-MM-DDTHH:MM:SS.XXXZ [536236267] [cpu0] [4549e681bf08] RDTTraceSlowMessageTx:5183: {'newTxState': 'Acked', 'oldTxState': 'Sent', 'type': 'Response', 'opId': 0x4ae09942, 'requestId': 2, 'lastTimeMS': 89680, 'totalMS': 0, 'bytesAtEnd': 3776, 'bytes': 132, 'assoc': 0x4316c7b0b880, 'conn': 0x4316c7966500}YYYY-MM-DDTHH:MM:SS.XXXZ [536236270] [cpu0] [4549e6825c08] RDTTraceSlowMessageTx:5183: {'newTxState': 'Acked', 'oldTxState': 'Sent', 'type': 'Response', 'opId': 0x4ae09948, 'requestId': 2, 'lastTimeMS': 89675, 'totalMS': 0, 'bytesAtEnd': 3888, 'bytes': 112, 'assoc': 0x4316c7e9a9c0, 'conn': 0x4316c7966500}YYYY-MM-DDTHH:MM:SS.XXXZ [536236273] [cpu0] [4549e6803a08] RDTTraceSlowMessageTx:5183: {'newTxState': 'Acked', 'oldTxState': 'Sent', 'type': 'Response', 'opId': 0x4ae098fb, 'requestId': 2, 'lastTimeMS': 89671, 'totalMS': 0, 'bytesAtEnd': 4020, 'bytes': 132, 'assoc': 0x4316c83c8e00, 'conn': 0x4316c7966500}
Investigate network disconnection between data nodes and Witness node to understand why network drops are observed between them and why Witness node takes longer than expected to respond.