WARNING: Heartbeat: 961: PCPU 40 didn't have a heartbeat for 5 seconds, timeout is 10, 1 IPIs sent; *may* be locked up.
WARNING: Heartbeat: 961: PCPU 41 didn't have a heartbeat for 15 seconds, timeout is 10, 2 IPIs sent; *may* be locked up.
The unexpected reboot is caused by hardware-level failures in specific physical CPU cores (PCPUs). When one or more physical CPU cores become unresponsive, the ESXi heartbeat monitoring system detects that these cores are not responding to Inter-Processor Interrupts (IPIs). After multiple failed attempts to communicate with the locked-up cores, the server experiences a fault condition that triggers a reboot.
These heartbeat failures are symptomatic of physical CPU hardware issues that cannot be resolved through software configuration changes.
Since this is a hardware-related issue, the following steps should be taken:
Place the affected host in maintenance mode to prevent workloads from being impacted if the issue recurs.
Review the ESXi host vmkernel.log to confirm PCPU heartbeat failure messages.
Contact your server hardware vendor to perform comprehensive hardware diagnostics
Update server firmware:
Check for and apply the latest BIOS updates for your server model.
Update any related firmware components (chipset, management controllers).
Apply any microcode updates available for your processor model.
If the issue persists after firmware updates, work with your hardware vendor for resolution.