PSOD with Event: LINT1/NMI - AeiHESTNmiHandler requested PSOD
search cancel

PSOD with Event: LINT1/NMI - AeiHESTNmiHandler requested PSOD

book

Article ID: 399748

calendar_today

Updated On:

Products

VMware vSphere ESXi

Issue/Introduction

ESXi Server crashes with PSOD in response to Non Maskable Interrupt (NMI) triggered due to erroring PCI Device. 

Crash logs/Screen Capture contain event similar to the following
cpu0:2098206)ApeiHEST: 233: Invoked HestNMIHandler
cpu0:2098206)ApeiHEST: 259: Uncorrectable Errors
cpu0:2098206)ApeiHEST: 294: Error Event Severity: Fatal
cpu0:2098206)ALERT: ApeiHEST: 327: Fatal error from 0000:XX:00.0(PCI Express Endpoint), VID:####, DID:#### DevSts: 0xd, AERUeSts: 0x2000.
cpu0:2098206)NMI: 1031: ApeiHESTNmiHandler requested PSOD

Cause

This PSOD is in response to an NMI raised by CPU as part of notifying VMkernel of a failing/erroring PCI Device.

Resolution

Engage the hardware vendor with a screenshot of the PSOD screen and the device details for further diagnostics and troubleshooting. 

Additional Information

How to identify the failing device:

PCI device address of the failing device will be part of the ALERT seen on the PSOD screen or the crash logs. From the example stated in the introduction 
ALERT: ApeiHEST: 327: Fatal error from 0000:XX:00.0(PCI Express Endpoint), VID:####, DID:####, DevSts: 0xd, AERUeSts: 0x2000.

Use this device address (ID) to locate the device in the vSphere UI following steps listed below.

  1. Connect to vSphere UI .
  2. Browse to the host in the vSphere Client navigator.
  3. On the Configure tab, expand Hardware and click PCI Devices.
  4. Click on the "All Devices" button to list all devices
  5. Locate the device with the ID from the ALERT message from the PSOD Screen.

Alternatively, you can view the device details using the lspci command in ESXi Shell to list the device.

Example:
#lspci |grep 0000:XX:00.0

Output:
0000:XX:00.0 <Device Class>: <Device Vendor> <Device Model>