This article helps users create vCenter alarms based on SMART codes for their NVMe devices, so that they can address drive health concerns proactively, where possible, before a drive fails.
VMware vSAN introduced a number of new features, including the ability to raise alarms on the SMART data that drive vendors provide about the overall health of a disk.
The following five VOB alerts report SMART statistics for NVMe devices:
| VOB Alert | Description |
| --- | --- |
| vob.vsan.lsom.temperaturenvmediskhealthcriticalwarning | Reports when an NVMe disk's available spare capacity falls below the critical threshold. |
| vob.vsan.lsom.temperaturenvmediskhealthcriticalwarning | Reports when an NVMe disk's temperature is beyond the threshold. |
| vob.vsan.lsom.reliabilitynvmediskhealthcriticalwarning | Reports when an NVMe disk has become unreliable. |
| vob.vsan.lsom.readonlynvmediskhealthcriticalwarning | Reports when an NVMe disk has become read-only. |
| vob.vsan.lsom.backupfailednvmediskhealthcriticalwarning | Reports when an NVMe disk's volatile memory backup device has failed (if present). |
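
The five conditions in the table line up with the five Critical Warning bits defined for byte 0 of the NVMe SMART / Health Information log page. The sketch below decodes that byte into the same five conditions. It assumes the VOB alerts are derived from this field, which the article does not state, and the names `CriticalWarning`, `DESCRIPTIONS`, and `decode_critical_warning` are hypothetical helpers for illustration, not part of any VMware API.

```python
from enum import IntFlag
from typing import List


class CriticalWarning(IntFlag):
    """Critical Warning bits 0-4 of the NVMe SMART / Health log page (byte 0)."""
    SPARE_BELOW_THRESHOLD = 0x01  # available spare capacity below threshold
    TEMPERATURE = 0x02            # temperature beyond an over/under threshold
    RELIABILITY_DEGRADED = 0x04   # NVM subsystem reliability degraded
    READ_ONLY = 0x08              # media placed in read-only mode
    BACKUP_FAILED = 0x10          # volatile memory backup device failed


# Hypothetical mapping from each bit to the wording used in the table above.
DESCRIPTIONS = {
    CriticalWarning.SPARE_BELOW_THRESHOLD: "available spare capacity is low",
    CriticalWarning.TEMPERATURE: "temperature is beyond threshold",
    CriticalWarning.RELIABILITY_DEGRADED: "disk has become unreliable",
    CriticalWarning.READ_ONLY: "disk has become read-only",
    CriticalWarning.BACKUP_FAILED: "volatile memory backup device has failed",
}


def decode_critical_warning(value: int) -> List[str]:
    """Return one description per warning bit that is set in `value`."""
    return [text for flag, text in DESCRIPTIONS.items() if value & flag]
```

For example, `decode_critical_warning(0x0A)` has bits 1 and 3 set, so it reports the temperature and read-only conditions. A value of `0` returns an empty list, meaning no critical warning is raised.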
VMware by Broadcom highly recommends configuring alarms in vCenter for these alerts, so that administrators are notified of NVMe SMART codes that may indicate an impending hardware failure. As of this writing, vSAN does not take any proactive measures when these errors occur, because SMART data does not adhere to any industry standard and may vary between hardware vendors.
VMware vCenter Server 8.0U3 and higher
VMware vSAN (OSA & ESA) 8.0U3 and higher
NVMe devices