Device naa.xxxxx123 performance has deteriorated. I/O latency increased from average value of 1832 microseconds to 19403 microseconds
VMware vSphere ESXi 6.x
VMware vSphere ESXi 7.x
VMware vSphere ESXi 8.x
To get the lun level device performance statistics data use the esxtop utility. Refer to this article for more information: Using esxtop to identify storage performance issues (1008205)
High device latency
It is useful to consider a 4-dimensional framework to characterize the latency.
1) Magnitude: How high are the spikes in DAVG?
2) Duration: How long does each spike last?
3) Frequency: What sort of pattern is exhibited by the date / time stamps?
4) Spread: How widespread are the events?
Magnitudes of limited amount (say, for example, 20-30ms) for durations of only a few seconds, on an occasional frequency, on a small subset of datastores, is a vastly different situation than, say, magnitudes of multiple seconds, for durations of multiple minutes. The latter situation could be perceived by most VMs as a storage outage, and Linux machines, for example, can turn their disks read only as a protective measure.
A USEFUL STRATEGY TO SCOPE THE EXTENT:
1) Extract all of the events from vmkernel.log (and its .gz rotations) that contain the string "performance has deteriorated", then export those events and import them into a spreadsheet application such as Excel, or Open Office.
2) With the raw data in the sheet, parse the data into Columns.
3) Example Column headings would be:
4) With this data, then sort the data by Hostname, Date and Time in UTC.
5) Finally, use the Data --> Filter feature of Excel to analyze the data by Magnitude, Duration, Frequency and Spread.
6) Finally, please remember that ESXi does not cause the latency spikes -- it merely reports them. The cause is not possible to determine from an ESXi point of view, but data as outlined above can help inform the investigation outside of the ESXi host(s).