VMware vSAN is a software-defined storage solution that pools storage from multiple hosts to create a shared, high-performance storage system. While vSAN improves scalability and flexibility, sometimes performance issues such as slow application response times or high latency can occur.
This article provides step-by-step troubleshooting for common vSAN latency and performance issues, helping users quickly identify and fix problems within the environment.
This article is for vSAN administrators and users who want to:
VMware vSAN (All Versions)
Example Use Case: Long query execution times in a database server running on vSAN. Reports taking twice as long to generate.
Verify vSAN Latency in vCenter. To confirm the issue:
Example Use Case: In vCenter write latency is spiking to 50ms, which is much higher than expected.
1. Network Issues (One of the most common causes of vSAN slowness)
Problem: vSAN depends on fast network communication. If the network is slow or experiencing packet loss, vSAN performance will suffer.
How to Check:
Quick Fixes:
Example Use Case: One of the vSAN hosts is connected to a 1GbE network switch instead of 10GbE, which is slowing down the entire cluster. Upgrading the network connection immediately improves performance.
2. Storage Policies Causing High Workload
Problem: If VMs use RAID-5/6 policies, performance may degrade due to extra processing.
How to Check:
Quick Fixes:
Example Use Case: A critical financial application sees high disk latency. After changing the storage policy from RAID-5 to RAID-1, the application performance doubles.
3. Overloaded Hosts or Disks
Problem: If a vSAN host is running out of free space, performance will degrade.
How to Check:
Quick Fixes:
Example Use Case: A virtual desktop infrastructure (VDI) notices slow logins in the morning. Checking the vCenter Server, high disk utilization is seen. Adding additional storage fixes the issue.
4. Hardware and Firmware Issues
Problem: If storage controllers, SSDs, or NIC firmware are outdated, vSAN can slow down.
How to Check:
Quick Fixes:
Example Use Case: Random vSAN latency spikes. Storage controller firmware is updated, and the problem disappears.
If the issue persists after following these steps, collect the following information before creating a Broadcom case. For more information, see Creating and managing Broadcom support cases.