During a SQL Server backup restore operation for virtual machine TEST, the vSAN cluster experiences degraded performance, characterized by:
Elevated vSAN cluster-wide read/write latency.
Increased vSAN backend latency observed during restore activity.
SSD congestion values exceeding threshold on host ESXi1.
UUID: 5227bfcc-####-####-####-############ssdCongestion: 102ssdCongestionLocalMax: 102125esxcli vsan debug disk list
UUID: 5227bfcc-####-####-####-############ Name: naa.500############ Owner: ESXi1 Version: 15 Disk Group: 5227bfcc-####-####-####-############ Disk Tier: Cache SSD: true In Cmmds: true In Vsi: true Fault Domain: N/A Model: MZILG800HCHQAD3 Encryption: false Compression: true Deduplication: true Dedup Ratio: N/A Overall Health: green Metadata Health: green Operational Health: green Congestion Health: State: green Congestion Value: 112 Congestion Area: ssd All Congestion Fields: SSD: 112 Log: 0 IOPS: 0 Slab: 0 Memory: 0 Space Health:
VMware vSAN 7.x
The performance issue was caused by excessive I/O load during the backup restoration operation on VM TEST.
Specifically:
ESXi1 became congested, impacting the ability to service new I/O.Result:
vSAN memory or SSD congestion reached threshold limit
vSAN performance diagnostics reports: "vSAN is experiencing congestion in one or more disk group(s)"