Symptoms:
- After successful vSAN Cluster shutdown and Restart the alert for Stats Primary Election is triggered.
- Under this health check, the Stats Primary is not visible, but the CMMDS primary is visible
- Stats object shows missing.
- All the objects are healthy and all the VMs can be Powered On.
Issue Validation:
Performance Stats object ".vsan.stats" exists but shows as missing even though all its components are active and the object is healthy:
Object UUID: xxxxxxxx-2ed5-xxxx-a4c8-xxxxxxafc016
Version: 20
Health: healthy
Owner: xxxxxxxxesx006.xxxx.xxxxxxx.com
Size: 512.00 GB
Used: 0.19 GB
Used 4K Blocks: 0.00 GB
Policy:
stripeWidth: 1
cacheReservation: 0
proportionalCapacity: [0, 100]
hostFailuresToTolerate: 2
forceProvisioning: 0
spbmProfileId: xxxxxxxx-73b1-xxxx-b072-xxxxxx272e5
spbmProfileGenerationNumber: 1
storageType: Allflash
replicaPreference: Capacity
iopsLimit: 0
checksumDisabled: 0
CSN: 35
SCSN: 38
spbmProfileName: Management - Optimal Datastore Default Policy - RAID6
Configuration:
Concatenation
RAID_1
Component: xxxxxxxx-540f-xxxx-9cc7-xxxxxxafc016
Component State: ACTIVE,Address Space(B): 273804165120 (255.00GB), Disk UUID: xxxxxxxx-1366-xxxx-6e13-xxxxxx8e4008, Disk Name: t10.NVMe_xxxxxxxxKYDMV_xxxxxxxxxxxxxxxx:2
Votes: 2,Capacity Used(B): 13877248 (0.01GB), Physical Capacity Used(B): 13877248 (0.01GB), Total 4K Blocks Used(B): 0 (0.00GB), Host Name: xxxxxxxxesx006.xxxx.xxxxxxx.com
Component: xxxxxxxx-540f-xxxx-9cc7-xxxxxxafc016
Component State: ACTIVE, Address Space(B): 205353123840 (191.25GB), Disk UUID: xxxxxxxx-6880-xxxx-0cb7-xxxxxx4dc3f4, Disk Name: t10.NVMe_xxxxxxxxKYDMV_xxxxxxxxxxxxxxxx:2
Votes: 1,Capacity Used(B): 1547904 (0.00GB), Physical Capacity Used(B): 8650752 (0.01GB), Total 4K Blocks Used(B): 0 (0.00GB), Host Name: xxxxxxxxesx006.xxxx.xxxxxxx.com
Type: vmnamespace
Path: /vmfs/volumes/vsan:xxxxxxxxxxxx78139-xxxxxxxx477ba664/.vsan.stats (Missing)
Group UUID: xxxxxxxx-2ed5-xxxx-a4c8-xxxxxxafc016
Directory Name: .vsan.stats
VMware vSAN 8.x
When object is created, as DOMPauseAllCCPs is set to 1 at that time, VsanObjCreateInt failed with vmfs not ready error, and create failure would trigger object rollback, but it failed because deleting the new object failed with I/O error, so it left the object in the environment.
Broadcom Engineering Team is aware of this issue and they are working to fix it in upcoming ESXi 9.0U1 version.
Current Workaround available is to Delete and Recreate the stats object via RVC: