It's important to monitor the vSAN cluster to ensure it doesn't become too full. There are Health Service alerts in place to help keep track of capacity utilization. Please see: vSAN Health Service - Capacity utilization - Disk space and vSAN Health Service - Physical Disk Health - Disk Capacity for more information.
If a vSAN datastore becomes too full, it can cause issues such as resyncs being stuck and certain management tasks to timeout and get stuck.
Other symptoms can include:
VMs not running despite having free space on vSAN Datastore.
vSAN Free Capacity recommendations: Understanding reserved capacity concepts in vSAN
VMware vSAN (All Versions)
vSAN Datastore becoming too full
The only recommended solutions are the following:
If this is not possible, there are some methods that can be done to clear space for management tasks such as adding more disks.
This can also be used for certain VMs that have fault tolerance built into their application by nature, such as having primary and secondary VMs where one takes over if the other fails.
Please see: How vSAN handles Policy Changes between RAID1 Mirroring and RAID 5/6 Storage Policies.Verify storage policy in use, in rare cases some objects previously migrated to vSAN datastore may have Storage Rule 'proportionalCapacity = 100' (thick) incorrectly assigned.
To identify such objects user should run the following commands:
cmmds-tool find -f python | grep 'proportionalCapacity\\\": 100' -B9 | grep uuid | cut -d "\"" -f4 >> /tmp/uuidlist.txt
(Note: this command creates a file in /tmp/uuidlist.txt with all the objects with 'proportionalCapacity = 100' rule.)
for i in $(cat /tmp/uuidlist.txt); do echo "*********************";echo; /usr/lib/vmware/osfs/bin/
(Note: this command outputs 'UUID <-> path' pairs based on previously created /tmp/uuidlist.txt file)
Based on the friendly names ('Object path') in the output user could determine list of good candidates (UUID) for conversion to thin, once completed (Re)apply Storage Policy in the UI by assigning Storage Policy with same characteristics object(s) already has (Failures to Tolerate, etc.) and with 'proportionalCapacity' rule set to '0'.