vSAN Health Service - Data Protection - Local protection - protection group health

Article ID: 389834

Products

VMware vSAN

Issue/Introduction

This article explains the purpose of the 'Local protection - protection group health' health check, why it might report a warning or error, and how to resolve the warning or error state.

Environment

VMware vSAN 9.0

Resolution

Q: What does the 'Local protection - protection group health' Health Check do?

This health check monitors the status of protection group snapshots and provides detailed information, such as the failure reason and the timestamp of the last failed snapshot, to help identify and resolve issues promptly.

Q: What does it mean when it is in a warning/error state?

When the health status is in a Warning or Error state, it indicates one or more protection group snapshot failures. If a protection group snapshot fails, the state of the protection group in the health check table is marked as Red. If one or more VM snapshots within the protection group fail, the group's state is marked as Yellow.

Furthermore, if any protection group has a Red state, the overall health status will be set to Error. Otherwise, it will be set to Warning.
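The state rules above can be sketched as a small decision function. This is only an illustration of the rules as described in this article, not the vSAN implementation: the names `GroupState`, `group_state`, and `overall_health` are hypothetical, and the "Green" and "Healthy" values for the no-failure case are assumptions, since the article names only the Red, Yellow, Error, and Warning states.

```python
from enum import Enum


class GroupState(Enum):
    GREEN = "Green"    # assumed label for a group with no failures
    YELLOW = "Yellow"  # one or more VM snapshots in the group failed
    RED = "Red"        # the protection group snapshot itself failed


def group_state(pg_snapshot_failed: bool, failed_vm_snapshots: int) -> GroupState:
    """Map one protection group's snapshot results to its table state."""
    if pg_snapshot_failed:
        return GroupState.RED
    if failed_vm_snapshots > 0:
        return GroupState.YELLOW
    return GroupState.GREEN


def overall_health(states: list[GroupState]) -> str:
    """Any Red group yields Error; otherwise any Yellow group yields Warning."""
    if GroupState.RED in states:
        return "Error"
    if GroupState.YELLOW in states:
        return "Warning"
    return "Healthy"  # assumed label when no group is unhealthy
```

For example, under these assumptions a cluster with one Yellow group and one Red group reports Error overall, because a single Red group takes precedence over any number of Yellow groups.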

Q: How does one troubleshoot and fix the warning/error state?

Users can refer to the "Failure reason" in both the "Unhealthy protection group" and "Failed snapshots" health check tables for remediation guidance. For each VM in a protection group, the health check monitors the latest snapshot status as well as consecutive failures. If a VM's snapshots keep failing, the cause is most likely not the vSAN data protection service but an underlying infrastructure issue, and users need to investigate the specific VMs further.
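When triaging, it can help to separate VMs with a single recent failure from VMs that fail repeatedly. The sketch below counts the run of failures at the end of a VM's snapshot history. It is a hypothetical helper operating on a list you have assembled yourself (for example, from the "Failed snapshots" table over time); it does not call any vSAN API, and the article does not state how many consecutive failures the health check itself treats as significant.

```python
def consecutive_failures(results: list[bool]) -> int:
    """Count trailing failures in a snapshot history.

    `results` is ordered oldest to newest; True means the snapshot
    succeeded and False means it failed (a hypothetical representation).
    """
    count = 0
    for succeeded in reversed(results):
        if succeeded:
            break  # the most recent success ends the failure run
        count += 1
    return count
```

A VM whose history ends in a long run of failures is a stronger candidate for investigating the VM and its underlying infrastructure than one with an isolated failure followed by successes.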

Screenshots

If a protection group snapshot fails, the overall health status is Error:

The overview card shows the Error state:

The table 'Unhealthy protection group' shows the details of the protection groups that experienced a failed protection group (PG) snapshot:

The table 'Failed snapshots' shows the details of the failed snapshots for the specific VMs (no entries appear here because this is a PG snapshot failure):

If one or more VM snapshots within the protection group fail, the overall health status is Warning:

The overview card shows the Warning state:

The table 'Unhealthy protection group' shows the details of the protection groups with failed VM snapshots:

The table 'Failed snapshots' shows the details of the failed snapshots for the specific VMs: