You are running vDefend SSP 5.0 or later and have encountered an alarm with the description:
"Platform Service {{ .ResourceID }} is degraded/down."
This indicates that one of the SSP platform services or features is currently degraded or in an unhealthy state, impacting its functionality.
If the alarm stays open for more than 30 minutes, or if it occurs multiple times, proceed to the Resolution section.
vDefend SSP Version: 5.0 and later
One or more pods of platform service {{ .ResourceID }} are not in a running state
Steps to resolve:
Try restarting the deployment/statefulset. This should take care of any transient issues.
Log in to the SSPI root shell.
Get the pod name for {{ .ResourceID }} using 'k -n nsxi-platform get pods | grep {{ .ResourceID }}'.
Check which kind of resource owns the pod:
k -n nsxi-platform get pod <pod-name> -o jsonpath='{.metadata.ownerReferences[0].kind}'
If the output is StatefulSet, follow the StatefulSet restart steps.
If the output is ReplicaSet, the pod belongs to a Deployment; follow the Deployment restart steps.
If {{ .ResourceID }} is a StatefulSet, run:
k -n nsxi-platform rollout restart statefulset {{ .ResourceID }}
Otherwise, run:
k -n nsxi-platform rollout restart deployment {{ .ResourceID }}
Wait about 20 minutes and check whether the alarm has auto-resolved. To confirm that the restarted pods are up, run 'k -n nsxi-platform get pods'.
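The owner check and restart steps above can be sketched as a small shell helper. This is a minimal illustration, not part of the product: it assumes that `k` in the steps above is shorthand for `kubectl`, and the function names `workload_type_for_owner` and `restart_platform_service` are hypothetical. It also assumes, as the steps above do, that the workload name matches the resource ID shown in the alarm.

```shell
#!/usr/bin/env bash
# Sketch only. Assumption: `k` in the steps above is shorthand for kubectl.
KUBECTL="${KUBECTL:-kubectl}"
NAMESPACE="nsxi-platform"

# Map a pod's owner kind to the workload type used by `rollout restart`.
# Prints nothing and returns 1 for any other owner kind.
workload_type_for_owner() {
  case "$1" in
    StatefulSet) echo "statefulset" ;;
    ReplicaSet)  echo "deployment" ;;   # a ReplicaSet owner implies a Deployment
    *)           return 1 ;;
  esac
}

# Hypothetical helper: restart the workload that owns a pod.
#   $1 = service name (the resource ID from the alarm)
#   $2 = pod name (from `get pods | grep <service name>`)
restart_platform_service() {
  local service="$1" pod="$2" kind workload
  kind="$("$KUBECTL" -n "$NAMESPACE" get pod "$pod" \
    -o jsonpath='{.metadata.ownerReferences[0].kind}')" || return 1
  if ! workload="$(workload_type_for_owner "$kind")"; then
    echo "Unexpected owner kind: ${kind:-none}" >&2
    return 1
  fi
  "$KUBECTL" -n "$NAMESPACE" rollout restart "$workload" "$service"
}
```

After sourcing the snippet, it could be invoked as `restart_platform_service <service-name> <pod-name>`. The helper only issues the restart; checking that the pods come back up and that the alarm clears still follows the manual step above.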
If the alarm persists, check for the following:
Check for disk usage alarms: https://knowledge.broadcom.com/external/article?articleNumber=384119
Check for memory usage alarms: https://knowledge.broadcom.com/external/article?articleNumber=384120
Check for CPU usage alarms: https://knowledge.broadcom.com/external/article?articleNumber=384118
If none of the above is applicable, please open a ticket with Broadcom Support.