You are running SSP 5.0 or later and have encountered an alarm with the description:
"The CPU usage of Security Services Platform cluster {{ .ResourceID }} is currently {{ .Value }}%, which exceeds the threshold value."
This indicates that one or more nodes in the SSP cluster are experiencing high or very high CPU usage, potentially impacting cluster performance and workload stability.
vDefend SSP Version: 5.0 and later
High Load Applications:
Analyze Cluster Load
Cluster level CPU usage is aggregated value of all node CPU usage
k get nodes
k describe node <worker node name>
-> will show all the pods in this nodek top pods -n nsxi-platform --sort-by=cpu
On the Security Services Platform UI:
Navigate to System > Platform & Features > Core Services
If the CPU intensive applications are any of the following, scale out corresponding category. (From 'k top pods -n nsxi-platform --sort-by=cpu
' we can know if the CPU intensive applications are part of the list)
rawflowcorrelator, overflowcorrelator / druid-middle-manager, druid-broker/ latestflow
- Analyticskafka-controller, kafka-broker
- Messagingminio
- Data Storage metrics-manager, metrics-app-server,metrics-query-server
- Metrics (Refer to KB: 384109 for metrics specific CPU spike issues)