We have the UMA deployed in an EKS cluster and it is collecting metrics (into our tenant). Some of our teams have some questions regarding the metrics.
Need to confirm the normal value for metric:
<User-defined Cluster Name> | Cluster Deployment | Infrastructure Agent | Kubernetes | Nodes | <node name>:Ready
Q1: Would 1 be normal value (when node is in ready state) & 0 should set the alarm? The official documentation doesn’t have an actual number for this metric.
Q2. We’ve also noticed that one of the node has been decommissioned from the cluster but it still shows active in the UI.
We’re trying to understand how that would work? How long do we expect the node to still show active in the UI?
A1. Yes, 1 is when in a Ready state. Regarding the Ready state, it usually is provided by the kubelet when it can operate. In other words, when no condition prevents it to pursue normal operations.
A2. As long as there is an agent running on the node, connectivity will exist and the node will be shown.