Actions taken to avoid these in the future:
1. Enhance internal monitoring to identify unhealthy nodes and move pods to healthier nodes.
Due Date: Sep 28, 2021
2. Improve the code to refresh configuration asynchronously, independent of metric ingestion.
Due Date: Nov 15, 2021
Status: In Progress
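
As an illustration of item 2 only, the following is a minimal sketch of one way to decouple configuration refresh from metric ingestion. It is not the actual implementation; the names `ConfigCache`, `loader`, `refresh_once`, and `ingest` are hypothetical, and the configuration is assumed to be a dictionary with a `version` key. A background thread refreshes the configuration on a timer, while the ingestion path only reads the last good copy and never waits on the configuration source.

```python
import threading
from typing import Callable, Dict, Optional


class ConfigCache:
    """Holds the last good configuration and refreshes it off the ingestion path."""

    def __init__(self, loader: Callable[[], Dict], interval_s: float = 30.0):
        self._loader = loader            # hypothetical: fetches config from its source
        self._interval_s = interval_s    # assumed refresh period, in seconds
        self._lock = threading.Lock()
        self._config: Dict = loader()    # one synchronous load at startup
        self._stop = threading.Event()
        self._thread: Optional[threading.Thread] = None

    def get(self) -> Dict:
        # Called by ingestion: returns the cached copy, never calls the loader.
        with self._lock:
            return self._config

    def refresh_once(self) -> bool:
        # Load outside the lock so a slow source cannot block readers.
        try:
            new_config = self._loader()
        except Exception:
            return False                 # keep serving the last good configuration
        with self._lock:
            self._config = new_config
        return True

    def _run(self) -> None:
        # Event.wait returns False on timeout, True once stop() is called.
        while not self._stop.wait(self._interval_s):
            self.refresh_once()

    def start(self) -> None:
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def stop(self) -> None:
        self._stop.set()
        if self._thread is not None:
            self._thread.join()


def ingest(metric: Dict, cache: ConfigCache) -> Dict:
    # Ingestion tags the metric with whatever configuration is currently cached.
    config = cache.get()
    return {**metric, "config_version": config["version"]}
```

In this sketch, a failed or slow refresh leaves ingestion running on the previous configuration instead of stalling it, which is the property the action item is aiming for.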