The following error is displayed for the affected Supervisor Control Plane VM in the vSphere UI:
"Node is not healthy and is not accepting pods. Details: Kubelet stopped posting node status."
Additionally, when running the command kubectl get nodes, the affected node shows a NotReady status. As a result, pods cannot be scheduled on the affected node.
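As a quick triage step, the NotReady nodes can be filtered out of the kubectl output. A minimal sketch, run here against sample output (the node names are placeholders, not taken from a real Supervisor); in practice, pipe the live kubectl get nodes command into the same awk filter:

```shell
# Sample output in the format produced by `kubectl get nodes`.
kubectl_output='NAME                               STATUS     ROLES                  AGE   VERSION
4201a7b9c0d1e2f3a4b5c6d7e8f9a0b1   Ready      control-plane,master   90d   v1.24.9
4201b8c9d0e1f2a3b4c5d6e7f8a9b0c2   NotReady   control-plane,master   90d   v1.24.9'

# Print only nodes whose STATUS column is NotReady (skip the header row).
echo "$kubectl_output" | awk 'NR > 1 && $2 == "NotReady" { print $1 }'
```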
vSphere with Tanzu 7.x
vSphere with Tanzu 8.x
The root (/) partition of the Control Plane VM is full, which prevents the kubelet from operating properly and stops the VM from accepting pods.
To resolve the issue, follow these steps:
1. SSH into the Affected Control Plane VM:
Follow the instructions in the "How to SSH into Supervisor Control Plane VMs" section of the KB article Troubleshooting vSphere with Tanzu (TKGS) Supervisor Control Plane VMs.
Use the root username and the password obtained from that article to log in to the affected VM.
2. Check Disk Space Usage:
On the affected VM, run the following command to check disk usage:
df -h
This shows the disk usage of every partition; on an affected node, the root (/) partition will typically be 100% full.
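The df output can also be filtered programmatically to flag filesystems near capacity. A minimal sketch, assuming the standard six-column df -h layout (Use% in column 5, mount point in column 6) and an illustrative 90% threshold:

```shell
# Flag any filesystem at or above 90% usage; on an affected VM the
# root filesystem typically shows 100%.
df -h | awk 'NR > 1 { gsub(/%/, "", $5); if ($5 + 0 >= 90) print $6, $5 "%" }'
```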
3. Clean Up Space on the Root Partition:
Follow the steps outlined in the KB article: Supervisor Cluster Unstable After Upgrade to clean up disk space.
Additionally, download and run the following Python scripts to clean up stale resources:
4. Check Other Control Plane Nodes:
Repeat the disk-usage check and, if needed, the cleanup process on the other Control Plane nodes to ensure they are not affected by the same issue.
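When checking several nodes in turn, a small helper makes the threshold check repeatable. A sketch assuming an illustrative 90% cleanup threshold (the function name and threshold are not from the article); the usage figure would come from running df -h / on each node over SSH:

```shell
# Helper: given a usage figure such as "100%", report whether the node
# needs cleanup. The 90% threshold is an illustrative choice.
needs_cleanup() {
    usage="${1%\%}"   # strip the trailing percent sign
    if [ "$usage" -ge 90 ]; then
        echo "cleanup needed (${1} used)"
    else
        echo "ok (${1} used)"
    fi
}

needs_cleanup 100%   # prints: cleanup needed (100% used)
needs_cleanup 42%    # prints: ok (42% used)
```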
vSphere Kubernetes Supervisor Root Disk Space Full at 100%: https://knowledge.broadcom.com/external/article/383369