containerd process shows a significantly high Resident Set Size (RSS), sometimes reaching over 100 GB.3.2
The issue is caused by a known internal memory leak within the containerd daemon version 1.6.6. Specifically, the leak occurs within the Container Runtime Interface (CRI) and Task Service. Over long periods of uptime, the Go runtime heap grows uncontrollably as memory pages are trapped and not properly reclaimed by the daemon.
To immediately reclaim memory and restore node stability without a full upgrade, restart the containerd service on the affected node:
systemctl restart containerdNote: Make sure that the node is cordoned and drained before restarting the service.
The permanent resolution requires upgrading the TCA to 3.3.0.1 version with Kubernetes 1.30 .
Containerd is tied to the BYOI template and the TKG VM. New BYOI templates are introduced in TCA 3.3.0.1 version which will have the new containerd version. The containerd version within that is 1.7.29 which will resolve the issue
Supported doc containerd version v1.7.29