Error "Skip pod volume <VOLUME_NAME> error: daemonset pod not found in running state in node <NODE_NAME>" during workload cluster backup using Velero
book
Article ID: 434912
calendar_today
Updated On:
Products
VMware Tanzu Kubernetes Grid Management
Issue/Introduction
Velero backups complete with a PartiallyFailed phase.
The backup logs display the following error message for skipped volumes: Skip pod volume <VOLUME_NAME> error: daemonset pod not found in running state in node <NODE_NAME>
This failure occurs despite the node-agent pods being confirmed in a Running state on the impacted worker nodes.
This is a known issue with Velero v1.16.0, where the controller contains a hardcoded label selector requirement that queries for local node-agent pods using the role=node-agent label.
This label is missing from the default DaemonSet pod template specification, causing discovery to fail.
Resolution
Connect to the impacted Kubernetes cluster with administrative privileges.
Verify the node-agent pods are actively running on the worker nodes:
kubectl get pods -n velero -l name=node-agent -o wide
Patch the node-agent DaemonSet to inject the required role label into the pod template: