After upgrading TKR to 1.29.15 the antrea package is in a Reconcile failed state
search cancel

After upgrading TKR to 1.29.15 the antrea package is in a Reconcile failed state

book

Article ID: 403956

calendar_today

Updated On:

Products

VMware vSphere Kubernetes Service

Issue/Introduction

After upgrading the guest cluster to 1.29.15 (photon). All nodes are upgraded successfully. After the upgrade, the antrea package is stuck in Reconcile failed state. Checking the antrea pkgi will see the following error message: 

kapp: Error: waiting on reconcile job/register-placeholder (batch/v1) namespace: vmware-system-antrea:

Finished unsuccessfully (Failed with reason BackoffLimitExceeded: Job has reached the specified backoff limit)

Environment

VKS 1.29.15

Resolution

 Manually delete the job:

kubectl delete job -n vmware-system-antrea  register-placeholder

Once the above command is run, wait for kapp to resync the antrea pkgi and for it to succeed. This could take up to 10 minutes. 

Additional Information

Affected Versions: TKR 1.26 to TKR 1.29
Fixed Versions:TKR 1.30 and above