TKG ready state shows as false every few minutes.
kubectl get tkc
NAME CONTROL PLANE WORKER TKR NAME AGE READY TKR COMPATIBLE UPDATES AVAILABLEtanzu-xxxxx 3 5 v1.26.12---vmware.2-fips.1-tkg.2 441d False True
tanzu package installed list -A
NAMESPACE NAME PACKAGE-NAME PACKAGE-VERSION STATUStanzu-package cert-manager cert-manager.tanzu.vmware.com 1.12.2+vmware.1-tkg.1 Reconcile succeededtanzu-package contour contour.tanzu.vmware.com 1.25.2+vmware.1-tkg.1 Reconcile succeededtanzu-package fluent-bit fluent-bit.tanzu.vmware.com 1.9.5+vmware.1-tkg.2 Reconcile failedtanzu-package grafana grafana.tanzu.vmware.com 10.0.1+vmware.1-tkg.1 Reconcile succeededtanzu-package prometheus prometheus.tanzu.vmware.com 2.37.0+vmware.3-tkg.1 Reconcile succeeded
Fluentd Log Snippet Indicating an Issue with Elasticsearch:
2025-02-21T08:26:24.019008509Z stdout F 2025-02-21 08:26:24 +0000 [warn]: #0 [in_tail_container_logs] Pattern not matched: "2025-02-21T08:11:10.006873071Z stderr F {\"log.level\":\"error\", \"message\":\"Failed to connect to backoff(elasticsearch(http://10.190.1.9:9200)): Connection marked as failed because the onConnect callback failed: 429 Too Many Requests\"}"
VMware vSphere with Tanzu
Issue is caused by Fluent Bit being unable to connect to Elasticsearch, resulting in a Reconcile failed status.
Fluent Bit needs to establish a connection with Elasticsearch server for log forwarding. If network is unavailable, Fluent Bit detects error and enters Reconciling state.
This issue is caused by a known Antrea bug.
According to release notes, TKG clusters using Antrea package v1.11.1 may randomly enter ClusterBootstrapReconciling state, causing networking issues. In this case, TKC 1.26.12 is using Antrea version 1.11.1.
vSphere Supervisor 7.0 Release Notes
Upgrade to TKR v1.26.13 with Antrea version v1.11.2 or later to resolve the issue.
If the customer does not use Fluent Bit for log forwarding, they can remove it to stabilize TKC cluster.
List the installed Fluent Bit package:
Delete the installed Fluent Bit package:
Verify Fluent Bit has been removed: