Node pool upgrade at scale with maxSurge in Telco Cloud Automation (TCA) 2.3

Article ID: 325375

Products

VMware Telco Cloud Automation

Issue/Introduction

This article describes how to perform a rolling upgrade of a node pool at scale by setting maxSurge in the node pool's upgrade strategy.

Note: Using maxSurge requires spare capacity in the infrastructure, because up to maxSurge additional worker node VMs are created before the old nodes are removed.
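
To make the resource note concrete, the following is a minimal sketch of how the surge setting translates into extra worker nodes. It assumes the usual Kubernetes and Cluster API rolling update semantics (a percentage maxSurge rounds up, a percentage maxUnavailable rounds down), which this article does not itself confirm for TCA. The function names and the batch estimate are illustrative only, not part of TCA.

```python
from typing import Dict, Union


def resolve(value: Union[int, str], replicas: int, round_up: bool) -> int:
    """Resolve an absolute number or a percentage string such as "25%"."""
    if isinstance(value, str) and value.endswith("%"):
        scaled = replicas * int(value[:-1])
        # Integer arithmetic avoids floating-point rounding surprises.
        return (scaled + 99) // 100 if round_up else scaled // 100
    return int(value)


def surge_plan(replicas: int,
               max_surge: Union[int, str],
               max_unavailable: Union[int, str] = 0) -> Dict[str, int]:
    """Estimate node counts during a rolling update (simplified model)."""
    surge = resolve(max_surge, replicas, round_up=True)
    unavailable = resolve(max_unavailable, replicas, round_up=False)
    if surge == 0 and unavailable == 0:
        raise ValueError("maxSurge and maxUnavailable cannot both be 0")
    step = surge + unavailable
    return {
        "extra_nodes": surge,                        # VMs created on top of the pool
        "peak_nodes": replicas + surge,              # most nodes alive at one time
        "min_available": replicas - unavailable,     # fewest nodes serving workloads
        "approx_batches": (replicas + step - 1) // step,  # rough replacement rounds
    }


if __name__ == "__main__":
    print(surge_plan(replicas=20, max_surge=4, max_unavailable=0))
```

For a hypothetical 20-node pool with maxSurge 4 and maxUnavailable 0, this model reports a peak of 24 nodes and roughly 5 replacement rounds. Multiplying extra_nodes by the per-VM CPU, memory, and storage sizing gives an estimate of the headroom to reserve.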


Environment

VMware Telco Cloud Automation 2.3

Resolution

Pre-requisites

  1. Run cluster diagnostics to confirm that the workload cluster is healthy.
  2. Run pre-checks to confirm that the network function is up and the application is healthy.
  3. Schedule a maintenance window or isolate traffic before starting the node pool upgrade.
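
As a supplementary check alongside the TCA cluster diagnostics (not a replacement for them), a short script can list any nodes that are not Ready. This is a minimal sketch that assumes kubectl is installed and already pointed at the workload cluster. The function names are illustrative.

```python
import json
import subprocess
from typing import List


def not_ready_nodes(node_list: dict) -> List[str]:
    """Return the names of nodes whose Ready condition is not "True"."""
    bad = []
    for node in node_list.get("items", []):
        conditions = node.get("status", {}).get("conditions", [])
        ready = any(
            c.get("type") == "Ready" and c.get("status") == "True"
            for c in conditions
        )
        if not ready:
            bad.append(node["metadata"]["name"])
    return bad


def fetch_nodes() -> dict:
    """Read the node list from the cluster in the current kubectl context."""
    result = subprocess.run(
        ["kubectl", "get", "nodes", "-o", "json"],
        check=True, capture_output=True, text=True,
    )
    return json.loads(result.stdout)


if __name__ == "__main__":
    problems = not_ready_nodes(fetch_nodes())
    print("All nodes Ready" if not problems else f"Not Ready: {problems}")
```

An empty result means every node reports Ready. Any node named in the output is worth investigating before the upgrade starts.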

Procedure
  1. Upgrade the control plane for the workload cluster to the tbr-bom-2.3.0-v1.24.10 BOM version.
  2. Upgrade the node pool to the tbr-bom-2.3.0-v1.24.10 BOM version.
    1. Under the Edit node pool panel, select the tbr-bom-2.3.0-v1.24.10 BOM version and update the VM template.
    2. Manually edit the TCAnodepool CR to set the TKG upgrade strategy as shown below. In this example, the workload cluster is wc04 and the node pool is np1:
      strategy:
        rollingUpdate:
          maxSurge: 4
          maxUnavailable: 0
        type: RollingUpdate
  3. After the node pool upgrade completes, confirm that the testNF pods have returned to the Running state.
  4. Verify that the testNF application has been restored to its pre-upgrade state.
  5. Confirm that the application is functioning as expected.
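
One way to support the post-upgrade checks is to compare a pod snapshot taken before the upgrade with one taken afterwards. The sketch below assumes both snapshots are the parsed JSON output of `kubectl get pods -n <namespace> -o json` for the namespace where testNF runs. The function names are illustrative. It compares counts by phase and readiness rather than pod names, since pods rescheduled onto new nodes may receive new names.

```python
from collections import Counter


def summarize_pods(pod_list: dict) -> Counter:
    """Count pods by (phase, all containers ready)."""
    summary = Counter()
    for pod in pod_list.get("items", []):
        status = pod.get("status", {})
        containers = status.get("containerStatuses", [])
        all_ready = bool(containers) and all(c.get("ready") for c in containers)
        summary[(status.get("phase", "Unknown"), all_ready)] += 1
    return summary


def restored(before: dict, after: dict) -> bool:
    """True when the post-upgrade summary matches the pre-upgrade summary."""
    return summarize_pods(before) == summarize_pods(after)
```

A matching summary only shows that the same number of pods are Running and ready. It does not replace the functional checks on the application itself.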