All certificates are not replaced by CARR in a Federation environment on first run
search cancel

All certificates are not replaced by CARR in a Federation environment on first run

book

Article ID: 416582

calendar_today

Updated On:

Products

VMware NSX

Issue/Introduction

  • CARR script has been run in a Federation environment to replace expired and expiring certificates
  • Expired/Expiring certificates are still present, and have a "Used By" value of 1 which is mainly the CLIENT_AUTH certificate entry.

Environment

VMware NSX 4.x

Cause

The need for multiple CARR runs stems from the cross-site trust model used in NSX Federation. 

Federation environments rely on Principal Identities (PIs) to trust and communicate securely between sites (GM <-> LM, GM <-> GM). When a site's Local or Global Manager Platform Certificate is changed, the corresponding Principal Identity on the remote site must be updated with the new certificate. 

When CARR runs on a site, it first remediates all self-signed certificates local to that site. After the local certificate is replaced, it is automatically pushed via site sync to the other LM or GM sites to update the remote Principal Identity.

The CARR remediation process is not fully aware of the stale, expired certificates that were replaced on remote sites during previous operations. The synchronization of the newly pushed certificate does not automatically clean up the previously replaced (now stale) certificate entries on the remote node's trust store. This results in stale certificate entries that remain on the remote appliance with a "Used By" value of CLIENT_AUTH. These stale certificates are not local to the original site, and therefore, CARR cannot identify and delete them in the first local run.

Resolution

In Federation environments CARR script must be run at least twice on all sites to complete self signed certificate rotation.
Stale certificates will be removed automatically on additional runs.

Note: CARR script should be run on all sites completely first before running again for a second time.

Additional Information

Using Certificate Analyzer, Results and Recovery (CARR) Script to fix certificate related issues in NSX