Unable to register the Gateways/PDPs:
Release: 1.14.50
Debugging the reported issue, following the issue description, the apparent cause of the "Unregistered" state of all the gateways, including the PDP and the management, was the expired certificate. For the technical tasks to validate this, please refer to the below.
for GW / management:
Connect to all the problematic GW,
Search for the “async_services_storage”
folder.
Execute openssl x509 -enddate -noout -in
<path_to_async_services_storage>/ca_certificate.pem
Execute openssl x509 -enddate -noout -in
<path_to_async_services_storage>/async_services_cert.pem
Check the validity of the output. On all the gateways, it was confirmed that the certificate had expired on February 08, 2022.
for PDP
Connect to the problematic PDP gateways,
Search for the “pdp_storage”
folder.
Execute openssl x509 -enddate -noout -in
<path_to_pdp__storage>/ca_certificate.pem
Execute openssl x509 -enddate -noout -in
<path_to_pdp__storage>/pdp_cert.pem
Check the validity of the output. On all the PDP, it was confirmed that the certificate had expired on February 08, 2022.
Note that the patch is temporary and does not survive upgrade! It is preferable to perform the manual certificate renewal process (Manual - certificate renewal ), but if the environment is already down, or the customer has a 1.13 version, proceed with the TEMPORARY patch, which has to be provided by R&D. This implementation will require about 15 minutes, per gateway, and with one-minute downtime, per gateway, for the restart of the containers.
Also note that certificate expiration in itself wouldn't change the state of all the GWs to "Unregistered", and to help investigate and work to identify all the antecedents that followed the "Unregistered" state of the GWs, to help work to prevent future occurrence, requisite log files within the /var/log/fireglass.log directory, from all the affected GWs/PDPs would be required.