COMPLETED_WITH_FAILURE0 update status and become inaccessible./var/log/vmware/vcf/sddc-manager/sddcManagerServer.log the following error is observed: ERROR: axios.error.response.data {"errorCode":"IDENTITY_SAML_TOKEN_EXPIRED","message":"The SAML token has expired on YYYY-MM-DD"}/var/log/vmware/vcf/lcm/thirdparty/upgrades/########-####-####-####-########/vcf-platform/upgrade/vcf_platform_upgrade.logINFO: Updated /var/log/vmware/vcf/lcm/thirdparty/upgrades/########-####-####-####-########/vcf-platform/upgrade/vcf_platform_upgrade.status status file with data OrderedDict([('upgradeId', '########-####-####-####-########'), ('resourceId', '########-####-####-####-########'), ('upgradeStatusCode', 'INPROGRESS'), ('progress', 0), ('error', {'errorCode': None, 'errorDescription': None}), ('startTime', 1773073900)])INFO: Updated /var/log/vmware/vcf/lcm/thirdparty/upgrades/########-####-####-####-########/vcf-platform/upgrade/vcf_platform_upgrade.status status file with data OrderedDict([('upgradeId', '########-####-####-####-########'), ('resourceId', '########-####-####-####-########'), ('upgradeStatusCode', 'INPROGRESS'), ('progress', 0), ('error', {'errorCode': None, 'errorDescription': None}), ('startTime', 1773073900)])INFO: Execute cmd: tdnf --disablerepo=* list installed > /var/log/vmware/vcf/lcm/thirdparty/upgrades/########-####-####-####-########/vcf-platform/upgrade/tdnf_list_before_upgrade.txt/var/log/vmware/capengine/cap-update/workflow.log or /var/log/vmware/vcf/lcm/lcm-debug.log shows the process stalled at the following command: INFO: Execute cmd: tdnf --disablerepo=* list installedVCF 9.x
This issue is primarily caused by a transient hang or severe I/O delay in the Photon OS package manager (`tdnf`) while querying the local RPM database. This triggers a hardcoded 1-hour timeout gate within the SDDC Manager Lifecycle Management (LCM) orchestrator.
Why the SAML error occurs: The IDENTITY_SAML_TOKEN_EXPIRED error is a secondary symptom. Because the upgrade task hangs for over 60 minutes, the internal services may become unresponsive or lose connectivity to vCenter. When the system finally attempts to resume or report status after the timeout, it finds the original SAML authentication token has expired and cannot be refreshed while the services are in this stalled state, SDDC Manager Down.
Since the underlying RPM lock or I/O bottleneck is transient, no manual intervention (such as clearing RPM database locks or manually removing packages) is required.