During the upgrade to 25.1, we enabled "Logs for Triage".
The upgrade completed as expected, but "Cluster Manager" reports the following error:
"Failed tenant update job (id" <your_tenant_name> . Reason: 500,1000,-: Update jobs has failed: Onboard dashboards (<your_tenant_name>), - (INTERNAL_ERROR)"
How can this error be fixed?
The below exception is found in the apmservices-manager pod:
1) PATCH the tenant configuration as below:
curl --insecure --location --request PATCH 'https://apmservices-gateway.<wildcardDNS>/tenants/tenant/<dsTenantId>' \
-H 'accept: application/json' \
-H 'content-type: application/json' \
-H 'Cache-Control: no-cache' \
--header 'Authorization: Bearer <token>' \
--data-raw '{"attributes":{"apm.onboarded.dashboards": true}}'
NOTE:
- apmservices-gateway.<wildcardDNS> can be obtained from the connector parameters
- dsTenantId can be obtained from Cluster Manager by hovering the mouse over the tenant name
- <token> can be obtained as below, for example in OpenShift:
oc rsh <apmservices-manager pod>
cat /apmservices.sec/bootstrap.properties
Use the value of the "apm.security.tenantServiceMasterToken" property
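The token lookup above can also be scripted. The following is only a minimal sketch: get_property is a hypothetical helper (it is not part of the product), and it assumes the properties file uses plain "key=value" lines without line continuations.

```shell
# Hypothetical helper (not part of the product): reads a Java-style
# .properties stream on stdin and prints the value of the given key.
# Assumes plain "key=value" lines with no line continuations.
get_property() {
    # Escape dots so the key is matched literally in the sed pattern
    key_pattern=$(printf '%s' "$1" | sed 's/\./\\./g')
    # Drop carriage returns, strip "key=" from the matching line,
    # and print only the first match
    tr -d '\r' \
        | sed -n "s/^[[:space:]]*${key_pattern}[[:space:]]*=[[:space:]]*//p" \
        | head -n 1
}

# Possible usage (the pod name is a placeholder to fill in):
#   TOKEN=$(oc rsh <apmservices-manager pod> cat /apmservices.sec/bootstrap.properties \
#           | get_property apm.security.tenantServiceMasterToken)
```

The resulting value can then be passed to the curl command from step 1 in place of <token>. Treat the token as a secret and avoid leaving it in shell history or log files.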
2) In Cluster Manager, open the tenant configuration and click "Save"
The tenant update will then complete as expected and no error will be reported.
For assistance, contact Broadcom Support.