BOSH deploy job fails due to Dynatrace OneAgent process injection

Article ID: 387118

Products

VMware Tanzu Application Service

Issue/Introduction

The Dynatrace OneAgent injects itself into processes on each BOSH-deployed VM. In this case, the Grafana job in the Healthwatch deployment failed to start because of the Dynatrace OneAgent.

The BOSH deploy failed with the error:

[2025-01-27T20:36:09.391707 #2753507] [canary_update(grafana/<redacted>(0))] ERROR -- DirectorJobRunner: Error updating canary instance: #<Bosh::Director::AgentJobNotRunning: 'grafana/<redacted> (0)' is not running after update. Review logs for failed jobs: grafana>

No errors were observed in the Grafana logs; however, the Grafana bpm.log shows a very long delay in starting the process. In the example below, more than 30 seconds pass between the process start entry and the lifecycle lock release entry.

{"timestamp":"2025-01-28T20:34:33.147164391Z","level":"info","source":"bpm","message":"bpm.start.start-process.starting","data":{"job":"grafana","process":"grafana","session":"1.2"}}

...

{"timestamp":"2025-01-28T20:35:06.113521278Z","level":"info","source":"bpm","message":"bpm.start.releasing-lifecycle-lock.complete","data":{"job":"grafana","process":"grafana","session":"1.3"}}

The Dynatrace OneAgent appears to have introduced significant latency in starting Grafana.
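To quantify the delay between two bpm.log entries such as the ones above, a small helper along these lines can be used. This is only a sketch: the function name `bpm_elapsed` is hypothetical, and it assumes both entries carry a UTC `timestamp` field and fall on the same day.

```shell
#!/bin/sh
# Sketch: print the seconds elapsed between two bpm.log JSON entries.
# Assumption: both entries have a UTC "timestamp" field on the same day.
bpm_elapsed() {
  # $1 = earlier log line, $2 = later log line
  printf '%s\n%s\n' "$1" "$2" | awk '
    # Find the THH:MM:SS[.fraction]Z part of the ISO 8601 timestamp.
    match($0, /T[0-9][0-9]:[0-9][0-9]:[0-9][0-9](\.[0-9]+)?Z/) {
      clock = substr($0, RSTART + 1, RLENGTH - 2)   # drop leading T and trailing Z
      split(clock, part, ":")
      secs[NR] = part[1] * 3600 + part[2] * 60 + part[3]
    }
    END { printf "%.2f\n", secs[2] - secs[1] }'
}

# Entries trimmed to the timestamp field of the two log lines in this article.
bpm_elapsed '{"timestamp":"2025-01-28T20:34:33.147164391Z"}' \
            '{"timestamp":"2025-01-28T20:35:06.113521278Z"}'
# prints 32.97
```

A result of roughly 33 seconds for a process that normally starts quickly is consistent with the job not being reported as running by the time the director checks the canary.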

Resolution

Exclude the Dynatrace OneAgent add-on from the deployment that is failing.

Perform the following steps:

Dump the current runtime config:
```
bosh runtime-config > runtime.cfg
```

Edit the runtime config and add an exclude block to the Dynatrace add-on entry, listing the failing deployment:

vim runtime.cfg

exclude:
  deployments:
  - <failed deployment>
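For context, here is a minimal sketch of where the exclude block sits inside a runtime config. The add-on, job, and release names below are hypothetical placeholders; keep whatever names the dumped runtime.cfg already contains and add only the exclude section. BOSH expects `deployments` to be a list of deployment names.

```yaml
# Sketch only: the releases section and any other add-ons are omitted.
addons:
- name: dynatrace-oneagent          # hypothetical add-on name
  jobs:
  - name: dynatrace-oneagent        # hypothetical job name
    release: dynatrace-oneagent     # hypothetical release name
  exclude:
    deployments:
    - <failed deployment>           # name of the deployment that fails
```

Because the exclusion is scoped to a single add-on entry, other add-ons in the same runtime config continue to apply to the excluded deployment.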

Update the runtime config:

bosh update-runtime-config runtime.cfg

Run a BOSH deploy, or apply changes in Ops Manager.
