Symptoms:
You are unable to run a deployment. The deployment fails during the upload-releases phase.
The failure can appear random because it depends on the load on the Director database instance. It is observed when the Director uses an external database instance, primarily on AWS, although GCP and Azure could be affected too: limited IOPS on the external instance can stall upstream tasks from the Director VM.
The upload-releases task times out after 15 minutes and retries 5 times before the deployment is aborted.
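As a rough guide to how long a deployment can hang before it fails, the sketch below multiplies the 15-minute timeout by the number of attempts. The text does not say whether the 5 retries include the first attempt, so both readings are computed. The function name and its parameter are illustrative only and are not part of any BOSH or Ops Manager tooling.

```python
from datetime import timedelta

TIMEOUT_PER_ATTEMPT = timedelta(minutes=15)  # timeout stated in the symptoms
RETRIES = 5                                  # retry count stated in the symptoms


def worst_case_wait(retries_include_first_attempt: bool) -> timedelta:
    """Total time spent waiting if every attempt runs into the timeout."""
    # Assumption: either 5 attempts in total, or 1 initial attempt plus 5 retries.
    attempts = RETRIES if retries_include_first_attempt else RETRIES + 1
    return TIMEOUT_PER_ATTEMPT * attempts


print(worst_case_wait(True))   # 5 attempts in total
print(worst_case_wait(False))  # 1 initial attempt + 5 retries
```

Under either reading, a single stalled release upload can hold up the deployment for well over an hour before the failure is reported.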
The deployment log shows the following error:
===== 2018-08-30 16:17:50 UTC Running "/usr/local/bin/bosh --no-color --non-interactive --tty --environment=192.168.1.11 upload-release /var/tempest/releases/php-offline-buildpack-4.3.57-3586.27.0.tgz"
Using environment '192.168.1.11' as client 'ops_manager'
100.00% 0 B/s 15m0s
Uploading release file:
Director responded with non-successful status code '504' response '<html>
<head><title>504 Gateway Time-out</title></head>
<body bgcolor="white">
<center><h1>504 Gateway Time-out</h1></center>
<hr><center>nginx</center>
</body>
</html>
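
To check whether a saved deployment log shows this symptom, a small script can look for an upload-release command followed by a 504 response from the Director. The following is a minimal sketch that assumes the log keeps the line format shown in the excerpt above. The function name and both regular expressions are illustrative and are not part of the BOSH CLI.

```python
import re
from typing import List

# Assumed line formats, taken from the log excerpt above.
UPLOAD_CMD = re.compile(r'Running ".*\bupload-release\s+([^\s"]+)"')
GATEWAY_TIMEOUT = re.compile(
    r"Director responded with non-successful status code '504'"
)


def find_upload_timeouts(log_text: str) -> List[str]:
    """Return release files whose upload-release command ended in a 504."""
    failed = []
    current_release = None
    for line in log_text.splitlines():
        match = UPLOAD_CMD.search(line)
        if match:
            # Remember which release the following output belongs to.
            current_release = match.group(1)
        elif GATEWAY_TIMEOUT.search(line) and current_release is not None:
            failed.append(current_release)
            current_release = None
    return failed


# Usage with a shortened copy of the excerpt above.
SAMPLE = """\
===== 2018-08-30 16:17:50 UTC Running "/usr/local/bin/bosh --no-color --non-interactive --tty --environment=192.168.1.11 upload-release /var/tempest/releases/php-offline-buildpack-4.3.57-3586.27.0.tgz"
Using environment '192.168.1.11' as client 'ops_manager'
Director responded with non-successful status code '504' response '<html>
"""
print(find_upload_timeouts(SAMPLE))
```

A release that appears more than once in the result was retried and timed out each time, which matches the retry behavior described in the symptoms.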