BOSH recreate or resurrection tasks kept queuing up. BOSH tries to recreate the missing VM and fails with Stemcell snapshot missing errors.
This error can be seen in the bosh task <#> --debug
output:
E, [2018-02-02 11:00:09 #10383] [task:32342] ERROR -- DirectorJobRunner: Error resolving problem '3876': Unknown CPI error 'Unknown' with message 'The object[s] '<[Vim.VirtualMachine] vm-7068>' should have the following properties: ["snapshot"] , but they were missing these: #<Set: {"snapshot"}> .' in 'create_vm' CPI method
BOSH deployments have multiple stemcell VMs. In a vSphere environment, BOSH will create a vSphere snapshot of the stemcell VM with the name "initial". Then BOSH will use this snapshot to clone new VMs for use in a deployment.
If the snapshot on the stemcell is deleted or missing, BOSH won't be able to create any new VMs using the stemcell. This can cause a backlog in the BOSH tasks if it is trying to resurrect a VM because the task will take few minutes to fail or timeout while other tasks are already queuing up.
bosh tasks --no-filter
bosh task <#> --debug
Workaround
In vSphere vCenter console, create the snapshot manually, naming the snapshot "initial".