Dollar Universe - Jobs abort with "Error trying to kill processes with same group id" message
Article ID: 132779
CA Automic Dollar Universe
The error "Error trying to kill processes with same group id as process with PID [-27621]. Return code [-1]" occurs when Time Control Management performs a "Force Complete" at the same moment that the job's submission enters the "Running" status.
Dollar Universe 6 with Time Control Management
The issue occurs when "Time control management" is enabled on a heavily loaded node that runs a large number of parallel executions.
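For context, the "Return code [-1]" in the message resembles what a generic POSIX process-group kill reports when the targeted group does not exist at the moment of the call. The sketch below is only an illustration of that general operating-system behavior, written under that assumption. It is not Dollar Universe code, and the helper name kill_group is hypothetical.

```python
import os
import signal
import subprocess
import sys


def kill_group(pgid: int) -> int:
    """Send SIGTERM to every process in group `pgid`.

    Returns 0 on success and -1 if no such process group exists,
    mirroring the convention of the POSIX kill() call.
    (Hypothetical helper for illustration only.)
    """
    try:
        os.killpg(pgid, signal.SIGTERM)
        return 0
    except ProcessLookupError:
        # The group is not (or no longer) present when the signal is sent.
        return -1


# Start a short-lived child in its own session, so that its
# process group id is the same as its PID.
child = subprocess.Popen([sys.executable, "-c", "pass"], start_new_session=True)
pgid = child.pid
child.wait()  # the child has exited and been reaped, so the group is gone

# The kill arrives at the wrong moment and fails.
result = kill_group(pgid)
print(f"kill_group({pgid}) returned {result}")
```

In this sketch the kill fails because it is issued when the target group is not there to receive it. On a node with many parallel executions, a kill request that coincides with a job's state change is more likely to hit such a window, which is consistent with the recommendations in this article.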
To avoid this type of issue, we recommend the following:
a) Reduce the number of simultaneous submissions, either by setting a DQM job limit or by splitting the load between different nodes.
b) Increase the Time Control Management period from 30 seconds to 120 seconds.