Randomly, a bunch of jobs go into status Aborted with Action "could not submit".
The universe.log of the node where the logical queue resides contains the following kind of errors
|ERROR|X|DQM|pid=p.t| u_dqm_sub_job | dqm_ecriture_article_job returns 6 [DQM_EMTRY QUEUE ... ]
|ERROR|X|DQM|pid=p.t| owls_dqm_job_submit | u_dqm_sub_job returns 6
|ERROR|X|DQM|pid=p.t| process_fnc_obj | process_create in error: 6
When looking into the file u_jobfile.dta we can find already old lines for that DQM_ENTRY from previous unsuccesfull job submissions.
Restarting those Aborted jobs works fine, jobs are submitted succesfully during the next attempt as their DQM_ENTRY is different.
Dollar Universe 6.x and 7.x
Defect
Launch uxresetque queue=QUEUE_NAME on the impacted queue while no jobs are running into that queue so that the old entries are removed from u_jobfile.dta and u_prmfile.dta
Update to a fix version listed below or a newer version if available.
Fix version(s):
Component: Dollar Universe Application Server (DUAS Node)
Dollar Universe 7.01.01 - Planned to be released end of September 2024
Defect ID: DU_AS-7622
Public Description: A problem has been fixed where job aborts with "u_dqm_sub_job returns 6" error message (duplicated key) due to the presence of deprecated records in DQM data files.