distsrv: how to clear distribution jobs that are stuck

Article ID: 47874

Products

  • DX Unified Infrastructure Management (Nimsoft / UIM)
  • CA Unified Infrastructure Management On-Premise (Nimsoft / UIM)
  • CA Unified Infrastructure Management SaaS (Nimsoft / UIM)

Issue/Introduction

Occasionally the distribution server (distsrv) does not complete the distribution of a package. When this happens, the package distribution information remains in the distsrv configuration, and each time the probe restarts it retries the distribution in a loop without success.

Eventually, this causes problems with the distsrv probe. Symptoms include, but are not limited to:

  • Slow performance
  • Intermittent failures when deploying packages
  • "Temporarily Out Of Resources" error messages in the log
  • The distsrv probe turns red with a "Max restarts reached" error in the log. The error conditions force the probe to restart itself in an attempt to recover. Because recovery never succeeds, the probe eventually hits the restart limit and goes red.
  • Queued distsrv jobs
  • 'Stuck' jobs that can't be removed/deleted
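Two of the symptoms above are log messages, so a quick count of them can help confirm the diagnosis. The following is a minimal Python sketch, not part of the product. It assumes you have already read the relevant log file into a string, and it matches the phrases case-insensitively because the exact wording and capitalization in the log may differ from what is quoted here. The function name `count_symptoms` is hypothetical.

```python
from collections import Counter

# Phrases taken from the symptom list above; the exact log wording is an assumption.
SYMPTOMS = ("Temporarily Out Of Resources", "Max restarts reached")


def count_symptoms(log_text: str) -> Counter:
    """Count how many log lines contain each symptom phrase (case-insensitive)."""
    counts = Counter()
    for line in log_text.splitlines():
        lowered = line.lower()
        for phrase in SYMPTOMS:
            if phrase.lower() in lowered:
                counts[phrase] += 1
    return counts
```

To use it, read the log of interest (for example with `pathlib.Path(...).read_text(errors="replace")`) and pass the text to `count_symptoms`. The log file's name and location depend on your installation and are not specified here.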

Environment

  • Release: UIM 20.3 or higher
  • Component: UIMDSV

Resolution

  1. Deactivate distsrv on local and remote hubs.

  2. Open Raw Configure and navigate to the Tasks section. If any installs are listed (they appear under the name given when they were created), right-click and remove/delete them.

  3. Cold start distsrv on the hub(s) via Deactivate->Activate.

  4. If leftover jobs that you want to remove still appear in the "View Distribution Jobs" window in IM (Infrastructure Manager), try the following:

     a. Deactivate the distsrv probe.

     b. Navigate to the distsrv 'jobs' folder under \probes\service\distsrv\jobs and delete the contents of the jobs folder.

     c. If leftover jobs showing '0% progress' still cannot be removed, open distsrv.cfg and delete all entries in the install list of the distsrv section, e.g.:

          <install_list>
             ...job1
             ...job2
          </install_list>

     d. Restart the Primary hub robot.

That should clear out all of the old or stuck jobs in the View Distribution progress.

To confirm, open the Tools->Distribution window in IM; no leftover jobs should appear.
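Steps b and c are plain file operations, so they can be scripted if the cleanup has to be repeated on several hubs. The following is a minimal Python sketch under stated assumptions, not a supported tool. It assumes that the distsrv probe is already deactivated (step a), that `distsrv.cfg` and the `jobs` folder sit directly in the distsrv probe directory, and that the install list is delimited by lines containing exactly `<install_list>` and `</install_list>`, as in the example in step c. The function names are hypothetical. The sketch copies the configuration to `distsrv.cfg.bak` before changing it.

```python
import shutil
from pathlib import Path


def clear_jobs_folder(jobs_dir: Path) -> int:
    """Step b: delete everything inside jobs_dir but keep the folder itself."""
    removed = 0
    for entry in jobs_dir.iterdir():
        if entry.is_dir() and not entry.is_symlink():
            shutil.rmtree(entry)
        else:
            entry.unlink()
        removed += 1
    return removed


def strip_install_list(cfg_text: str) -> str:
    """Step c: drop every line between <install_list> and </install_list>.

    The opening and closing tag lines are kept, so the section remains
    present but empty. Assumes each tag is on a line of its own.
    """
    kept = []
    inside = False
    for line in cfg_text.splitlines(keepends=True):
        tag = line.strip()
        if tag == "</install_list>":
            inside = False
        if not inside:
            kept.append(line)
        if tag == "<install_list>":
            inside = True
    return "".join(kept)


def clean_distsrv(probe_dir: Path) -> None:
    """Back up distsrv.cfg, empty its install list, then clear the jobs folder."""
    cfg = probe_dir / "distsrv.cfg"
    shutil.copy2(cfg, cfg.with_name(cfg.name + ".bak"))  # keep a restore point
    cfg.write_text(strip_install_list(cfg.read_text()))
    clear_jobs_folder(probe_dir / "jobs")
```

Call `clean_distsrv(Path(...))` with the full path to the distsrv probe directory on the hub. The installation root in front of `\probes\service\distsrv` varies by system and is not assumed here. Afterwards, continue with step d.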

Additional Information

Note: Redeploying the distsrv probe sometimes also resolves lingering Max restarts issues.

In older versions of distsrv, the jobs created via IM Distribution were numbered starting with 0. Any of these old jobs that remain should be removed.