Problem:
The Collect Task fails with the message: "Common Server Collect Failed". Recycling the services fixes the problem temporarily, but it returns after a few days.
Cause:
The receiving thread that handles incoming messages on CAITRM_CSERVERM fails to communicate with CAM, and a reconnect is attempted. In the meantime, the receiving thread that handles incoming messages on CA_ITRM_CSERVER also fails to communicate with CAM. The connection for the CAITRM_CSERVERM receiving thread is then recovered. That recovery affects the CA_ITRM_CSERVER receiving thread: it concludes that it never lost its connection and therefore does not attempt to reconnect. Once this condition occurs, the Collect Task begins to fail.
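The sequence above can be illustrated with a small simulation. This is only a sketch of one way such a condition could arise, assuming the two receiving threads consult a single shared "connected" flag. That shared flag, and every class and function name below (LinkState, Receiver, simulate), are hypothetical and are not taken from the Client Automation code.

```python
class LinkState:
    """Hypothetical connection status flag (an assumption for illustration)."""

    def __init__(self):
        self.connected = True


class Receiver:
    """Hypothetical receiving thread bound to one queue name."""

    def __init__(self, queue_name, state):
        self.queue_name = queue_name
        self.state = state      # status flag this receiver consults
        self.link_up = True     # real condition of this receiver's link

    def on_failure(self):
        # Communication with the messaging layer fails.
        self.link_up = False
        self.state.connected = False

    def reconnect_if_needed(self):
        # Reconnect only if the consulted flag reports a lost connection.
        if not self.state.connected:
            self.link_up = True
            self.state.connected = True
            return True
        return False


def simulate(shared_state):
    """Replay the failure order from the Cause section.

    shared_state=True  -> both receivers consult one flag (assumed defect)
    shared_state=False -> each receiver tracks its own status
    """
    common = LinkState()

    def make(name):
        return Receiver(name, common if shared_state else LinkState())

    first = make("CAITRM_CSERVERM")
    second = make("CA_ITRM_CSERVER")

    first.on_failure()            # first receiver loses its connection
    second.on_failure()           # second receiver loses it as well
    first.reconnect_if_needed()   # first recovers and resets the flag
    second.reconnect_if_needed()  # second sees "connected" and does nothing

    return {r.queue_name: r.link_up for r in (first, second)}


if __name__ == "__main__":
    print("shared flag:     ", simulate(shared_state=True))
    print("per-receiver flag:", simulate(shared_state=False))
```

With the shared flag, the second receiver stays disconnected after the first one recovers, which matches the symptom described above. With a separate flag per receiver, both reconnect.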
Solution:
A testfix is available for this problem. Please open a support case and reference problem number DTSVMG-3471. Include the exact version of Client Automation you are running (the output of DSMVER) so that the correct patch can be provided.
Testfix 12.9 # T5IZ303.caz
Testfix 14.0 # T5IZ347.caz