All jobs in Smarts NCM are stuck in a queued for execution state
NCM-10.1.x
Connectivity between the AS and DS hosts should always be tested prior to any attempt to resolve this issue via the steps listed below, since the below steps will have little or no effect if connectivity is at issue. One way to test connectivity is to use the openSSL instance installed by NCM on both the AS and DS hosts to directly test the connection from a command line session on each host. This can be accomplished running the following command in a command line session on the AS and DS hosts to establish a direct connect to each other outside of NCM:
openssl s_client -connect {target host ip}:443 -CApath {NCM home path}/conf/CA/
A successful connection will usually yield a fairly verbose result that contains information about the connection request and the certificate validation. Among the various lines returned, a successful connection will contain output similar to the following two lines:
... CONNECTED(00000003) ... SSL handshake has read 2726 bytes and written 383 bytes ...
If the connection fails, the last line of the output will usually return a failure code in the form of a number that is non-zero. If this occurs, connectivity troubleshooting should be pursued before proceeding with the below steps. The steps listed below may be unnecessary if lost connectivity can be restored. However, if the connection succeeds, or if the issue persists after connectivity is restored, it is likely that the command files NCM uses to communicate between the AS and the DS may be out legitimately out of sync. Out of sync command files must be cleared from the instance in order to allow an all new set of command files to be created by any new jobs that will be properly syncrhonized and able to flow normally between the AS and DS hosts.
To clear NCM command files from the instance, do as follows:
Instance Wide Preliminary Steps:
source /etc/voyence.conf
/etc/init.d/vcmaster stop
Application Server:
su - pgdba -c 'psql voyencedb voyence'
SELECT status, count(*) FROM cm_job WHERE status LIKE '%running' GROUP BY status;
UPDATE cm_job SET status = 'enum.taskStatus.canceled' WHERE status LIKE '%running';
\q
/etc/init.d/vcmaster stop
cd $VOYENCE_HOME/data/appserver/pops find . -name "acmd_*xml" -exec rm -f {} \; find . -name "cmd_*xml" -exec rm -f {} \; find . -name "status_*" -exec rm -f {} \;
Device Server:
cd $VOYENCE_HOME/data/devserver/syssync find . -name "acmd_*xml" -exec rm -f {} \; find . -name "cmd_*xml" -exec rm -f {} \; find . -name "status_*" -exec rm -f {} \;
/etc/init.d/vcmaster start