A Dashboard scheduled to send via email arrives with query failures for the Views.
The same Dashboard runs fine in the Portal.
The same Dashboard returns similar query ID failures when the Run Now option.
The same happens for PDF or CSV exports.
The Dashboard has just two Views on it, and the failure is random which one is impacted.
Views on the Dashboard in the PDF show a message like this. The Query ID will be unique for each instance of the error.
Looking in the (default path) /opt/CA/PerformanceCenter/PC/logs/PCService.log file for the Query ID value we see an error for the report. The error states:
Reason: {
Error occurred while running a RIB query on Data Aggregator RIB Source. Query ID: RIBQuery_469cfb1d_91ae_49bc_8713_c3f7578b3cc2
Possible reason: Could not get JDBC Connection; nested exception is java.sql.SQLNonTransientConnectionException: [Vertica][VJDBC](100176) Failed to connect to host <DR_Node_Name-Or-IP> on port 5433. Reason: Failed to establish a connection to the primary server or any backup address.
[Vertica][VJDBC](100176) Failed to connect to host <DR_Node_Name-Or-IP> on port 5433. Reason: Failed to establish a connection to the primary server or any backup address.
[Vertica][VJDBC](100176) Failed to connect to host <DR_Node_Name-Or-IP> on port 5433. Reason: Failed to establish a connection to the primary server or any backup address.
Failed to establish a connection to the primary server or any backup address.
Connection refused (Connection refused)
All supported DX NetOps Performance Management releases.
The Data Repository Vertica database had one of it's nodes in a multi-node cluster down. As a result the query would fail when it needed to use that node to obtain data.
Restart the database node that is down and the errors will be resolved.
To restart a single node that has been done for more than a few days in a multi-node cluster take the following steps.