There are Administrative Events in Performance Management that state:
"Batch process job DimltemsETLjob failed"I
In a report for the last 30 days for Events, we see many instances of the following failure.
Device: DataAggregator:<IP-or-Name>
IM Data Aggregator Administration Event
Reporting ETL Service
Batch process job DimItemsETLJob failed.
Seen with release r3.7.3; can be seen in others newer or older.
Matching the Event to Data Aggregator logging, at the same time in the (default path) /opt/IMDataAggregator/apache-karaf-<version>/data/log/Exception.log we see the following error.
2019-11-26 15:15:19,6272019-11-26 15:15:19,627 | ERROR | ExceptionLog | An existing application exception RECURRED (Key=f53e96e098b360b7e3ea614aa28cedd577f99565), Recurrence count=23 : Exception encountered while performing dimension items batch job. : StatementCallback; SQL <Long vsql query>
[Vertica][VJDBC](4840) ERROR: Subquery used as an expression returned more than one row; nested exception is java.sql.SQLIntegrityConstraintViolationException:
[Vertica][VJDBC](4840) ERROR: Subquery used as an expression returned more than one row | (ExceptionLogger.java:104)
We can see the same exception error appear every hour in line with the appearance of the failed job Events being observed.
Release : 3.7
Component : IM Reporting / Admin / Configuration