We are looking to create an observability grid and need help determining which log files we should look at to provide greater visibility of the systems. Which logs should we be scanning to see if there are potential issues with the service or provide insight on outages for the platforms?
All supported releases
Portal:
/opt/CA/PerformanceCenter/DM/logs/DMService.log
Sync related issues
/opt/CA/PerformanceCenter/EM/logs/EMService.log
Event and email related issues
/opt/CA/PerformanceCenter/PC/logs/PCService.log
UI and connection related issues
/opt/CA/PerformanceCenter/sso/logs/SSOService.log
Authentication related issues
Data Aggregator:
/opt/IMDataAggregator/apache-karaf-*/data/log/karaf.log
The general Data Aggregator log, most everything of concern would be here
/opt/IMDataAggregator/apache-karaf-*/shutdown.log
Any connectivity issues with the Data Repository would be shown here as well
/opt/IMDataAggregator/broker/apache-activemq-*/data/activemq.log
ActiveMQ connectivity issues
Data Collector:
/opt/IMDataCollector/apache-karaf-*/data/log/karaf.log
The general Data Aggregator log, most everything of concern would be here
/opt/IMDataCollector/apache-karaf-*/shutdown.log
Any connectivity issues with the Data Repository would be shown here as well
/opt/IMDataCollector/broker/apache-activemq-*/data/activemq.log
ActiveMQ connectivity issues
Data Repository:
$CatalogDir/drdata/v_drdata_node0001_catalog/vertica.log
The main Vertica log. Most major issue messages will have the work "Panic" in them