After executing a backup and restore script on the Data Aggregator, the DA displays a down status in the Portal system status view.
SYMPTOMS: The Portal system status view shows the Data Aggregator as down.
IMPACT: The Data Aggregator is unavailable and cannot process metrics.
DX NetOps Data Aggregator :: All Supported Versions
CONTEXT: Occurs after restoring the entire /DASharedRepo/ directory from a backup while the Data Aggregator and ActiveMQ services are actively running.
STEPS:
1. STOP SERVICES
Stop the Data Aggregator and ActiveMQ services on both nodes.
EXPECTED: File locks on the shared repository are released.
2. RENAME CORRUPT KAHADB
Path: /DASharedRepo/broker/kahadb
Rename the kahadb folder to kahadb.old.
EXPECTED: The corrupted database directory is moved aside to allow recreation.
3. START DATA AGGREGATOR AND ACTIVEMQ SERVICES
COMMAND: ./dadaemon activate
VERIFY SUCCESS: The Data Aggregator no longer displays a down status in the Portal system status view.
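Step 2 and a basic file-level check after step 3 could be scripted roughly as follows. This is a minimal bash sketch, not a vendor-supplied tool: the function names rename_kahadb and kahadb_recreated are hypothetical, the default path is the one given in step 2, and the sketch assumes the services have already been stopped as described in step 1. The check only confirms that a new kahadb directory exists next to kahadb.old; it does not replace confirming the status in the Portal.

```shell
#!/usr/bin/env bash
# Sketch only. Function names are illustrative and not part of the product.

# Move the KahaDB directory aside (step 2).
# Assumes the Data Aggregator and ActiveMQ services are already stopped.
rename_kahadb() {
    local broker_dir="${1:-/DASharedRepo/broker}"
    local src="$broker_dir/kahadb"
    local dst="$broker_dir/kahadb.old"

    if [ ! -d "$src" ]; then
        echo "kahadb directory not found: $src" >&2
        return 1
    fi
    # If kahadb.old already exists, mv would nest kahadb inside it,
    # so stop and let the operator decide what to do with the old copy.
    if [ -e "$dst" ]; then
        echo "refusing to overwrite existing $dst" >&2
        return 2
    fi
    mv -- "$src" "$dst"
}

# After the services are started again (step 3), check that a new
# kahadb directory exists alongside the renamed one.
kahadb_recreated() {
    local broker_dir="${1:-/DASharedRepo/broker}"
    [ -d "$broker_dir/kahadb" ] && [ -d "$broker_dir/kahadb.old" ]
}

# Example (run manually once the services are stopped):
#   rename_kahadb /DASharedRepo/broker
```

Refusing to proceed when kahadb.old already exists keeps an earlier copy from being silently merged or nested on a repeat run.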
ROOT CAUSE: The ActiveMQ database (kahadb) is corrupted when the /DASharedRepo/ directory is restored from a backup while the Data Aggregator services are still running. On Fault Tolerant Data Aggregators, the kahadb is stored in the shared /DASharedRepo/ path, so restoring the whole directory also overwrites the database that the running services are using.
PREVENTION: Do not back up and restore the entire /DASharedRepo/ directory. Scripts should target only specific configuration directories, such as /DASharedRepo/certifications/custom and /DASharedRepo/certifications/extensions.
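A backup script that follows this advice might look like the sketch below. It is an illustration under stated assumptions rather than a supported utility: the function name backup_custom_certs and the archive location are hypothetical, bash and tar are assumed to be available, and the two directories are the examples named above. Add other configuration directories only if you know they are safe to restore.

```shell
#!/usr/bin/env bash
# Sketch only. Archives the two certification directories named above
# and nothing else from the shared repository.

backup_custom_certs() {
    if [ "$#" -ne 2 ]; then
        echo "usage: backup_custom_certs REPO_ROOT ARCHIVE" >&2
        return 64
    fi
    local repo_root="$1"
    local archive="$2"
    local -a targets=(certifications/custom certifications/extensions)
    local t

    for t in "${targets[@]}"; do
        if [ ! -d "$repo_root/$t" ]; then
            echo "missing directory: $repo_root/$t" >&2
            return 1
        fi
    done
    # -C makes the archived paths relative to the repository root,
    # so broker/kahadb can never end up in the archive.
    tar -czf "$archive" -C "$repo_root" "${targets[@]}"
}

# Example (archive path is a placeholder):
#   backup_custom_certs /DASharedRepo /tmp/da-certs-backup.tar.gz
```

Because the archive stores paths relative to the repository root, it can later be extracted with tar -xzf and -C pointing at the same root, and only those two directories are touched.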