In versions of vRealize Automation 7.5 and above the Lucene subsystem has been moved to Postgres which is more resilient to data corruption.
For 7.4 and earlier, the workaround to this issue is to clean up the Health Service sandbox and restart the service.
Note: Cleaning Health Service sandbox will delete all pre-configured tests.
Workaround:
The below mentioned steps need to be completed on each node before moving to the next step:
- Stop the Health Service monitor by commenting out the cron job in /etc/cron.d/monitor-vrhb-cron
- Kill any instances of the monitor that might be running:
ps -A | grep monitor-vrhb.sh | awk '{print $1}' | xargs --no-run-if-empty kill -9 $1
- Stop the Health Service
service vrhb-service stop
- Verify the service is stopped, if a process is found kill it manually.
ps aux | grep Quorum
- Cleanup up the Health Service datastores (aka Sandboxes)
rm -r /var/lib/vrhb/service-host/sandbox
rm -r /var/lib/vrhb/vra-tests-host/sandbox
- Restart the Health Service
service vrhb-service start
- Re-enable the Health Service Monitor by uncommenting the cron job in /etc/cron.d/monitor-vrhb-cron