Post reboot we need to validate the servers and make sure all our services and applications are running as expected.
Can you please help the steps or things we need to validate ( by way of automation/script) to check that all is well ?
Release : 20.3
Component : UIM - ROBOT
Here are some additional log messages and functionality which can be monitored which may be of interest:
In robot log you should always see the robot successfully establishing contact with hub:
Jul 30 14:50:21:912  0 Controller: Hub CoreA(10.173.36.161) contact established
When robot is fully started it must be in LISTENING state for TCP connections (can be checked e.g. via netstat, or use telnet or other utility) on ports:
When hub finishes starting up the following will always be logged:
Jul 30 14:52:21:072  0 hub: hubi main thread started
When hub is fully started it must be in LISTENING state for TCP connections on ports:
When data_engine fully starts up you would see the following:
Jul 30 14:42:33:229  0 de: data_engine starting main processing loop. vbRun=1 vbShutdown=0
wasp probe always logs the following when it is fully started up:
Jul 30 14:44:08:005 INFO [main, com.nimsoft.nimbus.NimProbe] ****************[ Starting ]****************
Additionally you can check wasp.cfg and look for http_port and/or https_port setting (usually 80 or 8080 or 443 or 8443). These port(s) should be listening for TCP connections and similar to hub/robot ports you can check them for availability.
when cabi probe is operational it will always log the following:
Jul 30 14:53:01:595 [UserSynchronizationThread, cabi] Finished synchronizing users between UIM and CABI
For any other probe which might be of concern you can follow a process like:
- set loglevel on probe to "0"
- deactivate/activate probe
- check which messages are logged consistently on probe startup.
Any log message which appears on level 0 would also appear at any other loglevel.