Error authenticating user"FAILED: Unable to get user data. Possible Cassandra is down."li-reset-admin-passwd.sh script results in error output: "FAILED: Unable to get user data. Possible Cassandra is down."/var/log/loginsight/runtime.log
java.util.concurrent.ExecutionException: com.datastax.oss.driver.api.core.servererrors.ReadTimeoutException: Cassandra timeout during read query at consistency QUORUM (2 responses were required but only 1 replica responded).
/storage/core/loginsight/var/cassandra.log contains entries similar to the following:
INFO [main] NativeTransportService.java:73 - Netty using Java NIO event loop
WARN [main] NativeTransportService.java:166 - epoll not available
java.lang.UnsatisfiedLinkError: /tmp/libnetty_tranport_native_epoll_x86_##################.so: /tmp/libnetty_transport_native_epoll_x86_##################.so: failed to map segment from shared object
at java.lang.ClassLoader$NativeLibrary.load0(Native Method) ~[?:?]
at java.lang.ClassLoader$NativeLibrary.load(Unknown Source) ~[?:?]
at java.lang.ClassLoader$NativeLibrary.loadLibrary(Unknown Source) ~[?:?]
The Cassandra database is in an inconsistent state, resulting in data retrieval failures.
nodetool-no-pass status UN, a status of DN indicates that the Cassandra service is not running as expected on that node (UN = Up Node ; DN = Down Node)
systemctl stop loginsight
/usr/lib/loginsight/application/sbin/li-cassandra.sh --startnow --force UN status, stop/start cassandra on those nodes /usr/lib/loginsight/application/sbin/li-cassandra.sh --stopnow --force
/usr/lib/loginsight/application/sbin/li-cassandra.sh --startnow --forceUN status) run flush/repair on all nodes:nodetool-no-pass flush
nodetool-no-pass repair
/usr/lib/loginsight/application/sbin/li-cassandra.sh --stopnow --force
systemctl start loginsightnodetool-no-pass statusNote:
Remove all snapshots as soon as they are no longer needed.
Snapshots existing for longer than 72 hours will cause performance issues. Check Best practices for using VMware snapshots in the vSphere environment for more details.
If you still encounter the error do proceed with performing sequential reboot of all the nodes.