The Integrated Load Balancer (VIP) for a VCF/Aria Operations for Logs multi-node cluster is stuck in an "Unavailable" state.
The /storage/core/loginsight/var/runtime.log shows the nodes flapping, repeatedly losing and regaining leadership several times per second:
["LeaderElectionStateUpdaterScheduler-thread-2"/<IP_ADDRESS> INFO] [com.vmware.loginsight.election.cassandra.PullBasedLeaderElection] [Discovered that leadership was lost, attempt to become leader, Group: defaultLeadersGroup, ParticipantId: ########-####-####-####-############]
["LeaderElectionStateUpdaterScheduler-thread-2"/<IP_ADDRESS> INFO] [com.vmware.loginsight.election.cassandra.PullBasedLeaderElection] [Became leader again, Group: defaultLeadersGroup, ParticipantId: ########-####-####-####-############]
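To gauge how severe the flapping is, the two messages above can be counted in runtime.log. The following is a minimal sketch, not a vendor tool: the function name count_leadership_events is hypothetical, and it assumes the message text matches the excerpt above exactly.

```python
import sys
from collections import Counter

# Message fragments copied from the runtime.log excerpt in this article.
LOST = "Discovered that leadership was lost"
REGAINED = "Became leader again"


def count_leadership_events(lines):
    """Count lost/regained leadership messages in an iterable of log lines.

    Hypothetical helper for illustration only.
    """
    counts = Counter()
    for line in lines:
        # str.count also handles several entries on one physical line.
        counts["lost"] += line.count(LOST)
        counts["regained"] += line.count(REGAINED)
    return counts


if __name__ == "__main__" and len(sys.argv) > 1:
    # Pass the log path as the first argument, for example
    # /storage/core/loginsight/var/runtime.log
    with open(sys.argv[1], errors="replace") as fh:
        print(dict(count_leadership_events(fh)))
```

A healthy cluster should log these messages rarely; counts that keep climbing while the VIP is "Unavailable" are consistent with the symptom described here.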
Additionally, the /storage/core/loginsight/var/cassandra.log shows write failures and large numbers of tombstone cells being read:
WARN [ReadStage-1] <DATE_TIME> ReadCommand.java:605 - Read 0 live rows and 6182 tombstone cells for query SELECT * FROM machine_learning.spock_global_queries_v2 WHERE bucket = 0 LIMIT 5000 ALLOW FILTERING; token -#################### (see tombstone_warn_threshold)
...
java.util.concurrent.ExecutionException: com.datastax.oss.driver.api.core.servererrors.WriteFailureException: Cassandra failure during write query at consistency TWO (2 responses were required but only 0 replica responded, 1 failed)
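The same check can be applied to cassandra.log to see which tables produce the tombstone warnings and how many write failures were logged. This is a rough sketch under stated assumptions: summarize_cassandra_log is a hypothetical helper, and the regular expression assumes warnings follow the exact format of the excerpt above.

```python
import re
import sys
from collections import defaultdict

# Pattern derived from the tombstone warning shown in this article.
TOMBSTONE_RE = re.compile(
    r"Read (\d+) live rows and (\d+) tombstone cells "
    r"for query SELECT .*? FROM (\S+)"
)
WRITE_FAILURE = "WriteFailureException"


def summarize_cassandra_log(lines):
    """Return (tombstone cells per table, number of write-failure lines).

    Hypothetical helper for illustration only.
    """
    tombstones = defaultdict(int)
    write_failures = 0
    for line in lines:
        match = TOMBSTONE_RE.search(line)
        if match:
            # Group 3 is keyspace.table, group 2 the tombstone cell count.
            tombstones[match.group(3)] += int(match.group(2))
        if WRITE_FAILURE in line:
            write_failures += 1
    return dict(tombstones), write_failures


if __name__ == "__main__" and len(sys.argv) > 1:
    # Pass the log path as the first argument, for example
    # /storage/core/loginsight/var/cassandra.log
    with open(sys.argv[1], errors="replace") as fh:
        per_table, failures = summarize_cassandra_log(fh)
    print("tombstone cells per table:", per_table)
    print("write failure lines:", failures)
```

The totals are sums over every matching warning, so they indicate how often a table is hit rather than how many tombstones it currently holds.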