Users experience intermittent failures when attempting to log in using Active Directory (AD) credentials. Local admin accounts function correctly.
The following symptoms are observed:
One node in the cluster has a significantly different uptime compared to other nodes.
One node may appear to be rebooting consistently or recently recovered from a crash.
Errors such as unknown_ca or authentication timeouts appear in the runtime.log on specific nodes:
[2025-06-12 09:15:57.626+0000] ["netty-event-loop-67"/##.###.###.# ERROR] [play.core.server.netty. PlayRequestHandler] [Exception caught in Netty] io.netty.handler.codec. DecoderException: javax.net.ssl. SSLHandshakeException: Received fatal alert: unknown_ca
The cassandra.log may show:
2025-06-12T09:28:40,252 AbstractChannelHandlerContext.java:311 - An exception 'java.lang. NullPointerException' [enable DEBUG level for full stacktrace] was thrown by a user handler's exceptionCaught() method while handling the following io.netty.handler.codec. DecoderException: javax.net.ssl. SSLHandshakeException: java.lang.Unknown_ca
Aria Operations for Logs 8.x
The cluster is in an inconsistent state. This occurs when one node reboots or falls out of sync while other nodes remain active for an extended period. This uptime mismatch can cause authentication requests to fail intermittently depending on which node processes the request.
Perform a controlled rolling reboot of the cluster nodes to restore consistency.
Verify Cluster Health
Log in to the Primary node UI as a local admin user.
Navigate to Administration > Cluster.
Note the IP addresses/FQDNs of all nodes (Primary and Workers).
Power off Worker Nodes
From vSphere, go to the Virtual Machine for each Worker node and click Shut Down Guest OS until all Workers are off.
Power off Primary Node
From vSphere, go to the Primary Virtual Machine and click Shut Down Guest OS.
Power on Primary Node
From vSphere, go to the Primary Virtual Machine and click Power On. Wait for it to initialize.
Power on Worker Nodes
From vSphere, power on the Worker nodes one by one.
Verify Resolution
Once the cluster is fully online, attempt an Active Directory login.
Confirm that runtime.log no longer shows authentication errors.
For more information on cluster management, see the Aria Operations for Logs Administration Guide.
If the steps above do not resolve your issue, try this KB: Aria Operations for Logs Active Directory Integration Fails on Port 636 Due to SSL Certificate Issue