Node 1 is stopping every other day.
1 node in vertica cluster is crashing frequently.
The logs don’t seem to expose any errors.
Restart the node with adminTools fixes the problem for a short time.
Release : 22.2
Soft lockup messages are a kernel issue.
Please contact the system admin for guidance.
We found soft lockup errors in
/var/log/dmesg
bash-4.1$ grep "soft lockup" dmesg
And the problem went away on its own.
This has come up several times across many customers, and it always points back to a problem with the system host or the esxi host.