In a Vertica cluster of three or more nodes, one node fails to start even though the other nodes are up and running.
All supported versions of DX NetOps Performance Management
The Vertica log is located at:
<CATALOG-PATH>/drdata/v_drdata_node<NUMBER>_catalog/vertica.log
The following appears at the end of the log:
2023-04-18 07:35:44.160 Main:0x7f7a2245f600-fff0000000000cc1 <PANIC> @v_drdata_nodeXXX: VX001/2973: Data consistency problems found; startup aborted
HINT: Check that all file systems are properly mounted. Also, the --force option can be used to delete corrupted data and recover from the cluster
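To confirm that a node is failing for this reason, the end of its vertica.log can be searched for the PANIC message shown above. The following is a minimal sketch only: the function name `has_consistency_panic` and the 200-line window are assumptions made for illustration, and the log path must be replaced with the real catalog path of the affected node.

```shell
#!/bin/sh
# Hypothetical helper (not part of Vertica): succeeds when the last lines of
# the given log contain the "Data consistency problems found" PANIC message.
# The 200-line window is an arbitrary assumption.
has_consistency_panic() {
    log_file="$1"
    tail -n 200 "$log_file" |
        grep -q 'Data consistency problems found; startup aborted'
}

# Usage (placeholders must be replaced with real values):
# has_consistency_panic "<CATALOG-PATH>/drdata/v_drdata_node<NUMBER>_catalog/vertica.log" \
#     && echo "consistency PANIC found in vertica.log"
```

The function only reports whether the message is present; it does not change anything on the node.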
As the database administrator user (default: dradmin), run the following on the Data Repository (DR):
cd /opt/vertica/bin
./admintools -t restart_node --host=<NODE_IP_ADDRESS> -d <DB_NAME> --force
Example:
./admintools -t restart_node --host=xxx.xxx.xxx.xxx -d drdata --force
Info: no password specified, using none
*** Restarting nodes for database drdata ***
Restarting host [xxx.xxx.xxx.xxx] with catalog [v_drdata_node0003_catalog]
Issuing multi-node restart
Starting nodes:
v_drdata_node0003 (xxx.xxx.xxx.xxx)
Starting Vertica on all nodes. Please wait, databases with a large catalog may take a while to initialize.
Node Status: v_drdata_node0003: (DOWN)
Node Status: v_drdata_node0003: (DOWN)
Node Status: v_drdata_node0003: (DOWN)
Node Status: v_drdata_node0003: (DOWN)
Node Status: v_drdata_node0003: (DOWN)
Node Status: v_drdata_node0003: (DOWN)
Node Status: v_drdata_node0003: (RECOVERING)
Node Status: v_drdata_node0003: (RECOVERING)
Node Status: v_drdata_node0003: (UP)
Verify the state of all nodes:
./admintools -t list_allnodes
Node              | Host            | State | Version          | DB
------------------+-----------------+-------+------------------+--------
v_drdata_node0001 | xxx.xxx.xxx.xxx | UP    | vertica-10.1.1.0 | drdata
v_drdata_node0002 | xxx.xxx.xxx.xxx | UP    | vertica-10.1.1.0 | drdata
v_drdata_node0003 | xxx.xxx.xxx.xxx | UP    | vertica-10.1.1.0 | drdata
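When the check is scripted, the list_allnodes output can be filtered for nodes that are not UP. This is a rough sketch under the assumption that each node is printed on a single pipe-separated row (Node, Host, State, Version, DB); the function name `nodes_not_up` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: reads list_allnodes output on stdin and prints the
# name of every node whose State column is anything other than UP.
# Assumes one pipe-separated row per node; header and separator rows are skipped.
nodes_not_up() {
    awk -F'|' '
        NF >= 5 && $1 !~ /^[ \t]*Node[ \t]*$/ {
            node = $1; state = $3
            gsub(/[ \t]/, "", node)
            gsub(/[ \t]/, "", state)
            if (state != "UP") print node
        }'
}

# Usage, run from /opt/vertica/bin as the database administrator user:
# ./admintools -t list_allnodes | nodes_not_up
```

Empty output means every listed node reported UP.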
If the node still does not start, there may be disk partition issues (check the vertica.log mentioned above), which must be resolved before restarting the node.
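As a first look at the file systems that the log hint refers to, the mount point and usage of the file system holding a given directory can be printed with the standard `df` utility. This is only a sketch: the function name `show_mount` is hypothetical, and which directories to check (for example the catalog and data paths) depends on the installation.

```shell
#!/bin/sh
# Hypothetical helper: prints "<mount point> <use%>" for the file system
# that holds the given path, using POSIX-format df output.
# Assumes the file system name contains no spaces.
show_mount() {
    df -P "$1" | awk 'NR == 2 { print $6, $5 }'
}

# Usage (replace the placeholder with the real catalog path):
# show_mount "<CATALOG-PATH>"
```

If the reported mount point is not the one expected for that directory, the file system is likely not mounted where Vertica expects it.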