This article provides a list of steps to validate the health for the following appliances:
If you are deploying new instances of the previously mentioned products, follow the article Troubleshooting VMware Aria Automation cloud proxies and On-Premises appliance deployments.
kubectl get nodes
kubectl get pod -n prelude
vracli service status
vracli status first-boot
/opt/health/run.sh
vracli disk-mgr
df -hi
vracli cluster exec -- bash -c 'current_node; vracli disk-mgr; exit 0'
find ID1 -size +100M -exec du -h {} \; | less
Where ID1 is the name of the partition.
For Example:
find / -size +100M -exec du -h {} \; | less
find /data-size +100M -exec du -h {} \; | less
vracli dev psql Type: yes SELECT pg_database.datname as "database_name", pg_database_size(pg_database.datname)/1024/1024 AS size_in_mb FROM pg_database ORDER by size_in_mb DESC;
Once you have identified the largest tables, isolate the tables in that database consuming the largest amount of space by running the following command replacing ID2 with the database name:
\c ID2
SELECT nspname || '.' || relname AS "relation", pg_size_pretty(pg_total_relation_size(C.oid)) AS "total_size" FROM pg_class C LEFT JOIN pg_namespace N ON (N.oid = C.relnamespace) WHERE nspname NOT IN ('pg_catalog', 'information_schema') AND C.relkind <> 'i' AND nspname !~ '^pg_toast' ORDER BY pg_total_relation_size(C.oid) DESC LIMIT 20;
Articles related to database growth issues:
nslookup $( iface-ip eth0)
nslookup $( uname -n)
Run the following commands on each appliance in the cluster:
/usr/bin/dig +noall +answer +nocookie -x $( iface-ip eth0 )
/usr/bin/dig +noall +answer +noedns -x $( iface-ip eth0 )
/usr/bin/dig +noall +answer -x $( iface-ip eth0 )
A helpful health validation set of tools may be found in the Automation Orchestrator Control Center.
vracli ntp show-config
vracli ntp status
Validating the product license:
vracli license
vracli version
See Build numbers and versions for VMware Aria Automation (formerly VMware vRealize Automation) for additional information.