PostgreSQL routinely causes CPU spikes requiring a reboot to resolve in VMware Cloud Director
search cancel

PostgreSQL routinely causes CPU spikes requiring a reboot to resolve in VMware Cloud Director

book

Article ID: 388803

calendar_today

Updated On:

Products

VMware Cloud Director

Issue/Introduction

Over time the CPU usage of a VMware Cloud Director primary database cell will reach >90%. This will be seen with errors in the logs such as:
2025-01-16 16:42:26,932 | ERROR    | stats-activity-pool-1996  | CustomCassandraStatsReceiver   | Error processing entity stats for vc:<vCenter UUID> | activity=(com.vmware.vcloud.metrics.sta
tsfeeder.StatsFeederCollectorActivity,urn:uuid:<Activity UUID>)
java.lang.NullPointerException
        at com.vmware.vcloud.statsfeeder.cassandra.CustomCassandraStatsReceiver.receiveStats(CustomCassandraStatsReceiver.java:60)
        at com.vmware.vcloud.metrics.statsfeeder.StatsFeederCollectorActivity$StatsFeederCollectorPhase.invoke(StatsFeederCollectorActivity.java:197)
        at com.vmware.vcloud.activity.executors.ActivityRunner.runPhase(ActivityRunner.java:175)
        at com.vmware.vcloud.activity.executors.ActivityRunner.run(ActivityRunner.java:112)
        at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
        at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
        at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
        at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
        at java.base/java.lang.Thread.run(Thread.java:829)

Environment

VMware Cloud Director 10.x

Cause

The Cassandra DB integration with VMware Cloud Director is causing PostgreSQL queries to hang. This is causing PostgreSQL itself to consume more resources on the VM. 

Resolution

Remove the integration with Cassandra DB. On the cells run:
/opt/vmware/vcloud-director/bin/cell-management-tool cassandra --clean