/storage/seat disk 100% full on vCenter Server Appliance 6.x/7.x
search cancel

/storage/seat disk 100% full on vCenter Server Appliance 6.x/7.x

book

Article ID: 318931

calendar_today

Updated On:

Products

VMware vCenter Server

Issue/Introduction

Symptoms:
  • The vCenter Server Appliance services fail to start.
  • When connecting to the VCSA using the vSphere Web Client, there may be an error similar to:

    503 Service Unavailable
     
  • Running df -h on the vCenter Server Appliance shows the /dev/mapper/seat_vg-seat mounted on /storage/seat as 95% or more full.
  • In the /var/log/vmware/vpxd/vpxd.log file, see entries similar to:
    <YYYY-MM-DD>T<TIME> warning vpxd [7F447FF7E700] [Originator@6876 sub=Default opID=StatsTruncateExpiredPartitions-7460ee5a]
    [VdbStatement] Connection diagnostic data from driver is HY000:0:110:
    <YYYY-MM-DD>T<TIME> error vpxd[7F44865F3700] [Originator@6876 sub=Default] [VdbStatement] Execute result code: -1
    <YYYY-MM-DD>T<TIME> warning vpxd[7F44865F3700] [Originator@6876 sub=Default] [VdbStatement] SQL execution failed: INSERT INTO
    VPX EVENT (EVENT_ID, CHAIN_ID, EVENT_TYPE, EXTENDED_CLASS, CREATE_TIME, USERNAME, CATEGORY, VM_ID, VM_NAME, HOST_ID, HOST_NAME,
    COMPUTERESOURCE_ID, COMPUTERESOURCE TYPE, COMPUTERESOURCE NAME, DATACENTER_ID, DATACENTER NAME, DATASTORE_ID, DATASTORE NAME,
    NETWORK_ID, NETWORK_NAME, NETWORK_TYPE, DVS_ID, DVS_NAME, STORAGEPOD_ID, STORAGEPOD_NAME, CHANGE_TAG_ID) VALUES (?, ?, ?, ?, ?,
    2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, ?, ?, ?, ?)
    <YYYY-MM-DD>T<TIME> warning vpxd[7F44865F3700] [Originator@6876 sub=Default] [VdbStatement] Execution elapsed time: 3 ms
    <YYYY-MM-DD>T<TIME> warning vpxd[7F44865F3700] [Originator@6876 sub=Default] [VdbStatement] Statement diagnostic data from
    driver is 53100:0:7:ERROR: could not extend file "pg_tblspc/16396/PG_9.3_201306121/16384/16641": No space left on device;
    -- > Error while executing the query
  • In the /var/log/vmware/vpxd.log file of the vCenter Server Appliance 6.7.x, entries will be similar to:
    yyyy-mm-ddThh:mm:ss info vpxd[24517] [Originator@6876 sub=vpxdVdb] WarningThreshold: 80% ErrorThreshold: 95%.
    yyyy-mm-ddThh:mm:ss error vpxd[24517] [Originator@6876 sub=vpxdVdb] Insufficient free space for the Database (used: 96%; threshold: 95%)
    yyyy-mm-ddThh:mm:ss info vpxd[07808] [Originator@6876 sub=vpxdvpxdSignal] Signal 15 received, exiting
    yyyy-mm-ddThh:mm:ss info vpxd[07808] [Originator@6876 sub=Default] Initiating VMware VirtualCenter shutdown
    yyyy-mm-ddThh:mm:ss info vpxd[07711] [Originator@6876 sub=Default] Shutting down VMware VirtualCenter
  • In the /var/log/vmware/vpxd.log file of the vCenter Server 7.x, entries will be similar to:
    yyyy-mm-ddThh:mm:ss info vpxd[18939] [Originator@6876 sub=vpxdVdb] WarningThreshold: 80% ErrorThreshold: 95%.
    yyyy-mm-ddThh:mm:ss error vpxd[18939] [Originator@6876 sub=vpxdVdb] Space used on storage partition of Database exceeds
    threshold (used: 95%; threshold: 95%). Service-control request will stop vpxd
    yyyy-mm-ddThh:mm:ss info vpxd[18917] [Originator@6876 sub=vpxdvpxdSignal] Signal 15 received, exiting
    yyyy-mm-ddThh:mm:ss info vpxd[18917] [Originator@6876 sub=Default] Initiating VMware VirtualCenter shutdown
    yyyy-mm-ddThh:mm:ss info vpxd[18442] [Originator@6876 sub=Default] Shutting down VMware VirtualCenter


Note: The preceding log excerpts are only examples. Date, time, and environmental variables may vary depending on the environment.

Environment

VMware vCenter Server Appliance 6.7.x
VMware vCenter Server Appliance 6.0.x
VMware vCenter Server Appliance 6.5.x
VMware vCenter Server 7.0.x

Cause

This issue occurs due to a large amount of events collected on the vCenter Server Appliance filling the database. By default, when the space is 95% or more full the critical vmware-vpxd service will not be allowed to run to prevent the database from becoming corrupted.

The most frequent cause of this happening is Excessive Hardware health alarms being triggered for "Sensor -1 type" on ESXi hosts running vSphere 6.7 U3.

Resolution

To resolve this issue, you need to find which ESXi host is causing the events table to be filled. After this is done, the events tables must be truncated.

  • To find the ESXi host that is generating the events:
    1. Take a backup of the VCDB, see Back up and restore vCenter Server Appliance/vCenter Server 6.x vPostgres database
    2. Take a snapshot of the vCenter Server Appliance.
    3. Connect to vCenter Server Appliance through the console or using an SSH session and root credentials.
    4. Enable the shell by running this command:
      shell.set --enabled true
    5. Type shell and press Enter.
    6. Stop the vpxd service and content library service by running: 
      service-control --stop vmware-vpxd, service-control --stop vmware-content-library
    7. Run this command to log in to the vCenter Server Appliance database:
      /opt/vmware/vpostgres/current/bin/psql -d VCDB -U postgres
    8. Run this query to determine the source of the events:
      vCenter Server Appliance 6.0
      SELECT COUNT(EVENT_ID) AS NUMEVENTS, EVENT_TYPE, USERNAME FROM VPX_EVENT GROUP BY EVENT_TYPE, USERNAME ORDER BY NUMEVENTS DESC LIMIT 10;

      vCenter Server Appliance 6.5/6.7/7.0

      SELECT COUNT(EVENT_ID) AS NUMEVENTS, EVENT_TYPE, USERNAME FROM VPXV_EVENT_ALL GROUP BY EVENT_TYPE, USERNAME ORDER BY NUMEVENTS DESC LIMIT 10;


      Sample output

      vmfs.heartbeat.timedout |192.0.2.1 | 12191576
      vim.event.UserLogoutSessionEvent |192.0.2.1 | 1219121
      vim.event.VmAcquiredTicketEvent |192.0.2.1 | 15568
    9. To reclaim space type \q come out of vCenter Server Appliance database and run the command below
      su -c "/opt/vmware/vpostgres/current/bin/vacuumdb -d VCDB -e -v -f -U postgres > /tmp/vacuumdb.log"
      • Note: If you are logged in as Root user, "su -c" is not required.
    10. In the above example we see 12191576 events generated from the ESXi host with the IP of 192.0.2.1 related to: vmfs.heartbeat.timedout.
    11. This host must be further investigated to resolve the issue that is being reported.
  • To clean up the events in the vCenter Server Appliance database, take a snapshot of the VCSA VM, then follow the below steps.
    1. From the console or SSH session connected to vCenter Server Appliance, run these commands to truncate the event table data
      vCenter Server Appliance 6.0:
      truncate table vpx_event cascade;
      truncate table vpx_event_arg cascade;

      vCenter Server Appliance 6.5/6.7/7.0:

      VCDB# SELECT nspname || '.' || relname AS "relation", pg_size_pretty(pg_total_relation_size(C.oid)) AS "total_size" FROM pg_class C LEFT JOIN pg_namespace N ON (N.oid = C.relnamespace) WHERE nspname NOT IN ('pg_catalog', 'information_schema') AND C.relkind <> 'i' AND nspname !~ '^pg_toast' ORDER BY pg_total_relation_size(C.oid) DESC LIMIT 20;

      This will display the top 20 largest tables within the vCenter Server database. Truncate large tables individually.
      Sample output:

      VCDB=# truncate table vc.vpx_event_1 cascade;

      Note: The Solution on this KB is truncating only vc.vpx_event_x tables.

    2. Exit the vCenter Server Appliance database by running this command:
       \q
    3. Start the vpxd and content library service by running the command:
      service-control --start vmware-vpxd, service-control --start vmware-content-library
      
    4. Verify the space is reclaimed by running the df -h command.

      You see output similar to


 

Additional Information