'Stats primary election' and 'Hosts with connectivity issues' vSAN Skyline Health alert triggers frequently and auto resolves
search cancel

'Stats primary election' and 'Hosts with connectivity issues' vSAN Skyline Health alert triggers frequently and auto resolves

book

Article ID: 387278

calendar_today

Updated On:

Products

VMware vSAN

Issue/Introduction

Symptoms:

Environment

Vmware Aria Operations 8.16.x

Vmware Aria Operations 8.17.x

VMware vSAN 8.x

Cause

  • The 'Stats primary election' alert is triggered because the vsanmgmtd service on the vSAN hosts go to a not responding state frequently.

  • This is due to continuous API calls with unsupported parameters, which run into an error while fetching the details being made on the service, thereby overloading it.

  • From the ESXi host's /var/run/log/vsanmgmt.log the unsupported parameter call can be seen:

    YYYY-MM-DDTHH:MM:SS.SSSZ error vsand[2110597] [opID=########-#### statsdb::Run] When run command execute for mode normalMode, met exception in DB thread data processing: no such column: throughputDevRead, out is no such column: throughputDevRead Traceback (most recent call last):   File "/usr/lib/vmware/vsan/perfsvc/statsdb.py", line 4535, in Run sqlite3.OperationalError: no such column: throughputDevRead

  • These API calls are coming from VMware Aria Operations.

  • Due to these calls the hostd service on the host also goes down causing the 'Hosts with connectivity issues' alert.

  • This is a known issue found in VMware Aria Operations 8.16 (prior to Hot Fix 2) & VMware Aria Operations 8.17 (prior to Hot Fix 1) where it queries the vSAN hosts with an unsupported parameter.

Resolution

To resolve this issue:

  1. Upgrade VMware Aria Operations to either 8.16 Hot Fix 2 or 8.17 Hot Fix 1.

  2. Delete and re-create the vsan stats object- SAN Health Service - Performance Service - Stats master election check.