Infrastructure service status unknown on DPU alarm
search cancel

Infrastructure service status unknown on DPU alarm

book

Article ID: 330580

calendar_today

Updated On:

Products

VMware NSX

Issue/Introduction

Title: Alarm for infrastructure_service.service_status_unknown_on_dpu
Event ID: infrastructure_service.service_status_unknown_on_dpu

Alarm Description:

  • Purpose: Check if the status of the monitored service is normal.
  • Impact: The monitored service might have a heavy workload or is blocked.

Environment

VMware NSX

Resolution

Steps to resolve
For 4.0.0 and higher

Recommended Action:

  • Login the DPU as root:
    • Find out the ID of the DPU according to the DPU UUID:
      • In the NSX UI, navigate to "System | Fabric | Hosts | Clusters" or "System | Fabric | Hosts | Other Nodes"
      • Anchor the transport node by the "Reported by Node" in alarm, then go to "View Details | Monitor | System Usage"
      • Check the DPU under System Usage one by one to find the one reporting the alarm by checking its UUID
    • Login ESXi node(Reported by Node) at first:
      • ssh root {Reported by Node}
    • Jump to ESXio with DPU ID specified:
      • vim-cmd combinersvc/dpu_services/start TSM-SSH {DPU ID}
      • sshdpu -d {DPU ID}
  • Refer to Service Status Unknown seen on NSX-T UI alarm to resolve this alarm.

Maintenance window required for remediation? No