Lag Member Down on DPU seen on NSX-T UI Alarm


Article ID: 330468



Products

VMware NSX

Issue/Introduction

Title: Alarm for transport_node_health.lag_member_down_on_dpu
Event ID: transport_node_health.lag_member_down_on_dpu


Alarm Description:

  • Purpose: Monitors LAG member status on the ESXi host.
  • Impact: A LAG member that is down affects network connectivity and traffic flow.

 



Environment

VMware NSX

Resolution

Steps to Resolve

For NSX 4.0.0 and higher

Recommended Action: 

Log in to the DPU as root:

  • Find the ID of the DPU from the DPU UUID:
    • In the NSX UI, navigate to "System | Fabric | Hosts | Clusters" or "System | Fabric | Hosts | Other Nodes"
    • Locate the transport node named in the alarm's "Reported by Node" field, then go to "View Details | Monitor | System Usage"
    • Check each DPU listed under System Usage and match its UUID to find the one reporting the alarm
  • First, log in to the ESXi node (Reported by Node):
    • ssh root@{Reported by Node}
  • From the ESXi host, start the SSH service on the DPU and connect to it using the DPU ID:
    • vim-cmd combinersvc/dpu_services/start TSM-SSH {DPU ID}
    • sshdpu -d {DPU ID}
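
The login sequence above can be sketched as a small dry-run helper that prints the three commands for a given host and DPU ID so they can be reviewed before being run by hand. This is only an illustration: the function name print_dpu_login_steps is hypothetical, it assumes the usual ssh user@host form for the first step, and it executes nothing itself.

```shell
#!/bin/sh
# Hypothetical dry-run helper (not part of NSX or ESXi): prints the commands
# from the steps above with the placeholders filled in. Nothing is executed.
print_dpu_login_steps() {
    esxi_host="$1"   # the "Reported by Node" value from the alarm
    dpu_id="$2"      # the DPU ID matched by UUID under System Usage

    if [ -z "$esxi_host" ] || [ -z "$dpu_id" ]; then
        echo "usage: print_dpu_login_steps <esxi-host> <dpu-id>" >&2
        return 1
    fi

    # Step 1: run from your workstation to reach the ESXi host.
    echo "ssh root@${esxi_host}"
    # Step 2: run on the ESXi host to start the SSH service on the DPU.
    echo "vim-cmd combinersvc/dpu_services/start TSM-SSH ${dpu_id}"
    # Step 3: run on the ESXi host to open a shell on the DPU.
    echo "sshdpu -d ${dpu_id}"
}

# Example call (both arguments are placeholder values):
#   print_dpu_login_steps esx01.example.com vmdpu0
```

Printing rather than executing keeps the sketch safe to try anywhere. The second and third commands must still be run on the ESXi host itself, after the first has connected.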

To resolve this alarm, refer to the ESXi part of the article "Alarm for LACP reporting member down".

Maintenance window required for remediation? No

Additional Information

This alarm requires VMware ESXi 8 or higher with a DPU installed.

VMware vSphere Distributed Services Engine and Networking Acceleration by Using DPUs