NSX Manager and Edge nodes report crash with coredump generated in services like top and nvpapi.py.
search cancel

NSX Manager and Edge nodes report crash with coredump generated in services like top and nvpapi.py.

book

Article ID: 368129

calendar_today

Updated On:

Products

VMware NSX

Issue/Introduction

  • NSX Manager node reports below alarm for crash of the toplogger module:



  • No functional impact is observed, the service will auto-restart.
  • The core dump will be located in the folder /var/log/core/ and the file name will be similar to core.toplogger-cpu.xxxxxx.
  • In NSX Manager /var/log/kern.log, we see entries similar to below:

    2024-04-01T20:51:01.544Z nsx_mgr_hostname kernel - - - [2956827.155283] signal_fault: 13 callbacks suppressed
    2024-04-01T20:51:01.578Z nsx_mgr_hostname kernel - - - [2956827.155306] toplogger-cpu[4094496] bad frame in rt_sigreturn frame:000077#####f3fb8 ip:761######30b sp:77a######ca0 orax:ffffffffffffffff in libc-2.31.so[768#####0000+###000]
    2024-04-01T20:51:01.583Z nsx_mgr_hostname kernel - - - [2956827.155392] grsec: Segmentation fault occurred at 0000000000000000 in /etc/cron.minutes/toplogger-cpu[toplogger-cpu:4094496] uid/euid:0/0 gid/egid:0/0, parent /usr/bin/run-parts[run-parts:4094466] uid/euid:0/0 gid/egid:0/0

Environment

VMware NSX 4.x

Cause

The python-gevent process crashes and generates a core-dump which, can potentially impact any python based NSX services.

Resolution

This issue is resolved in VMware NSX 4.1.1, available at Broadcom downloads. If you are having difficulty finding and downloading software, please review the Download Broadcom products and software KB.

For steps to clear the alarm via removal of the core dump please see the KB -  Application on NSX node has crashed alarm.