search cancel

NAS 9.34 crashes, restarts and throws Max. restarts error in log, unstable nas probe

book

Article ID: 243433

calendar_today

Updated On:

Products

DX Unified Infrastructure Management (Nimsoft / UIM) CA Unified Infrastructure Management On-Premise (Nimsoft / UIM) CA Unified Infrastructure Management SaaS (Nimsoft / UIM) Unified Infrastructure Management for Mainframe

Issue/Introduction

I thought the nas was stable now, but apparently not. Today, i see it has stopped running again and is in error state. nas restarts and throws Max.restarts error in log. i will add the log that i have and the config, this really needs to get sorted out, we need a stable monitoring platform.

Jun  8 10:42:49:291 [21564] 0 nas: ****************[ Starting ]****************
Jun  8 10:42:49:291 [21564] 0 nas: nas 9.34, Nov 28 2021
Jun  8 10:42:49:291 [21564] 0 nas: Copyright  2013, CA. All rights reserved.
Jun  8 10:42:50:293 [21564] 1 nas: nimNamedSession: failed to connect session to 10.x.x.xxx:48033 10061
Jun  8 10:42:50:299 [21564] 1 nas: port=48033 PID=29752
Jun  8 10:42:50:311 [21564] 0 nas: No longer checking for restricted hub license
Jun  8 10:42:50:311 [21564] 0 nas: NAS Services called using mode: 1
Jun  8 10:42:50:312 [21564] 0 nas: Scripting license is available.
Jun  8 10:42:50:313 [21564] 1 nas: passive maint thread - has started in thread 2204
Jun  8 10:42:50:316 [21564] 0 nas: Failed to read a valid probe_crypto_mode from controller. Assuming pre-FIPS and using TWO_FISH
Jun  8 10:42:50:317 [2204] 0 nas: maint:  Successful registration to: /nnnnnnn/PIx/nnnnnP03/maintenance_mode
Jun  8 10:42:50:445 [21564] 0 nas: corrInitialize: Last Alive Time: 1654677748
Jun  8 10:42:50:446 [29780] 1 nas: Transaction-log database housekeeping scheduled to Thu Jun 09 00:30, 2022
Jun  8 10:42:50:901 [18120] 1 nas: Activity-log administration used 407ms, status: OK
Jun  8 10:42:51:034 [9552] Controller: Max. restarts reached for probe 'nas' (command = nas.exe)

Cause

- nas 9.34 defect

Environment

  • Release : DX UIM 20.4
  • Component : UIM NAS
  • nas 9.34

Resolution

There was an issue with nas v9.34 which has been fixed in HF1.

You can download the hotfix here, and then apply it to the Primary hub.

https://support.broadcom.com/web/ecx/support-content-notification/-/external/content/release-announcements/CA-Unified-Infrastructure-Management-Hotfix-Index/7233?r=2&r=1

Direct Link for download:

nas_9.34_HF1.zip

Fixed Defects:

DE524288: 32961987,32978659,32979741: nas 9.34 crash on ntdll with custom script
DE526170: 32990760 - Merge/IBM - defect in nas.cfx subsystems section ('override' from cfx is removed, this can be seen in fresh installation only)
 
Steps to resolve:
 
The nas probe would not successfully deactivate so we had to restart it and then quickly deactivate it.

To workaround the issue we followed the steps listed below:

  1. Restart nas
  2. Deactivate nas
  3. Deactivate alarm_enrichment if still running
  4. Rt-click and delete nas and alarm_enrichment
  5. Delete the nas folder from the file system (we noticed 1 LUA script was running so we had to skip it). Save the nas.cf and scripts if necessary.
  6. Drop the following nas tables via MS SQL Server studio or another DB studio/tool:
DROP TABLE NAS_VERSION
DROP TABLE NAS_ALARMS
DROP TABLE NAS_TRANSACTION_SUMMARY
DROP TABLE NAS_TRANSACTION_LOG
DROP TABLE NAS_NOTES
DROP TABLE NAS_ALARM_NOTE
 
     7. Deploy nas_9.34_HF1
     8. Activate alarm_enrichment
     9. Activate nas
   10. Check the nas.log and it should start up with no issues/errors and successfully connect to the hub.

Additional Information

Sign up for Proactive notifications here:
Sign up for Proactive Notifications to receive emails regarding important notifications, updates and release information regarding your Broadcom Software.