CPU starvation error in CA Directory Logs
search cancel

CPU starvation error in CA Directory Logs

book

Article ID: 136444

calendar_today

Updated On:

Products

CA Single Sign On Secure Proxy Server (SiteMinder) CA Single Sign On Agents (SiteMinder) CA Single Sign On Federation (SiteMinder) CA Single Sign On SOA Security Manager (SiteMinder) CA Single Sign-On SITEMINDER CA Directory

Issue/Introduction

 

When running CA Directory with Policy Server, CA Directory gets hung
and doesn't respond anymore, and the CA Directory logs shows errors :

  [0] 20190722.133546.086 DSA_E3730 CPU Seconds 45 has fallen below
  threshold 55, CPU starvation detected

  [0] 20190722.134151.917 DSA_E3740 Stats log entry overdue by 771
  seconds has exceeded threshold 5, CPU starvation detected

  [...]

  [0] 20190722.142923.182 DSA_E3740 Stats log entry overdue by 335
  seconds has exceeded threshold 5, CPU starvation detected

  [9] 20190722.142923.191 DSA_I2695 Multiwrite-DISP: Update from
  'server6' applied

  [2] 20190722.142930.193 DSA_I2695 Multiwrite-DISP: Update from
  'server7' applied

  [6] 20190722.143031.960 DSA_E2735 Multiwrite-DISP: Unable to
  synchronize with peer 'server3'

  [4] 20190722.143031.960 DSA_E2735 Multiwrite-DISP: Unable to
  synchronize with peer 'server3'

  [...]


  [0] 20190722.152622.064 DSA_E2735 Multiwrite-DISP: Unable to
  synchronize with peer 'server3'

  [0] 20190722.152622.064 DSA_E2735 Multiwrite-DISP: Unable to
  synchronize with peer 'server3'

  [0] 20190722.153632.732 DSA_E2735 Multiwrite-DISP: Unable to
  synchronize with peer 'server3'

  [0] 20190722.153632.732 DSA_E2735 Multiwrite-DISP: Unable to
  synchronize with peer 'server3'

  [...]

  [0] 20190722.161751.631 DSA_E3730 CPU Seconds 6 has fallen below
  threshold 55, CPU starvation detected

  [0] 20190722.161751.632 DSA_E3740 Stats log entry overdue by 1024
  seconds has exceeded threshold 5, CPU starvation detected

 

Environment

 

Policy Server 12.8;
CA Directory 14.0;

 

Cause

 

One of the server reaches credit limit and as such the other lost
connection and synchronization with it :

server4_warn.log

  [0] 20190722.133546.086 WARN : MW-DISP not in sync for 'server3'
  [0] 20190722.133546.087 WARN : Attempting to send update to peer 'server3'
  [0] 20190722.133546.087 WARN : MW-DISP not in sync for 'server3'
  [0] 20190722.133546.087 WARN : Attempting to send update to peer 'server3'
  [6] 20190722.133546.087 WARN : Remote DSA 'server6' aborted
  [5] 20190722.133546.087 WARN : Remote DSA 'server7' aborted
  [2] 20190722.133546.087 WARN : Remote DSA 'server7' aborted
  [0] 20190722.134740.459 WARN : cid 5945 has been reused as cid 5946
  [0] 20190722.134740.459 WARN : MW-DISP not in sync for 'server3'
  [0] 20190722.134740.459 WARN : Attempting to send update to peer 'server3'
  [1] 20190722.134740.459 WARN : Remote DSA 'server6' aborted
  [5] 20190722.134740.460 WARN : Abandoning op 13810736
  [5] 20190722.134740.460 WARN : Op invalid - lost?

server4_warn.log

  [0] 20190722.134740.457 WARN : HTTP connection timed out
  [0] 20190722.140216.266 WARN : cid 3967 has been reused as cid 3968
  [0] 20190722.141446.577 WARN : Idle association 4356 (server4) timed out after 609 seconds
  [0] 20190722.143934.379 WARN : HTTP connection timed out
  [0] 20190722.145837.576 WARN : netcon 0x55a0f1a1dfd8 not found
  [0] 20190722.155947.470 WARN : HTTP connection timed out

server4_warn.log

  [0] 20190722.134740.459 WARN : cid 3843 has been reused as cid 3844
  [0] 20190722.134740.459 WARN : Remote DSA 'server4' aborted
  [0] 20190722.134740.459 WARN : Marking DSA 'server4' as down
  [0] 20190722.134740.459 WARN : Remote DSA 'server4' aborted
  [0] 20190722.134740.459 WARN : Marking DSA 'server4' as down
  [0] 20190722.134740.463 WARN : Remote DSA 'server4' aborted
  [0] 20190722.134740.495 WARN : Marking DSA 'server4' as down

  [...]

[8] 20190722.140613.110 WARN : Credit limit reached #1936 "cn=<Admin>,ou=siteminder,o=mycompany,c=com" 10.0.0.151:11111
  [0] 20190722.141036.913 WARN : userOpTimedOut 1936/40
  [0] 20190722.141036.913 WARN : userOpTimedOut 1936/41

  [...]

[1] 20190722.142956.036 WARN : Credit limit reached #2003 "cn=<Admin>,ou=siteminder,o=mycompany,c=com" 10.0.0.151:11112

 

Resolution

 

- Increase the "Credit Limit" in order to prevent the issue to be
  reproduce in the future (1).

 

Additional Information

(1)

    Identity Manager and CA Directory Credit Limit
    https://knowledge.broadcom.com/external/article?articleId=27259