Internode sessions experiencing communication delays after network issues

Article ID: 86122

Updated On:

Products

CA Automic Dollar Universe

Issue/Introduction

Affects Release version(s): 5

Error Message :

2009-06-14 16:06:56 0005897/uxech /CALL_UXIOSRV /000000000 - u_io_callsrv(on node1,COMPANY1,A) returns error -1
2009-06-14 16:06:56 0005897/uxech /GAI90A32 /134455874 - %UNI_-E-U_EGAI90A3225, Network down, can't set packet N°
2009-06-14 16:06:56 0005897/uxech /u_io_callsrv /000000000 - u_connect error : Errno syserror 239: Connection refused (host [node1])

Patch level detected: Dollar Universe 5.6
Product Version: Dollar.Universe 5.6.0 FX25010

Description: During a prolonged network outage, universe.log is flooded with exchanger error messages. After the network recovers, cross-node sessions start to run again. However, the jobs that should run on remote nodes only start 20 minutes later.
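
To gauge how long the exchanger was failing and which remote hosts were affected, the error lines in universe.log can be summarized with a short script. The following is a minimal sketch, not a vendor tool: the line pattern is inferred only from the three sample lines in this article and may not match other releases or messages, and the function name summarize_exchanger_errors is hypothetical.

```python
import re
from collections import Counter

# Assumption: the layout "<timestamp> <pid>/<process> /<function> /<code> - <message>"
# is inferred from the sample lines in this article only.
LINE_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) "
    r"(?P<pid>\d+)/(?P<proc>\S+)\s*/(?P<func>\S+)\s*/(?P<code>\d+) - (?P<msg>.*)$"
)
# Matches the "(host [node1])" suffix seen in the u_connect error line.
HOST_RE = re.compile(r"\(host \[(?P<host>[^\]]+)\]\)")


def summarize_exchanger_errors(lines):
    """Count exchanger (uxech) lines that name a remote host.

    Returns (per-host counts, first timestamp, last timestamp).
    Timestamps are compared as strings, which is safe for this
    fixed-width "YYYY-MM-DD HH:MM:SS" layout.
    """
    counts = Counter()
    first = last = None
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match or match.group("proc") != "uxech":
            continue
        host = HOST_RE.search(match.group("msg"))
        if not host:
            continue
        counts[host.group("host")] += 1
        ts = match.group("ts")
        first = ts if first is None else min(first, ts)
        last = ts if last is None else max(last, ts)
    return counts, first, last


# The three sample lines quoted in this article.
SAMPLE = [
    "2009-06-14 16:06:56 0005897/uxech /CALL_UXIOSRV /000000000 - "
    "u_io_callsrv(on node1,COMPANY1,A) returns error -1",
    "2009-06-14 16:06:56 0005897/uxech /GAI90A32 /134455874 - "
    "%UNI_-E-U_EGAI90A3225, Network down, can't set packet N°",
    "2009-06-14 16:06:56 0005897/uxech /u_io_callsrv /000000000 - "
    "u_connect error : Errno syserror 239: Connection refused (host [node1])",
]

if __name__ == "__main__":
    counts, first, last = summarize_exchanger_errors(SAMPLE)
    print(counts, first, last)
```

To run it against a real log, pass an open file object for universe.log instead of SAMPLE. The span between the first and last timestamps gives a rough idea of how long requests were piling up.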

Cause

Root Cause: The delay occurs because the exchanger data files are filled with pending requests. During the network outage, DUAS keeps working and keeps trying to send requests to remote nodes. Because these requests cannot be delivered, more and more of them accumulate in the exchanger data files. After the network recovers, this backlog has to be worked through, which delays the remote jobs.
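
The relationship between outage length and recovery delay can be illustrated with a simple backlog estimate. This is only a sketch under stated assumptions: it assumes pending requests are queued at a constant rate during the outage and sent at a constant, higher rate afterwards while new requests keep arriving. The function name and all rates are hypothetical and are not measured Dollar Universe values.

```python
def drain_minutes(outage_minutes, enqueue_per_min, send_per_min):
    """Estimate minutes needed to clear the backlog after the network recovers.

    Assumptions (illustrative only):
      - requests pile up at enqueue_per_min for the whole outage;
      - after recovery they are sent at send_per_min while new
        requests continue to arrive at enqueue_per_min.
    """
    if send_per_min <= enqueue_per_min:
        raise ValueError("backlog never drains unless send rate exceeds enqueue rate")
    backlog = outage_minutes * enqueue_per_min
    # Net progress per minute is the send rate minus the ongoing arrivals.
    return backlog / (send_per_min - enqueue_per_min)


if __name__ == "__main__":
    # Hypothetical figures: 60-minute outage, 10 requests/min queued,
    # 40 requests/min sent after recovery -> 600 / 30 = 20.0 minutes.
    print(drain_minutes(60, 10, 40))
```

Under this model the delay grows linearly with the length of the outage, which is consistent with the delay disappearing on its own once the network stays stable.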

Environment

OS: All

Resolution

This kind of delay should dissipate once the network is stable.

Fix Status: No Fix

Additional Information

Workaround :
N/A