We check the RMMSU display for the TEST and PROD LPARs.
From the RMMSU screen, we run the command: "BACKWARD 01:00:00 SYSID PROD".
Recently we started getting errors when trying to retrieve that information:
GSV2767E RMF ERB3XDRS PROD 20221224-040000 failed. Timeout - No data returned
RMF$022W A timeout occurred when gathering RMF data
RMF$018E SysID PROD not available
It doesn't happen all the time, and we cannot predict when it will happen again.
SYSVIEW 16.0 and 17.0 on supported z/OS releases
To obtain the RMMSU data, SYSVIEW calls the RMF III sysplex data retrieval service ERB3XDRS.
Parameters passed on the call to RMF include the SMF id requested as well as a timeout value.
The timeout value comes from the RMFTIMEOUT setting in your SYSVIEW user profile.
Based on the timestamps in the provided LISTLOG snippet, the timeout looks to be 20 seconds for that user.
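The arithmetic behind that estimate can be sketched as follows. This is only an illustration: the two timestamps are hypothetical placeholders, not values from the actual LISTLOG, and the `HH:MM:SS` format is an assumption about how the log times are read off.

```python
from datetime import datetime

def elapsed_seconds(start: str, end: str, fmt: str = "%H:%M:%S") -> float:
    """Return the seconds between two same-day log timestamps."""
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    return delta.total_seconds()

# Hypothetical values: when the request was issued and when the
# timeout message was logged. Substitute the times from your LISTLOG.
request_issued = "04:59:40"
timeout_logged = "05:00:00"

print(elapsed_seconds(request_issued, timeout_logged))
```

If the gap between the request and the timeout message is consistently the same, that gap is a reasonable estimate of the RMFTIMEOUT value in effect for the user.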
The IBM documentation for this parameter:
time_out
Specifies a time interval in seconds. If this time interval expires during the processing of the service, RMF returns to the caller with a corresponding return and reason code and partial data.
Based on these messages in LISTLOG, ERB3XDRS returned RC=12 RS=30:
GSV2767E RMF ERB3XDRS PROD 20221224-040000 failed. Timeout - No data returned
RMF$022W A timeout occurred when gathering RMF data
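To track how often the timeout occurs, the GSV2767E lines could be picked out of saved LISTLOG text with a small script. The sketch below is a rough illustration only: the field layout is inferred from the single sample message above, and the function names are hypothetical, not part of SYSVIEW.

```python
import re
from typing import Optional

# Layout assumed from the one sample line:
# GSV2767E RMF <service> <sysid> <yyyymmdd-hhmmss> failed. <reason>
GSV2767E = re.compile(
    r"^GSV2767E RMF (?P<service>\S+) (?P<sysid>\S+) "
    r"(?P<stamp>\d{8}-\d{6}) failed\. (?P<reason>.+)$"
)

def parse_gsv2767e(line: str) -> Optional[dict]:
    """Split a GSV2767E message into its fields, or return None."""
    match = GSV2767E.match(line.strip())
    return match.groupdict() if match else None

def is_timeout(line: str) -> bool:
    """True when the line is a GSV2767E message whose reason is a timeout."""
    fields = parse_gsv2767e(line)
    return fields is not None and fields["reason"].startswith("Timeout")

sample = ("GSV2767E RMF ERB3XDRS PROD 20221224-040000 failed. "
          "Timeout - No data returned")
print(parse_gsv2767e(sample))
```

Counting the timeout lines per SYSID and time of day may help show whether the failures line up with periods of heavy workload.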
It is not clear what issue, if any, the IBM RMF code may be having. If this happens only sporadically, it could be that the invoking batch job is not getting enough CPU service because of other workload running at that time.
Increase the RMFTIMEOUT profile setting for the SYSVIEW userid that the REXX API runs under.
The following commands can be issued to change that profile; alternatively, add a 'SET RMFTIMEOUT nnn' command to the REXX exec:
PROFILE CHANGE userid
SET RMFTIMEOUT nnn
Then press PF3 until you exit the PROFILE command.
The IBM documentation can be found there: