You may see the following error in the messages log after running this command: grep reset /var/log/messages | grep fatal
Jun 6 04:00:01 localhost disk_subsystem: snlog: sn="x.x.x.x" id="DS" m="23" c="6" event="DISK_STATUS" category="HARDWARE" ip="x.x.x.x" model="R620" msg="Adapter 0; seqNum: 0x0005be9b; Time: Sat Jan 1 00:00:02 2000; Event Description: Controller encountered a fatal error and was reset; "
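The same check can be wrapped in a small shell function so that it can be pointed at a different log file when needed. This is only a minimal sketch: the function name find_controller_resets is hypothetical, and the default path is simply the one used in the command above.

```shell
#!/bin/sh
# Hypothetical helper: print log lines that mention both "reset" and "fatal".
# The log path defaults to /var/log/messages, as in the command above.
find_controller_resets() {
    logfile="${1:-/var/log/messages}"
    grep reset "$logfile" | grep fatal
}
```

Calling find_controller_resets with no argument reads /var/log/messages; passing a path checks that file instead.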
The RAID controller firmware is not up to date.
To capture the logs from the RAID controller, run the following as root: megacli -fwtermlog -dsply -a0 -nolog > fwterm.out
Attach the fwterm.out file to the case.
In this case, the fwterm.out log showed:
T1: EVT#376474-T1: 1=Firmware version 3.130.05-2086
01/01/00 0:00:02: EVT#376475-01/01/00 0:00:02: 345=Controller encountered a fatal error and was reset
01/01/00 0:00:02: Initializing the Temperature Monitor
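To pull the two relevant facts (the firmware version and the fatal reset events) out of a saved fwterm.out, something like the following may help. It is a sketch that assumes the log lines follow the format shown in the excerpt above; the function names are hypothetical.

```shell
#!/bin/sh
# Hypothetical helpers for a saved fwterm.out file.
# Assumption: lines are shaped like the excerpt above.

# Print the most recently logged firmware version string.
fw_version() {
    sed -n 's/.*Firmware version \([^ ]*\).*/\1/p' "$1" | tail -n 1
}

# Count "fatal error and was reset" events in the log.
count_fatal_resets() {
    grep -c 'Controller encountered a fatal error and was reset' "$1"
}
```

For example, fw_version fwterm.out prints the version string, and count_fatal_resets fwterm.out prints how many reset events were logged.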
Dell recommends upgrading the firmware for the internal H710 RAID controller. Installing the update takes 5-10 minutes, and the reboot that follows takes about 10 minutes.
Download the PERC RAID controller firmware for your specific system. Look for the Red Hat Linux Update Package (.BIN). For example:
Save the Red Hat Linux Update Package (.BIN) file to /home on the sensor.
To update the firmware, do the following:
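The exact commands depend on the package you downloaded. As a rough sketch only: Dell Update Packages for Linux are self-contained executables, so the usual pattern is to make the .BIN file executable and run it as root. The function name below is hypothetical, and so is any file name you pass to it; substitute the name of the package actually saved to /home.

```shell
#!/bin/sh
# Hypothetical helper: run a downloaded Dell Update Package (.BIN).
# Assumption: the package is a self-contained executable, run as root.
run_update_package() {
    pkg="$1"
    if [ ! -f "$pkg" ]; then
        echo "Update package not found: $pkg" >&2
        return 1
    fi
    # A bare file name would be looked up in PATH, so anchor it to the
    # current directory.
    case "$pkg" in
        */*) ;;
        *) pkg="./$pkg" ;;
    esac
    chmod +x "$pkg" || return 1   # make the package executable
    "$pkg"                        # launch the installer and follow its prompts
}
```

The function refuses to continue if the file is missing, which avoids a confusing "command not found" error when the download path is mistyped.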
This takes a few minutes. The installer verifies that the firmware is the correct firmware for the hardware and displays a lengthy set of license details. Once it is done, it reboots the system.
SA should come online after the reboot.