Seeing EDAC MC errors in messages on SSLV device

book

Article ID: 169222

calendar_today

Updated On:

Products

SSL Visibility Appliance Software

Issue/Introduction

In messages of SSLV device similar error message can be seen:

kernel: [12648643.728920] EDAC MC1: 1 CE error on CPU#1Channel#1_DIMM#0 (channel:1 slot:0 page:0x0 offset:0x0 grain:8 syndrome:0x0)
kernel: [27410407.408606] EDAC MC1: 32758 CE error on CPU#1Channel#1_DIMM#0 (channel:1 slot:0 page:0x0 offset:0x0 grain:8 syndrome:0x0) 


These lines report 1 respectively 32758 CE (Correctable Errors) detected in memory.
 

Cause

These messages are reported by EDAC kernel module whose goal is to detect and report hardware errors. Memory Correctable Errors (CE) and Uncorrectable Errors (UE) are the primary errors being harvested.

Detecting CE events, then harvesting those events and reporting them, can but must not necessarily be a predictor of future UE events. With CE events only, the system can and will continue to operate as no data has been damaged yet.

 

Resolution

It's recommended to preventively run Memtest86+ from the GRUB boot menu of SSLV device to reduce likelihood of potential future memory Uncorrectable Errors (UE) events and system panics.

Note: The GRUB boot loader is only available in the SSLV 3.x chain since SSLV 4.X uses a different bootloader.