We've got a pair of new alerts for a Dell file server with several attached drive arrays. One alert clearly refers to the array controller; it is not clear what the other refers to, beyond being storage-related. These alerts are:

Drive ID 4  storage  3h 17m 52s ago  alert  fail
Drive Controller PERC H730P Mini (Embedded) 3  storage  3h 19m 5s ago  alert  fail
These alerts appeared immediately after starting the systemd service dsm_sa_datamgrd, which had failed for an unknown reason. I believe this service runs an agent for the Dell OpenManage monitoring portal. Even after the service was started again, that portal still reports that it cannot communicate with the system.
The iDRAC for this system reports no faults, and all relevant lights on it are green. The system appears to be working, in that the files it serves are available as normal.
Do you have any ideas on how to take this investigation forward? I see that a similar thread, https://lists.observium.org/hyperkitty/list/observium@lists.observium.org/me..., was opened nearly a year ago, but it received no responses.
participants (1)
-
eey@pml.ac.uk