Hi,
 
I’ve opened a TAC for this as it’s quite important because if we miss the Syslog events, then we have nothing else which will tell us about equipment at a remote location having a major fault.
 
In the meantime, I found this: http://www.cisco.com/c/en/us/td/docs/routers/asr9000/mib/guide/asr9kmib/asr9kappa.html
 
Specifically:
 
MIBs Used for Physical Entity Management
 
I’m not very well versed (read: no clue at all) on how the various information is polled and correlated between the different MIBs by Observium, but - can you confirm/deny if the MIBs mentioned here are something would indeed be picked up by Observium? (Assuming Cisco were presenting it correctly as they claim in the document).
 
I’ll update with the outcome of the TAC in due course, if only for humour value.
 
Cheers,
 
From: observium [mailto:observium-bounces@observium.org] On Behalf Of Adam Armstrong
Sent: 20 January 2016 10:50
To: Observium Network Observation System <observium@observium.org>
Cc: Observium Network Observation System <observium@observium.org>
Subject: Re: [Observium] ASR9k fan failures not detected?
 
ASR, stuff missing from SNMP. Where have I heard this before? Oh yes, every ASR I've ever encountered!
You'd think Cisco would sort their shit out at some point!
We support CISCO-ENTITY-SENSOR-MIB, which is where these should be (and are on other devices), but Cisco's QA for ASR software is in the toilet.
I'd recommend opening a TAC case and shouting at them.
Adam.
Sent from BlueMail
 
 
Robert Williams
Custodian Data Centre
Email: Robert@CustodianDC.com
http://www.CustodianDC.com
 
 
On 20 Jan 2016, at 10:43, Robert Williams <Robert@CustodianDC.com> wrote:
Hi,
 
I’ve just noticed that fan failures on the ASR 9k are not detected by Observium. In fact, it seems have nothing at all under the ‘status’ tab?
 
The 6500s have over 20 entries in there for “Fan - Fail” or “PSU - Fail” or “I’m not facing magnetic north - Fail” etc. but the ASR9000 have no status tab whatsoever.
 
Reason for noticing this; is that we have a chassis which currently has a ‘Major-Alarm’ failure indicator LED on the RSP, due to a failed fan tray. Observium shows the device has being ‘OK’ and is not seeing either the Major alarm or the fan tray failure:
 
canb-server[151]: %PLATFORM-CANB_SERVER-3-ALARM_INDICATION : Raise alarm from CBC in slot 0/FT0/SP, alarm code CBC_ALRM_FT_LED_RED_FAILED
 
LED Information
---------------------------------------------
 
R/S/I   Modules LED             Status
0/RSP0/*
        host    Critical-Alarm  Off
        host    Major-Alarm     On
        host    Minor-Alarm     Off
        host    ACO             Off
        host    Fail            Off
 
 
        FAN0    FAN1    FAN2    FAN3    FAN4    FAN5    FAN6    FAN7    FAN8    FAN9    FAN10   FAN11   FAN12   FAN13
0/FT0/* (Speed)
        9900    9900    9870    9780    9870    9780    9780    10080   10080   9870    9990    9900    9810    9990
(fans are now OK again)
 
Is this a case of Cisco not exporting any of this useful (and rather critical) information properly? Would be good to graph fan speeds if they are exported. Or are other people getting these OK and it’s just us with the issue?
 
Cheers!
 
 
Robert Williams
Custodian Data Centre
Email: Robert@CustodianDC.com
http://www.CustodianDC.com
 
 
 
 

observium mailing list
observium@observium.org
http://postman.memetic.org/cgi-bin/mailman/listinfo/observium