NETAPP + SNMP TIMEOUTS + UNNECESSARY NOTIFICATIONS
Posted: Wed Jul 10, 2013 6:22 am
|||| Nagios 3.2.3 ||||
Hi everybody
I'm trying to figure out how to fix this:
Our NetApp storage is not reliable over SNMP and misses lots of checks due to timeouts (we have already engaged NetApp's support, but with very low expectations of them fixing it).
A timeout is reported as an UNKNOWN state (as defined by service_check_timeout_state=u). When a volume is already in a HARD WARNING / CRITICAL state, the timeout becomes a HARD state change and a notification is correctly sent out. However, this happens constantly and clutters the useful notifications.
It would be nice if fewer than N timeouts in a row did not generate a HARD state change, while more than N would mean something more serious, in which case a notification is fine.
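To make the idea concrete, here is a minimal sketch of a wrapper around the check plugin that hands Nagios the last real result while fewer than N timeouts have occurred in a row, and only lets UNKNOWN through from the Nth one. Everything in it is an assumption for illustration: the names mask_timeout and run_check, the JSON state file, the value N = 3, and the guess that a timeout can be recognised by exit code 3 plus the word "Timeout" in the plugin output.

```python
#!/usr/bin/env python3
"""Hypothetical wrapper plugin: hide short runs of SNMP timeouts from Nagios."""
import json
import subprocess
import sys

UNKNOWN = 3        # standard plugin exit code for UNKNOWN
MAX_TIMEOUTS = 3   # the "N" from the post; the value is an assumption


def is_timeout(code, output):
    # Assumption: the plugin reports a timeout as UNKNOWN with "Timeout" in its text.
    return code == UNKNOWN and "timeout" in output.lower()


def mask_timeout(code, output, state, max_timeouts=MAX_TIMEOUTS):
    """Return (code, output, new_state) to hand back to Nagios."""
    if not is_timeout(code, output):
        # A real result: remember it and reset the timeout counter.
        return code, output, {"last_code": code, "last_output": output, "timeouts": 0}
    timeouts = state.get("timeouts", 0) + 1
    new_state = dict(state, timeouts=timeouts)
    if timeouts < max_timeouts and "last_code" in state:
        # Fewer than N timeouts in a row: repeat the last real result.
        note = " (stale, timeout %d of %d)" % (timeouts, max_timeouts)
        return state["last_code"], state["last_output"] + note, new_state
    # N or more in a row, or no earlier result to fall back on: report UNKNOWN.
    return code, output, new_state


def run_check(command, limit=30):
    """Run the real plugin; treat a hang longer than `limit` seconds as a timeout."""
    try:
        proc = subprocess.run(command, capture_output=True, text=True, timeout=limit)
    except subprocess.TimeoutExpired:
        return UNKNOWN, "UNKNOWN - Timeout - wrapper gave up after %ds" % limit
    return proc.returncode, proc.stdout.strip()


def main(state_file, command):
    try:
        with open(state_file) as fh:
            state = json.load(fh)
    except (OSError, ValueError):
        state = {}  # first run or unreadable state file
    code, output = run_check(command)
    code, output, state = mask_timeout(code, output, state)
    with open(state_file, "w") as fh:
        json.dump(state, fh)
    print(output)
    return code


if __name__ == "__main__" and len(sys.argv) > 2:
    sys.exit(main(sys.argv[1], sys.argv[2:]))
```

The wrapper would be called with a state file path followed by the original check command. Its own limit has to stay below service_check_timeout, otherwise Nagios kills the wrapper first and reports "(Service Check Timed Out)" as before. The obvious trade-off is that a masked result is stale, which is why the sketch marks it as such in the plugin output.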
I don't think this is recognizable as flapping...is it?
Any idea?
Thanks
+Emanuele
[1373448417] SERVICE ALERT: NETAPPSAN ;check_manu_test;UNKNOWN;SOFT;1;NAF UNKNOWN - Timeout - no SNMP answer from 192.168....

[1373449127] SERVICE ALERT: NETAPPSAN ;check_manu_test;CRITICAL;HARD;5;NAF CRITICAL - 1 CRITICAL: vol_data:volmanutest, 2 OK: vol_snap:volmanutest vol_files:volmanutest
[1373449127] SERVICE NOTIFICATION: ebardelli-email;NETAPPSAN ;check_manu_test;CRITICAL;notify-service-by-email;NAF CRITICAL - 1 CRITICAL: vol_data:volmanutest, 2 OK: vol_snap:volmanutest vol_files:volmanutest
[1373449457] SERVICE ALERT: NETAPPSAN ;check_manu_test;UNKNOWN;HARD;5;(Service Check Timed Out)
[1373450337] SERVICE ALERT: NETAPPSAN ;check_manu_test;CRITICAL;HARD;5;NAF CRITICAL - 1 CRITICAL: vol_data:volmanutest, 2 OK: vol_snap:volmanutest vol_files:volmanutest
[1373450337] SERVICE NOTIFICATION: ebardelli-email;NETAPPSAN ;check_manu_test;CRITICAL;notify-service-by-email;NAF CRITICAL - 1 CRITICAL: vol_data:volmanutest, 2 OK: vol_snap:volmanutest vol_files:volmanutest