5 minute power failure support
Posted: Tue Jan 19, 2021 9:47 am
Hi everyone,
I am new to Nagios and am struggling to write a trap handler or service for the following scenario.
I am monitoring remote sites with ping checks every minute and with passive traps. (The 1-minute poll rate is a requirement, so it cannot be changed.)
Each site sends a power alert passive trap (below) if AC power fails:
snmptrap -v 2c -c public 192.168.2.xxx '' netSnmpExampleHeartbeatNotification netSnmpExampleHeartbeatName s "SITE_POWER_CRITICAL."
The site then has a 5-minute battery backup. If power is restored, the following trap is sent:
snmptrap -v 2c -c public 192.168.2.xxx '' netSnmpExampleHeartbeatNotification netSnmpExampleHeartbeatName s "SITE_POWER_OK."
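For reference, this is roughly how I picture the two trap strings mapping onto a passive service result. It is only a sketch in Python: the host name `site-01` and service description `Site Power` are made up for illustration, and I am assuming the trap handler would submit results using the standard PROCESS_SERVICE_CHECK_RESULT external command format.

```python
import time

# Standard Nagios plugin return codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def trap_to_state(message):
    """Map the trap's string payload to a Nagios return code."""
    text = message.strip().rstrip(".")
    if text == "SITE_POWER_OK":
        return OK
    if text == "SITE_POWER_CRITICAL":
        return CRITICAL
    # Anything unexpected is reported as UNKNOWN rather than guessed at.
    return UNKNOWN


def passive_result_line(host, service, message, now=None):
    """Format one PROCESS_SERVICE_CHECK_RESULT external command line.

    host and service are hypothetical names; they must match the
    host_name and service_description defined in the Nagios config.
    """
    timestamp = int(time.time() if now is None else now)
    state = trap_to_state(message)
    return "[%d] PROCESS_SERVICE_CHECK_RESULT;%s;%s;%d;%s" % (
        timestamp, host, service, state, message)


if __name__ == "__main__":
    print(passive_result_line("site-01", "Site Power", "SITE_POWER_CRITICAL."))
```

The resulting line would then be written to the Nagios external command file by whatever handles the trap.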
What I need is a service that raises an alert if the SITE_POWER_CRITICAL trap has not been cleared by a SITE_POWER_OK trap within 5 minutes.
Alternatively, the 1-minute ping check could be used: if it goes CRITICAL and has not returned to OK after 5 minutes, raise a service alert.
I am open to any other approach as well.
In other words, looking at the Service Status screen, if the Duration of a power CRITICAL alarm is more than 5 minutes, I want to raise a permanent power failure alarm.
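To make the rule concrete, here is a minimal sketch in Python of the decision I am after. The function name and the idea of tracking a `critical_since` timestamp (the time of the last SITE_POWER_CRITICAL trap that has not been cleared) are my own assumptions, not something Nagios provides under that name.

```python
import time

# Nagios plugin return codes used here.
OK, CRITICAL = 0, 2

# Battery backup window described above: 5 minutes.
GRACE_SECONDS = 5 * 60


def power_failure_state(critical_since, now=None, grace=GRACE_SECONDS):
    """Decide whether a power failure has outlasted the battery window.

    critical_since: epoch seconds of the last uncleared
    SITE_POWER_CRITICAL trap, or None if power is currently OK.
    Returns a (return_code, plugin_output) tuple.
    """
    now = time.time() if now is None else now
    if critical_since is None:
        return OK, "OK - site power normal"
    elapsed = int(now - critical_since)
    if elapsed > grace:
        return CRITICAL, (
            "CRITICAL - power failed for %ds, battery window exceeded" % elapsed)
    # Still inside the battery window: do not raise the permanent alarm yet.
    return OK, "OK - on battery for %ds, within %ds window" % (elapsed, grace)


if __name__ == "__main__":
    code, text = power_failure_state(critical_since=time.time() - 400)
    print(code, text)
```

A check like this would need to run as an active service every minute so that the alarm fires even when no further trap arrives.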
Any help would be greatly appreciated.
Regards,
Richie