Page 2 of 2
Re: Checks randomly not reaching hard state
Posted: Wed Sep 05, 2018 1:06 pm
by burkm
Yes, I cloned the maint branch yesterday.
The problem I'm seeing is that sometimes checks return critical results up to the max check attempts, but remain in the SOFT state. Therefore it sends no notifications.
Re: Checks randomly not reaching hard state
Posted: Wed Sep 05, 2018 1:15 pm
by scottwilkerson
burkm wrote:The problem I'm seeing is that sometimes checks return critical results up to the max check attempts, but remain in the SOFT state. Therefore it sends no notifications.
Is the services host or any dependencies in a non-UP state?
Re: Checks randomly not reaching hard state
Posted: Wed Sep 05, 2018 3:53 pm
by burkm
No, the particular service check I'm watching has no dependencies; nor does the host that it's on. The host has a dependent host, but both are up. I've double-checked that all notifications are enabled.
Here are some nagios.log messages for that service:
Code: Select all
[1536130094] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;1;SNMP CRITICAL - Drives *4*
[1536130214] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;2;SNMP CRITICAL - Drives *4*
[1536130335] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;3;SNMP CRITICAL - Drives *4*
[1536130455] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;3;SNMP CRITICAL - Drives *4*
[1536130575] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;3;SNMP CRITICAL - Drives *4*
[1536130695] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;3;SNMP CRITICAL - Drives *4*
...
Re: Checks randomly not reaching hard state
Posted: Wed Sep 05, 2018 4:17 pm
by scottwilkerson
is it possible for you to PM me your current status.dat ?