Checks randomly not reaching hard state

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
burkm
Posts: 31
Joined: Thu Jan 21, 2016 5:10 pm

Re: Checks randomly not reaching hard state

Post by burkm »

Yes, I cloned the maint branch yesterday.

The problem I'm seeing is that sometimes checks return critical results up to the max check attempts, but remain in the SOFT state. Therefore it sends no notifications.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Checks randomly not reaching hard state

Post by scottwilkerson »

burkm wrote:The problem I'm seeing is that sometimes checks return critical results up to the max check attempts, but remain in the SOFT state. Therefore it sends no notifications.
Is the services host or any dependencies in a non-UP state?
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
burkm
Posts: 31
Joined: Thu Jan 21, 2016 5:10 pm

Re: Checks randomly not reaching hard state

Post by burkm »

No, the particular service check I'm watching has no dependencies; nor does the host that it's on. The host has a dependent host, but both are up. I've double-checked that all notifications are enabled.

Here are some nagios.log messages for that service:

Code: Select all

[1536130094] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;1;SNMP CRITICAL - Drives *4*
[1536130214] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;2;SNMP CRITICAL - Drives *4*
[1536130335] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;3;SNMP CRITICAL - Drives *4*
[1536130455] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;3;SNMP CRITICAL - Drives *4*
[1536130575] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;3;SNMP CRITICAL - Drives *4*
[1536130695] SERVICE ALERT: Roentgenium;HP Drives;CRITICAL;SOFT;3;SNMP CRITICAL - Drives *4*
    ...
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Checks randomly not reaching hard state

Post by scottwilkerson »

is it possible for you to PM me your current status.dat ?
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
Locked