Page 1 of 1

Passive check does not notify

Posted: Thu Jul 06, 2017 11:36 am
by Pitone_Maledetto
Hi all,
I am running some passive checks on a Windows OS server using NRDS_Win to the NRDP daemon.
The problem that I am facing is that every HARD state changes I am not getting any alerts both via email and on the dedicated slack channel.
This is my service definition:

Code: Select all

###############################################################################
# SERVICE DEFINITIONS
###############################################################################

# PASSIVE CHECKS
define service{
        use                     generic-service
        host_name               mywindows10
        service_description     check_logs
        check_command           stale_critical
        normal_check_interval   1
        is_volatile             1
        check_freshness         1
        freshness_threshold     330
        max_check_attempts      1
        flap_detection_enabled  0
        }
This my generic-service:

Code: Select all

define service{
        name                            generic-service         ; The 'name' of this service template
        active_checks_enabled           1                       ; Active service checks are enabled
        passive_checks_enabled          1                       ; Passive service checks are enabled/accepted
        parallelize_check               1                       ; Active service checks should be parallelized (disabling this can lead to major performance problems)
        obsess_over_service             1                       ; We should obsess over this service (if necessary)
        check_freshness                 0                       ; Default is to NOT check service 'freshness'
        notifications_enabled           1                       ; Service notifications are enabled
        event_handler_enabled           1                       ; Service event handler is enabled
        flap_detection_enabled          1                       ; Flap detection is enabled
        process_perf_data               1                       ; Process performance data
        retain_status_information       1                       ; Retain status information across program restarts
        retain_nonstatus_information    1                       ; Retain non-status information across program restarts
        is_volatile                     0                       ; The service is not volatile
        check_period                    24x7                    ; The service can be checked at any time of the day
        max_check_attempts              3                       ; Re-check the service up to 3 times in order to determine its final (hard) state
        normal_check_interval           5                       ; Check the service every 5 minutes under normal conditions
        retry_check_interval            2                       ; Re-check the service every two minutes until a hard state can be determined
        contact_groups                  Infranagios             ; Notifications get sent out to everyone in the 'admins' group
        notification_options            w,u,c,r                 ; Send notifications about warning, unknown, critical, and recovery events
        notification_interval           15                      ; Re-notify about service problems every 10 minutes
        notification_period             24x7                    ; Notifications can be sent out at any time
        register                        0                       ; DONT REGISTER THIS DEFINITION
        }
and this is a snippet of my nagios log file:

Code: Select all

[1499358496] PASSIVE SERVICE CHECK: mywindows10;check_logs;2;CRITICAL - (1 errors in check_logfiles.protocol-2017-07-06-17-28-01) - ERROR timestamp appfailure 1
[1499358496] SERVICE ALERT: mywindows10;check_logs;CRITICAL;HARD;1;CRITICAL - (1 errors in check_logfiles.protocol-2017-07-06-17-28-01) - ERROR timestamp appfailure 1
I can't understand why I don't get any alerts through.
Any help is greatly appreciated.
Regards

Re: Passive check does not notify

Posted: Thu Jul 06, 2017 4:07 pm
by tmcdonald
It looks like the alert fires but there might be issues with the contacts or the notification command. Can you post definitions for those as well?

Re: Passive check does not notify

Posted: Fri Jul 07, 2017 7:36 am
by Pitone_Maledetto
Hi tmcdonald,
just to say that the Infranagios contactgroup is working fine with all the others definitions.

Code: Select all

define contactgroup{
        contactgroup_name       Infranagios
        alias                   Infranagios
        members                 Infranagios,slack
        }
notification command running.

Code: Select all

# 'service-email-graph-int-en' command definition, sends
# HTML e-mails in English, includes Nagios URL, inline logo and graph
define command{
        command_name    notify-service-by-email
        command_line    /usr/local/nagios/libexec/nagios_send_service_mail.pl \
-p "company name" \
-f graph -u
}

# 'host-email-graph-int-en' command definition, sends
# HTML e-mails in English, includes Nagios URL, inline logo and graph
define command{
        command_name    notify-host-by-email
        command_line    /usr/local/nagios/libexec/nagios_send_host_mail.pl \
-p "company name" \
-f multi -u
}

# Slack command definition sends
define command {
      command_name     notify-service-by-slack
      command_line     /usr/local/nagios/libexec/slack_nagios -field slack_channel=#infranagios
}

define command {
      command_name     notify-host-by-slack
      command_line     /usr/local/nagios/libexec/slack_nagios -field slack_channel=#infranagios
}
Regards

Re: Passive check does not notify

Posted: Fri Jul 07, 2017 11:57 am
by scottwilkerson
Is the host mywindows10 in a not OK state?

This would cause notifications to be suppressed. If you had dependencies it could as well

Re: Passive check does not notify

Posted: Fri Jul 07, 2017 12:46 pm
by Pitone_Maledetto
Hi scottwilkerson,
yes indeed it is down but I was not worried about that since my test only involved the check_logs services.
So passive checks do not alert if the host is on a not OK state or also active checks behave this way?
The host is now up on a OK state and the alert notifications are coming through.
Thank you all for your help as always.
This thread can be closed.
Best Regards

Re: Passive check does not notify

Posted: Fri Jul 07, 2017 1:29 pm
by scottwilkerson
Here is a thread that offers some more information on the topic
https://support.nagios.com/forum/viewto ... 96#p102971