Passive check does not notify

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
Locked
User avatar
Pitone_Maledetto
Posts: 69
Joined: Fri Jul 01, 2016 4:11 am
Location: Liverpool, United Kingdom

Passive check does not notify

Post by Pitone_Maledetto »

Hi all,
I am running some passive checks on a Windows OS server using NRDS_Win to the NRDP daemon.
The problem that I am facing is that every HARD state changes I am not getting any alerts both via email and on the dedicated slack channel.
This is my service definition:

Code: Select all

###############################################################################
# SERVICE DEFINITIONS
###############################################################################

# PASSIVE CHECKS
define service{
        use                     generic-service
        host_name               mywindows10
        service_description     check_logs
        check_command           stale_critical
        normal_check_interval   1
        is_volatile             1
        check_freshness         1
        freshness_threshold     330
        max_check_attempts      1
        flap_detection_enabled  0
        }
This my generic-service:

Code: Select all

define service{
        name                            generic-service         ; The 'name' of this service template
        active_checks_enabled           1                       ; Active service checks are enabled
        passive_checks_enabled          1                       ; Passive service checks are enabled/accepted
        parallelize_check               1                       ; Active service checks should be parallelized (disabling this can lead to major performance problems)
        obsess_over_service             1                       ; We should obsess over this service (if necessary)
        check_freshness                 0                       ; Default is to NOT check service 'freshness'
        notifications_enabled           1                       ; Service notifications are enabled
        event_handler_enabled           1                       ; Service event handler is enabled
        flap_detection_enabled          1                       ; Flap detection is enabled
        process_perf_data               1                       ; Process performance data
        retain_status_information       1                       ; Retain status information across program restarts
        retain_nonstatus_information    1                       ; Retain non-status information across program restarts
        is_volatile                     0                       ; The service is not volatile
        check_period                    24x7                    ; The service can be checked at any time of the day
        max_check_attempts              3                       ; Re-check the service up to 3 times in order to determine its final (hard) state
        normal_check_interval           5                       ; Check the service every 5 minutes under normal conditions
        retry_check_interval            2                       ; Re-check the service every two minutes until a hard state can be determined
        contact_groups                  Infranagios             ; Notifications get sent out to everyone in the 'admins' group
        notification_options            w,u,c,r                 ; Send notifications about warning, unknown, critical, and recovery events
        notification_interval           15                      ; Re-notify about service problems every 10 minutes
        notification_period             24x7                    ; Notifications can be sent out at any time
        register                        0                       ; DONT REGISTER THIS DEFINITION
        }
and this is a snippet of my nagios log file:

Code: Select all

[1499358496] PASSIVE SERVICE CHECK: mywindows10;check_logs;2;CRITICAL - (1 errors in check_logfiles.protocol-2017-07-06-17-28-01) - ERROR timestamp appfailure 1
[1499358496] SERVICE ALERT: mywindows10;check_logs;CRITICAL;HARD;1;CRITICAL - (1 errors in check_logfiles.protocol-2017-07-06-17-28-01) - ERROR timestamp appfailure 1
I can't understand why I don't get any alerts through.
Any help is greatly appreciated.
Regards
"It is impossible to work in information technology without also engaging in social engineering"
Jaron Lanier
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Passive check does not notify

Post by tmcdonald »

It looks like the alert fires but there might be issues with the contacts or the notification command. Can you post definitions for those as well?
Former Nagios employee
User avatar
Pitone_Maledetto
Posts: 69
Joined: Fri Jul 01, 2016 4:11 am
Location: Liverpool, United Kingdom

Re: Passive check does not notify

Post by Pitone_Maledetto »

Hi tmcdonald,
just to say that the Infranagios contactgroup is working fine with all the others definitions.

Code: Select all

define contactgroup{
        contactgroup_name       Infranagios
        alias                   Infranagios
        members                 Infranagios,slack
        }
notification command running.

Code: Select all

# 'service-email-graph-int-en' command definition, sends
# HTML e-mails in English, includes Nagios URL, inline logo and graph
define command{
        command_name    notify-service-by-email
        command_line    /usr/local/nagios/libexec/nagios_send_service_mail.pl \
-p "company name" \
-f graph -u
}

# 'host-email-graph-int-en' command definition, sends
# HTML e-mails in English, includes Nagios URL, inline logo and graph
define command{
        command_name    notify-host-by-email
        command_line    /usr/local/nagios/libexec/nagios_send_host_mail.pl \
-p "company name" \
-f multi -u
}

# Slack command definition sends
define command {
      command_name     notify-service-by-slack
      command_line     /usr/local/nagios/libexec/slack_nagios -field slack_channel=#infranagios
}

define command {
      command_name     notify-host-by-slack
      command_line     /usr/local/nagios/libexec/slack_nagios -field slack_channel=#infranagios
}
Regards
"It is impossible to work in information technology without also engaging in social engineering"
Jaron Lanier
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Passive check does not notify

Post by scottwilkerson »

Is the host mywindows10 in a not OK state?

This would cause notifications to be suppressed. If you had dependencies it could as well
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
User avatar
Pitone_Maledetto
Posts: 69
Joined: Fri Jul 01, 2016 4:11 am
Location: Liverpool, United Kingdom

Re: Passive check does not notify

Post by Pitone_Maledetto »

Hi scottwilkerson,
yes indeed it is down but I was not worried about that since my test only involved the check_logs services.
So passive checks do not alert if the host is on a not OK state or also active checks behave this way?
The host is now up on a OK state and the alert notifications are coming through.
Thank you all for your help as always.
This thread can be closed.
Best Regards
"It is impossible to work in information technology without also engaging in social engineering"
Jaron Lanier
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Passive check does not notify

Post by scottwilkerson »

Here is a thread that offers some more information on the topic
https://support.nagios.com/forum/viewto ... 96#p102971
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
Locked