Nagios reporting host down but its not?

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
Alan
Posts: 86
Joined: Wed Aug 21, 2019 4:14 pm

Nagios reporting host down but its not?

Post by Alan »

I have been getting a few issues with Nagios sending an email saying a server is down but it actually is not. I am able to login to it. I am thinking this may have to do with my configuration but I am not sure. I am using ncpa.cfg and I have a Ping service setup there I am pretty sure it was just the default which is:

Code: Select all

define service {
    host_name               Svr-Data
    service_description     Ping
    check_command           check_ping!60.0,5%!100.0,10%
    max_check_attempts      5
    check_interval          5
    retry_interval          1
    check_period            24x7
    notification_interval   60
    notification_period     24x7
    contact_groups          admins2
    register                1
}
I did find a form and someone suggested to look at the command.cfg file and said it should look like this:

Code: Select all

define command{
command_name check-host-alive
command_line $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 5
}
This is exactly how mine looks. I don't know if i need to change any values in one or both of these spots?

I have also been seeing some weird issues in the Nagios UI that is basically saying the status is down but all the services are ok. I have seen this several times and tired reloading the Nagios service and the httpd service but it does not seem to fix it. This does eventually go away on its own and change to status up but it sometimes takes a day or so.
Attachments
Saying down when it is not.png
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Nagios reporting host down but its not?

Post by scottwilkerson »

You show the configuration for a service, but what is the configuration for the host? it is the check_command for the host definition that will determine if it is marked down
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
Alan
Posts: 86
Joined: Wed Aug 21, 2019 4:14 pm

Re: Nagios reporting host down but its not?

Post by Alan »

Here is the host setting.

Code: Select all

define host {
    host_name               Svr-Data
    address                 172.16.10.4
    hostgroups              VMs, physical_VMs
    check_command           check_ncpa!-t 'PublicS' -P 5693 -M system/agent_version
    max_check_attempts      5
    check_interval          5
    retry_interval          1
    check_period            24x7
    contact_groups          admins2, calls
    notification_interval   60
    notification_period     24x7
    notifications_enabled   1
    notification_options    d,u,r
    icon_image              ncpa.png
    statusmap_image         ncpa.png
    register                1
}
Is this what you are wanting?
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Nagios reporting host down but its not?

Post by scottwilkerson »

Alan wrote:Is this what you are wanting?
Yes, and the Svr-Data shows down in the UI?

What is the output shown on the host status page?
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
Alan
Posts: 86
Joined: Wed Aug 21, 2019 4:14 pm

Re: Nagios reporting host down but its not?

Post by Alan »

This is the Svr-data Nagios UI
Attachments
Svr-Data.png
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Nagios reporting host down but its not?

Post by scottwilkerson »

Click on Svr-Data under the host column to see the host status detail page
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
Alan
Posts: 86
Joined: Wed Aug 21, 2019 4:14 pm

Re: Nagios reporting host down but its not?

Post by Alan »

Sorry here is the Host State Information for Svr-Data.
Attachments
Host State Information.png
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Nagios reporting host down but its not?

Post by scottwilkerson »

That host is reporting UP
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
Alan
Posts: 86
Joined: Wed Aug 21, 2019 4:14 pm

Re: Nagios reporting host down but its not?

Post by Alan »

Ya sorry for any misunderstanding. It show up there has just been a few instances that I got a email from Nagois that said it was down. So I would login to the server and it was not down I was able to login to it. So am just trying to find out why it said it was down when it was not. I have had this same thing happen also on a few other servers.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Nagios reporting host down but its not?

Post by scottwilkerson »

Are you sure it didn't go down (or lose connectivity) and then recover before you could check?
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
Locked