time outs while doing check on specific devices

bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

time outs while doing check on specific devices

Post by bosecorp »

I am getting timeouts/alerts when Nagios runs its check on specific devices. When I ping them from my desktop or from the worker, it works fine. This only seems to be happening for four specific devices.

At first I thought it had something to do with the devices themselves. However, these devices are in different geographic locations, and in the same locations I have hundreds of devices that are not having any problems.
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: time outs while doing check on specific devices

Post by tmcdonald »

What type of device is this?

What type of check?

Can you show the specific timeout message?
Former Nagios employee
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Re: time outs while doing check on specific devices

Post by bosecorp »

It just happens with a device. It seems to happen with network devices.

"mydevice name" (This host is flapping) Down 38s 1/15 2015-03-26 11:10:52 CRITICAL - 10.28.128.7: rta nan, lost 100%
cmerchant
Posts: 546
Joined: Wed Sep 24, 2014 11:19 am

Re: time outs while doing check on specific devices

Post by cmerchant »

What check are you using on the devices?

Can you run the check from the command line on the XI server?

Are these checks done on a specific ip address or hostname resolved by dns?
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Re: time outs while doing check on specific devices

Post by bosecorp »

check_icmp

If I run it from the command line, it works.

It's done by IP.
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: time outs while doing check on specific devices

Post by tgriep »

Can you run the check_icmp from the command line from the server that the check is running on and post the output here?
Also, can you post how the check_icmp command is configured?
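For reference, a manual run could look like the sketch below. The plugin directory /usr/local/nagios/libexec is a common default but is an assumption here, as are the threshold values; icmp_cmd is a hypothetical helper used only for illustration. The address is the one from the alert earlier in the thread.

```shell
# Assumed plugin directory (a common default); adjust if your install differs.
PLUGIN_DIR="/usr/local/nagios/libexec"

# Hypothetical helper: print the check_icmp command line for a given address.
# Thresholds are placeholders: warn at 200 ms / 40% loss, critical at 500 ms / 80% loss.
icmp_cmd() {
    echo "$PLUGIN_DIR/check_icmp -H $1 -w 200.0,40% -c 500.0,80%"
}

# Print the command for the address from the alert.
# To actually run it as the nagios user (so it matches what the engine does):
#   sudo -u nagios $(icmp_cmd 10.28.128.7); echo "exit code: $?"
# Plugin exit codes: 0 = OK, 1 = WARNING, 2 = CRITICAL, 3 = UNKNOWN.
icmp_cmd 10.28.128.7
```

Running the check as the nagios user rather than root helps rule out permission differences between a manual test and the scheduled check.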
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Re: time outs while doing check on specific devices

Post by bosecorp »

This is how it is configured:

$USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$
cmerchant
Posts: 546
Joined: Wed Sep 24, 2014 11:19 am

Re: time outs while doing check on specific devices

Post by cmerchant »

Could you try adding a timeout parameter -t 30:

$USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 5 -t 30
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Re: time outs while doing check on specific devices

Post by bosecorp »

What are those things for?

And it did not work.

I just saw a device go down when it actually is not down. However, it comes right back up.
cmerchant
Posts: 546
Joined: Wed Sep 24, 2014 11:19 am

Re: time outs while doing check on specific devices

Post by cmerchant »

I should have left the hard-coded values out of the example and kept the $ARG1$ and $ARG2$ macros. The timeout parameter should help reduce false positives.

$USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -t 30
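As for what the options are for, here is the same command annotated. This is a sketch based on the standard check_ping options; the example values in the comments are the ones from the earlier post, not recommendations.

```cfg
# -H  host address to ping ($HOSTADDRESS$ is filled in by Nagios)
# -w  warning threshold:  round-trip average in ms, then packet loss in percent (e.g. 3000.0,80%)
# -c  critical threshold: same format as -w (e.g. 5000.0,100%)
# -p  number of ICMP packets to send (optional)
# -t  seconds to wait before the plugin gives up
$USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -t 30
```

With $ARG1$ and $ARG2$ left in place, the thresholds keep coming from each host or service definition, and only the timeout changes.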