Nagios error daily
Posted: Mon Nov 12, 2018 3:30 am
by Phill123
In Nagios, I get a critical notification every day at 5:00 am, "CRITICAL - Plugin timed out while executing system call", and at 5:10 am I get the recovery notification. This seems strange. Below is the service definition I use for the device. Has anyone faced this issue before?
Code:
define service{
    use                  generic-service
    host_name            GRS-BB01
    service_description  Ports G
    check_command        check_snmp!-C TEST -o .1.3.6.1.2.1.2.2.1.8.28 -w 1 -c 1
}
Re: Nagios error daily
Posted: Mon Nov 12, 2018 11:43 am
by bolson
Hello Phill123,
If this issue only affects this service check, and it consistently occurs at the same time every day, it's very unlikely that it's caused on the Nagios side. I would look at the logs on GRS-BB01 and see if anything unusual happens at 5:00AM. Also, if you're monitoring other metrics on GRS-BB01, you might look at the memory and cpu performance graphs in Nagios to see if there are any spikes at that time. Let me know what you find out.
Thank you for visiting the Nagios Support Forum!
Re: Nagios error daily
Posted: Thu Nov 15, 2018 12:14 am
by Phill123
How can I check the logs on GRS-BB01?
I monitor a lot of devices and metrics, and this error shows up on only two devices, for one specific metric.
Re: Nagios error daily
Posted: Thu Nov 15, 2018 12:53 pm
by bolson
Hello phill123,
What kind of device are you monitoring?
Re: Nagios error daily
Posted: Sun Nov 18, 2018 2:11 am
by Phill123
Cisco switches
Re: Nagios error daily
Posted: Mon Nov 19, 2018 1:36 pm
by scottwilkerson
.1.3.6.1.2.1.2.2.1.8.28 checks ifOperStatus for the interface with ifIndex 28.
It is possible that some other process keeps your switch very busy between 5:00 am and 5:10 am. Because SNMP is handled at low priority, the switch may not respond to the query during that time.
As mentioned before:
bolson wrote:If this issue only affects this service check, and it consistently occurs at the same time every day, it's very unlikely that it's caused on the Nagios side. I would look at the logs on GRS-BB01 and see if anything unusual happens at 5:00AM. Also, if you're monitoring other metrics on GRS-BB01, you might look at the memory and cpu performance graphs in Nagios to see if there are any spikes at that time
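If the switch is simply slow to answer SNMP during that window, one possible way to keep the check from alerting is to give the plugin more time and let Nagios retry before it notifies. The following is only a sketch under assumptions: it assumes the check_snmp command definition passes $ARG1$ straight through to the plugin, that interval_length is the default 60 seconds, and that the values shown (30 seconds, 2 retries, 6 attempts) are illustrative and untested against this switch. It reduces the noise but does not explain what happens on the switch at 5:00 am.

```
define service{
    use                  generic-service
    host_name            GRS-BB01
    service_description  Ports G
    ; -t 30: plugin timeout in seconds (assumed value)
    ; -e 2:  number of SNMP retries (assumed value)
    check_command        check_snmp!-C TEST -o .1.3.6.1.2.1.2.2.1.8.28 -w 1 -c 1 -t 30 -e 2
    ; Notify only after 6 consecutive failed checks, rechecking every
    ; 2 minutes while the service is in a soft state (assumed values)
    max_check_attempts   6
    retry_interval       2
}
```

The plugin timeout should stay below service_check_timeout in nagios.cfg, otherwise Nagios kills the check before the plugin gives up on its own.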