Service check timed out after 60.01 seconds

Posted: Wed Oct 10, 2018 5:18 pm
by pbsindian
Hi Team,
The check_oracle_health command intermittently fails with a "Service check timed out after 60.01 seconds" error.

This could be happening because the Oracle database is under high load and might be taking more than a minute to respond to the tablespace check query. I read in some forums and added "-t 300" to the command but we are still getting time outs in 1 minute.

- How can we increase the timeout interval?
- How do we avoid getting alerts for timeouts? We have a polling cycle of 10 minutes, and we need to receive alerts only if the issue persists for 10 minutes. In this case, the alert comes after 1 minute, and it is not a genuine alert either.

/usr/bin/env LD_LIBRARY_PATH=/usr/lib/oracle/12.1/client64/lib ORACLE_HOME=/usr/lib/oracle/12.1/client64 $USER1$/check_oracle_health $ARG1$

where $ARG1$ is:
-t 300 --connect 'XXXXXXX' --username 'nagios' --password 'XXXXXX' --mode tablespace-usage --warning 80 --critical 90

Re: Service check timed out after 60.01 seconds

Posted: Thu Oct 11, 2018 10:35 am
by lmiltchev
pbsindian wrote: I read in some forums and added "-t 300" to the command but we are still getting time outs in 1 minute.
Increasing the plugin's timeout is not enough on its own: there is also a global option in the main Nagios config (nagios.cfg) that needs to be increased. By default, the service check timeout is set to 60 seconds:

service_check_timeout=60
You would need to increase this value, for example to match the 300 seconds passed to the plugin with "-t", and restart Nagios so that the change takes effect.
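
To illustrate how the two timeouts interact, here is a rough Python sketch that reads service_check_timeout from the text of nagios.cfg, reads the "-t" value from the plugin arguments, and reports which limit would be hit first. The function names are made up for this example, and it assumes the directive appears at most once in the file and that the plugin timeout is passed as "-t <seconds>", as in the command above. It is not part of Nagios or check_oracle_health.

```python
import shlex


def global_service_timeout(cfg_text, default=60):
    """Return service_check_timeout from nagios.cfg text (60 if not set)."""
    for line in cfg_text.splitlines():
        line = line.strip()
        # Skip comment lines and anything that is not a key=value directive.
        if line.startswith("#") or "=" not in line:
            continue
        key, _, raw = line.partition("=")
        if key.strip() == "service_check_timeout":
            return int(raw.strip())
    return default


def plugin_timeout(arg_string):
    """Return the number following -t in the plugin arguments, or None."""
    args = shlex.split(arg_string)
    for i, arg in enumerate(args[:-1]):
        if arg == "-t":
            return int(args[i + 1])
    return None


def limiting_timeout(cfg_text, arg_string):
    """Return (source, seconds) for whichever timeout is reached first."""
    nagios_limit = global_service_timeout(cfg_text)
    plugin_limit = plugin_timeout(arg_string)
    # Assumption: with no -t given, only the Nagios limit is considered here.
    if plugin_limit is None or nagios_limit <= plugin_limit:
        return ("nagios", nagios_limit)
    return ("plugin", plugin_limit)


if __name__ == "__main__":
    cfg = "service_check_timeout=60\n"
    args = "-t 300 --mode tablespace-usage --warning 80 --critical 90"
    print(limiting_timeout(cfg, args))
```

With the values from this thread (60 in nagios.cfg and "-t 300" on the plugin), the sketch reports the Nagios limit as the one reached first, which matches the timeout still occurring after 1 minute.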

Re: Service check timed out after 60.01 seconds

Posted: Thu Oct 11, 2018 12:20 pm
by pbsindian
Thank you. I have updated the global configuration and restarted Nagios.
I will monitor it and let you know how it goes.

Thanks,
Bhargava

Re: Service check timed out after 60.01 seconds

Posted: Thu Oct 11, 2018 1:35 pm
by lmiltchev
Sure, let us know if you have any further questions.

Re: Service check timed out after 60.01 seconds

Posted: Mon Oct 15, 2018 2:54 pm
by pbsindian
That fixed the issue. Thanks a lot.

Re: Service check timed out after 60.01 seconds

Posted: Mon Oct 15, 2018 3:10 pm
by lmiltchev
I am glad I could help! :)