Page 1 of 2
Problems with check_oracle_health Tablespace check
Posted: Tue Sep 04, 2018 8:41 am
by nagiosEngie
Hello Nagios Crew,
I am having some problems with checking my Oracle DB's used tablespace.
Some DB's that are particularly big will take a long time to answer (over 60 sec) and I will get a critical alarm on timeout.
Our DBA has optimized to the maximum the nagios query on the DB server but we still go in timeout.
Is there a way to speed things up? Maybe parallel execution or other?
Thanks
Sandro
Re: Problems with check_oracle_health Tablespace check
Posted: Tue Sep 04, 2018 10:09 am
by mcapra
A previous post I made on this topic:
https://support.nagios.com/forum/viewto ... 77#p260977
In my opinion, the more optimal solution would be to break this particular check out of the main Nagios Core process by submitting it passively.
More info:
https://assets.nagios.com/downloads/nag ... hecks.html
https://assets.nagios.com/downloads/nag ... ios-XI.pdf
Re: Problems with check_oracle_health Tablespace check
Posted: Tue Sep 04, 2018 1:58 pm
by npolovenko
Re: Problems with check_oracle_health Tablespace check
Posted: Wed Sep 05, 2018 4:30 am
by nagiosEngie
Hello All,
thanks for the great suggestions. So at the moment I checked the version of the script I have: ad it was quite old. So I downloaded from
https://labs.consol.de/nagios/check_ora ... index.html
the newest version and tested it. It's performance is quite improved. I now get answers under 30 sec on the biggest DB I have.
I will be keeping an eye on this and give feed back in a few days.
Thanks
Sandro
Re: Problems with check_oracle_health Tablespace check
Posted: Wed Sep 05, 2018 10:40 am
by npolovenko
@nagiosEngie, Sounds good! Thanks for the update.
Re: Problems with check_oracle_health Tablespace check
Posted: Thu Sep 06, 2018 9:16 am
by nagiosEngie
Well, I wish it was that easy.
I now get the critical 60.01 seconds timeout al least once an hour.
My service_check_timeout=120 is set to 2 minutes but still no way of solving this.
I tried the -t option in the check_oracle_health script but it seams to not work.
SAndro
Re: Problems with check_oracle_health Tablespace check
Posted: Thu Sep 06, 2018 10:59 am
by lmiltchev
Have you restarted nagios service after setting up:
in the nagios.cfg file, so that changes can take effect?
Also, can you time your check while using the "-t" flag to see when it fails?
Example:
Code: Select all
time /usr/local/nagios/libexec/check_oracle_health <the rest of the command> -t 120
Re: Problems with check_oracle_health Tablespace check
Posted: Wed Sep 19, 2018 2:36 am
by nagiosEngie
Hello I set the service_check_timeout and restarted nagios.
What dose exactly the -t option do?
Is it the timeout for the command? Something like wait for a maximum of xx seconds before timeout?
Sandro
Re: Problems with check_oracle_health Tablespace check
Posted: Wed Sep 19, 2018 9:23 am
by lmiltchev
Yes, it is the timeout option for the plugin.
Quote from the plugin's usage (help) menu:
[root@main-nagios-xi ~]# /usr/local/nagios/libexec/check_oracle_health -h
Copyright (c) 2008 Gerhard Lausser
Check various parameters of Oracle databases
Usage:
check_oracle_health [-v] [-t <timeout>] --connect=<connect string>
--username=<username> --password=<password> --mode=<mode>
--tablespace=<tablespace>
check_oracle_health [-h | --help]
check_oracle_health [-V | --version]
Re: Problems with check_oracle_health Tablespace check
Posted: Fri Sep 21, 2018 2:26 am
by nagiosEngie
Hello lmiltchev,
I have tried to set the -t option of the check_oracle_health script to 300. And the global nagios timeout (service_check_timeout=120)to 120.
Check will timeout in 120 sec anyway. image oracle01.jpg
As you can see the check fails and goes in time out after 120 sec instead of 300. why dose this happen?
As always Thankyou
Sandro