Problems with check_oracle_health Tablespace check

Posted: Tue Sep 04, 2018 8:41 am
by nagiosEngie
Hello Nagios Crew,

I am having some problems checking the used tablespace on my Oracle DBs.
Some particularly large DBs take a long time to answer (over 60 sec), and I get a critical alarm on timeout.

Our DBA has optimized the Nagios query on the DB server as far as possible, but we still hit the timeout.
Is there a way to speed things up? Maybe parallel execution or something else?

Thanks
Sandro

Re: Problems with check_oracle_health Tablespace check

Posted: Tue Sep 04, 2018 10:09 am
by mcapra
A previous post I made on this topic:
https://support.nagios.com/forum/viewto ... 77#p260977

In my opinion, the better solution would be to take this particular check out of the main Nagios Core process by submitting it passively.

More info:
https://assets.nagios.com/downloads/nag ... hecks.html
https://assets.nagios.com/downloads/nag ... ios-XI.pdf
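
As a rough illustration of the passive approach suggested above: the slow check can be run outside Nagios (for example from cron) and its result written to the external command file as a PROCESS_SERVICE_CHECK_RESULT line. This is only a minimal sketch. The command file path is the usual default for a source install and is an assumption here, and format_passive_result, submit_check, the host name and the service description are hypothetical names chosen for the example.

```shell
#!/bin/sh
# Sketch: submit a check result to Nagios Core passively.

# ASSUMPTION: default command file location of a source install.
# Check the command_file setting in your nagios.cfg.
CMD_FILE="${CMD_FILE:-/usr/local/nagios/var/rw/nagios.cmd}"

# Hypothetical helper: build one PROCESS_SERVICE_CHECK_RESULT line.
# Arguments: host name, service description, return code, plugin output.
format_passive_result() {
    printf '[%s] PROCESS_SERVICE_CHECK_RESULT;%s;%s;%s;%s\n' \
        "$(date +%s)" "$1" "$2" "$3" "$4"
}

# Hypothetical helper: run a command, keep its exit code and the first
# line of its output, and append the result to the command file.
# Arguments: host name, service description, then the command to run.
submit_check() {
    host=$1
    service=$2
    shift 2
    rc=0
    output=$("$@" 2>&1) || rc=$?
    first_line=$(printf '%s\n' "$output" | head -n 1)
    format_passive_result "$host" "$service" "$rc" "$first_line" >> "$CMD_FILE"
}

# Example call (host and service names are placeholders):
# submit_check dbhost01 "Oracle Tablespace" \
#     /usr/local/nagios/libexec/check_oracle_health <the rest of the command>
```

The service in Nagios would need passive checks enabled, and the host name and service description must match the configured objects exactly. Because the check no longer runs under the Nagios scheduler, service_check_timeout does not apply to it.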

Re: Problems with check_oracle_health Tablespace check

Posted: Tue Sep 04, 2018 1:58 pm
by npolovenko
Thanks, @mcapra!

Re: Problems with check_oracle_health Tablespace check

Posted: Wed Sep 05, 2018 4:30 am
by nagiosEngie
Hello All,
Thanks for the great suggestions. I checked the version of the script I have, and it was quite old. So I downloaded the newest version from https://labs.consol.de/nagios/check_ora ... index.html
and tested it. Its performance is much improved. I now get answers in under 30 sec on the biggest DB I have.
I will keep an eye on this and give feedback in a few days.

Thanks
Sandro

Re: Problems with check_oracle_health Tablespace check

Posted: Wed Sep 05, 2018 10:40 am
by npolovenko
@nagiosEngie, Sounds good! Thanks for the update.

Re: Problems with check_oracle_health Tablespace check

Posted: Thu Sep 06, 2018 9:16 am
by nagiosEngie
Well, I wish it was that easy.
I now get the critical 60.01 seconds timeout at least once an hour.
My service_check_timeout is set to 120 (2 minutes), but that still does not solve it.

I tried the -t option of the check_oracle_health script, but it does not seem to work.

Sandro

Re: Problems with check_oracle_health Tablespace check

Posted: Thu Sep 06, 2018 10:59 am
by lmiltchev
Have you restarted the nagios service after setting:

Code:

service_check_timeout=120
in the nagios.cfg file, so that changes can take effect?

Code:

service nagios restart
Also, can you time your check while using the "-t" flag to see when it fails?
Example:

Code:

time /usr/local/nagios/libexec/check_oracle_health <the rest of the command> -t 120 

Re: Problems with check_oracle_health Tablespace check

Posted: Wed Sep 19, 2018 2:36 am
by nagiosEngie
Hello, I set the service_check_timeout and restarted nagios.
What exactly does the -t option do?

Is it the timeout for the command? Something like: wait for a maximum of xx seconds before timing out?

Sandro

Re: Problems with check_oracle_health Tablespace check

Posted: Wed Sep 19, 2018 9:23 am
by lmiltchev
Yes, it is the timeout option for the plugin.

Quote from the plugin's usage (help) menu:
[root@main-nagios-xi ~]# /usr/local/nagios/libexec/check_oracle_health -h

Copyright (c) 2008 Gerhard Lausser

Check various parameters of Oracle databases

Usage:
  check_oracle_health [-v] [-t <timeout>] --connect=<connect string>
      --username=<username> --password=<password> --mode=<mode>
      --tablespace=<tablespace>
  check_oracle_health [-h | --help]
  check_oracle_health [-V | --version]

Re: Problems with check_oracle_health Tablespace check

Posted: Fri Sep 21, 2018 2:26 am
by nagiosEngie
Hello lmiltchev,

I have set the -t option of the check_oracle_health script to 300 and the global Nagios timeout to 120 (service_check_timeout=120).
The check still times out after 120 sec. (Attached image: oracle01.jpg)

As you can see, the check fails and times out after 120 sec instead of 300. Why does this happen?

As always, thank you
Sandro
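
A note on the 120 vs. 300 behaviour described above: the plugin's -t and the service_check_timeout in nagios.cfg are two separate limits. Nagios Core stops any actively scheduled service check that runs longer than service_check_timeout, regardless of what the plugin was told with -t, so the lower of the two values is the one that takes effect first. A minimal sketch of that rule follows; effective_timeout is a hypothetical helper written for this illustration and is not part of Nagios or of check_oracle_health.

```shell
#!/bin/sh
# Hypothetical helper: given the plugin's -t value and the
# service_check_timeout from nagios.cfg (both in seconds),
# print the limit that will be reached first.
effective_timeout() {
    plugin_t=$1
    core_t=$2
    if [ "$plugin_t" -lt "$core_t" ]; then
        echo "$plugin_t"
    else
        echo "$core_t"
    fi
}

# Values from the post above: -t 300 and service_check_timeout=120.
effective_timeout 300 120   # prints 120
```

Under this reading, a -t value above service_check_timeout has no visible effect on an active check; either the global limit has to be raised as well, or the check has to run outside the scheduler.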