
check_oracle_health timed out

Posted: Mon Dec 18, 2017 6:20 am
by n8860104460
Team,

We have set up check_oracle_health monitoring for connection time, tablespace usage, and tnsping.

When connecting to the DB, the tablespace check returns the error 'UNKNOWN - check_oracle_health timed out after 1800 seconds'. The check should connect within 15 seconds, but because this DB has huge tablespaces we raised the timeout to 1800 seconds, and it still fails to fetch the data.


--- Able to connect to the DB.

./check_oracle_health -t 5 --connect mc030000000:1512/test --username nagios --password '***' --mode connection-time --warning 1 --critical 2
WARNING - 1.62 seconds to connect as NAGIOS | connection_time=1.6184;1;2

--- tns ping is also working fine.

./check_oracle_health -t 15 --connect mc0300:00001512/test --username nagios --password '*****' --warning 70 --critical 80 --mode tnsping
OK - connection established to mc0300ux006:1531/ISRVE.

--- Tablespace timed out.

./check_oracle_health -t 1800 --connect mc03000000:1512/test --username nagios --password '****' --warning 95 --critical 96 --mode tablespace-usage
UNKNOWN - check_oracle_health timed out after 1800 seconds

Ping, telnet, and the NRPE client are also working fine.

Please advise if any other details are required to troubleshoot this further.

Re: check_oracle_health timed out

Posted: Mon Dec 18, 2017 9:00 am
by n8860104460
Team,

Adding to that:

When I tried with 3600 seconds, it returned the tablespace output. But since the output is huge:

* Can we do something to reduce the time needed to collect the data?
* If not, is there a way to filter it so that it shows output for selected tablespaces only?

Re: check_oracle_health timed out

Posted: Mon Dec 18, 2017 12:24 pm
by mcapra
I'm not much of an Oracle guy, but this:
n8860104460 wrote:* Or if No, then is there a way through which we filter it and it will show us selected tablespace output only.
Is addressed in the check_oracle_health documentation:
https://labs.consol.de/nagios/check_ora ... index.html
--tablespace
With this you can limit the check to a single tablespace. If this parameter is omitted all tablespaces are checked.
--datafile
With this you can limit the check to a single datafile. If this parameter is omitted all datafiles are checked.
--name
Here the check can be limited to a single object (Latch, Enqueue, Tablespace, Datafile). If this parameter is omitted all objects are checked. (Instead of --tablespace or --datafile this parameter can and should be used. It serves the purpose of standardizing the CLI interface.)

Code:

nagios$ check_oracle_health --mode tablespace-usage --tablespace USERS
OK - tbs USERS usage is 8.75% |
  'tbs_users_usage_pct'=8%;90;98
  'tbs_users_usage'=0MB;4;4;0;5

nagios$ check_oracle_health --mode tablespace-usage --name USERS
OK - tbs USERS usage is 8.75% |
  'tbs_users_usage_pct'=8%;90;98
  'tbs_users_usage'=0MB;4;4;0;5

nagios$ check_oracle_health --mode tablespace-free --name TEST
OK - tbs TEST has 97.91% free space left |
    'tbs_test_free_pct'=97.91%;5:;2:
    'tbs_test_free'=32083MB;1638.40:;655.36:;0.00;32767.98
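
Applied to the command from the original post, the --name filter could be used to run one short check per tablespace instead of a single check over all of them. The sketch below is a dry run: it only prints the commands and does not touch the database. The tablespace names (USERS, SYSTEM), the 60-second timeout, and the build_check helper are assumptions for illustration; the connect string, thresholds, and masked password are copied from the original post.

```shell
#!/bin/sh
# Dry-run sketch: print one tablespace-usage check per tablespace.
# Nothing is executed against the database.

# Assumptions (not from the thread): tablespace names and the 60 s timeout.
TABLESPACES="USERS SYSTEM"
TIMEOUT=60

# Connection details as given in the original post.
CONNECT='mc03000000:1512/test'

# build_check is a hypothetical helper; $1 is the tablespace name.
# --name limits the check to that single object.
build_check() {
    printf '%s\n' "./check_oracle_health -t $TIMEOUT --connect $CONNECT --username nagios --password '****' --warning 95 --critical 96 --mode tablespace-usage --name $1"
}

for ts in $TABLESPACES; do
    build_check "$ts"
done
```

Each printed line could then be defined as its own service check, so a slow tablespace only delays its own result. Whether this actually shortens the run depends on how the plugin queries a single tablespace, which I have not verified.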

Re: check_oracle_health timed out

Posted: Mon Dec 18, 2017 3:34 pm
by kyang
Thanks for the help @mcapra!

Take a look at mcapra's suggestion. The documentation should list what the plugin can or cannot do.

Let us know if you have any more questions.