Oracle Tablespace monitoring - timed out

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
amane
Posts: 77
Joined: Thu Jan 18, 2018 9:53 am

Oracle Tablespace monitoring - timed out

Post by amane »

Hi,

We are monitoring Oracle tablespaces for one of the server and status for the tablespaces metric status continuously flapping from ok to unknown.
please find below the error which occurred when metric status going to unknown state.
UNKNOWN - check_oracle_health timed out after 180 seconds

We have increased the timeout to 300 even though status not changed for these tablespaces.

Oracle Client 12.2 is running on nagios instance.
nagios Oracle plugin version for tablespace - 1.5.3

Could you please help me to resolve this issue.

Thanks & Regards,
Avinash
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Oracle Tablespace monitoring - timed out

Post by tgriep »

That version of the plugin is very old, can you upgrade it to the latest version at this link?
https://labs.consol.de/nagios/check_ora ... index.html

Can you run the command in a shell and increase the timeout value to 3000 seconds and does the plugin run to completion?
If so, how long does it take?
Be sure to check out our Knowledgebase for helpful articles and solutions!
amane
Posts: 77
Joined: Thu Jan 18, 2018 9:53 am

Re: Oracle Tablespace monitoring - timed out

Post by amane »

Thanks for the update tgriep,

Running Oracle plugin version is 3.1.2.2 on our Nagios XI servers.

[nagios@<server name> ~]$ /usr/local/nagios/libexec/check_oracle_health -V
check_oracle_health (3.1.2.2)

We ran command in shell on nagios instance and within 3-4 min we got the output. We ran the command three times and all the time command completed and shown the output.

Thanks & Regards,
Avinash
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Oracle Tablespace monitoring - timed out

Post by tgriep »

Since the plugin could take up to 4 minutes to run, then you will have to increase the timeout for the check_oracle_health command as well as the service timeout in the nagios.cfg file to a time that is longer than 4 minutes.
Try doing that and see if this helps out in the issue.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Locked