Page 1 of 3

monitor PMON and listner

Posted: Thu Jun 05, 2014 5:37 am
by cyient
Hi,

We are looking to monitor PMON and listner in a Oracle server but unable to find the plugins.

Any idea about this ?

Re: monitor PMON and listner

Posted: Thu Jun 05, 2014 8:15 am
by scottwilkerson
This listing says it can monitor the PMON process
http://exchange.nagios.org/directory/Pl ... 29/details

Re: monitor PMON and listner

Posted: Tue Jun 10, 2014 7:41 am
by cyient
check_pid.doc
Hi i have written a bash script for monitoring the process which we are looking for but i get timed out error when calling the script from nagios server through nrpe but when i call other check_load or check_user commands i get the output easily.

Is there any modification to be done to the script, i have attached the script for reference.

Please suggest on this.

Re: monitor PMON and listner

Posted: Tue Jun 10, 2014 1:28 pm
by abrist
Can you post the service and command definition for the check, as well as the nrpe.cfg command from the remote host?

Re: monitor PMON and listner

Posted: Wed Jun 11, 2014 8:25 am
by cyient
I'm executing the service through nrpe from nagios server as below

Code: Select all

/usr/local/nagios/libexec/check_nrpe -H 192.168.3.4 -c check_ora
in nrpe.cfg file on remote server command definition is as below

Code: Select all

command[check_ora]=/usr/local/nagios/libexec/check_pid -p ora_pmon_pcontrolqq

Re: monitor PMON and listner

Posted: Wed Jun 11, 2014 10:58 am
by abrist
You may have permission problems or timeout issues.
Run the following on the remote system:

Code: Select all

su - nagios -c "time /usr/local/nagios/libexec/check_pid -p ora_pmon_pcontrolqq"

Re: monitor PMON and listner

Posted: Thu Jun 12, 2014 7:12 am
by cyient
Here is the output of the command

Code: Select all

 su - nagios -c "time /usr/local/nagios/libexec/check_pid -p ora_pmon_pcontrolqq"
Password:
Usage: grep [OPTION]... PATTERN [FILE]...
Try `grep --help' for more information.
ora_pmon_pcontrolqq is running.

real    0m0.086s
user    0m0.005s
sys     0m0.026s

Re: monitor PMON and listner

Posted: Thu Jun 12, 2014 10:23 am
by sreinhardt
While I am not sure which one it is, as they are all valid, or so it seems one of your greps is failing. This may or may not cause the issue you are seeing, but should be resolved regardless. The standard error output may be causing the delay or pseudo delay that makes check_nrpe timeout. You might also try adding a -t 30 flag to extend the timeout even though it doesn't appear to need it.

Re: monitor PMON and listner

Posted: Fri Jun 13, 2014 11:45 pm
by cyient
I have tried using the timeout arg but it did not work.

Code: Select all

$ /usr/local/nagios/libexec/check_nrpe -H 192.168.3.4 -c check_ora -t 30
CHECK_NRPE: Socket timeout after 30 seconds.

Re: monitor PMON and listner

Posted: Mon Jun 16, 2014 9:09 am
by slansing
Have you verified that you can make an actual connection via the NRPE socket to that host? What is the output of:

Code: Select all

/usr/local/nagios/libexec/check_nrpe -H 192.168.3.4