Page 1 of 3
monitor PMON and listner
Posted: Thu Jun 05, 2014 5:37 am
by cyient
Hi,
We are looking to monitor PMON and listner in a Oracle server but unable to find the plugins.
Any idea about this ?
Re: monitor PMON and listner
Posted: Thu Jun 05, 2014 8:15 am
by scottwilkerson
Re: monitor PMON and listner
Posted: Tue Jun 10, 2014 7:41 am
by cyient
check_pid.doc
Hi i have written a bash script for monitoring the process which we are looking for but i get timed out error when calling the script from nagios server through nrpe but when i call other check_load or check_user commands i get the output easily.
Is there any modification to be done to the script, i have attached the script for reference.
Please suggest on this.
Re: monitor PMON and listner
Posted: Tue Jun 10, 2014 1:28 pm
by abrist
Can you post the service and command definition for the check, as well as the nrpe.cfg command from the remote host?
Re: monitor PMON and listner
Posted: Wed Jun 11, 2014 8:25 am
by cyient
I'm executing the service through nrpe from nagios server as below
Code: Select all
/usr/local/nagios/libexec/check_nrpe -H 192.168.3.4 -c check_ora
in nrpe.cfg file on remote server command definition is as below
Code: Select all
command[check_ora]=/usr/local/nagios/libexec/check_pid -p ora_pmon_pcontrolqq
Re: monitor PMON and listner
Posted: Wed Jun 11, 2014 10:58 am
by abrist
You may have permission problems or timeout issues.
Run the following on the remote system:
Code: Select all
su - nagios -c "time /usr/local/nagios/libexec/check_pid -p ora_pmon_pcontrolqq"
Re: monitor PMON and listner
Posted: Thu Jun 12, 2014 7:12 am
by cyient
Here is the output of the command
Code: Select all
su - nagios -c "time /usr/local/nagios/libexec/check_pid -p ora_pmon_pcontrolqq"
Password:
Usage: grep [OPTION]... PATTERN [FILE]...
Try `grep --help' for more information.
ora_pmon_pcontrolqq is running.
real 0m0.086s
user 0m0.005s
sys 0m0.026s
Re: monitor PMON and listner
Posted: Thu Jun 12, 2014 10:23 am
by sreinhardt
While I am not sure which one it is, as they are all valid, or so it seems one of your greps is failing. This may or may not cause the issue you are seeing, but should be resolved regardless. The standard error output may be causing the delay or pseudo delay that makes check_nrpe timeout. You might also try adding a -t 30 flag to extend the timeout even though it doesn't appear to need it.
Re: monitor PMON and listner
Posted: Fri Jun 13, 2014 11:45 pm
by cyient
I have tried using the timeout arg but it did not work.
Code: Select all
$ /usr/local/nagios/libexec/check_nrpe -H 192.168.3.4 -c check_ora -t 30
CHECK_NRPE: Socket timeout after 30 seconds.
Re: monitor PMON and listner
Posted: Mon Jun 16, 2014 9:09 am
by slansing
Have you verified that you can make an actual connection via the NRPE socket to that host? What is the output of:
Code: Select all
/usr/local/nagios/libexec/check_nrpe -H 192.168.3.4