Issue monitoring Oracle PMONs using 'check_procs'
Posted: Tue Oct 13, 2020 1:00 pm
Team,
I've been attempting to troubleshoot an issue I'm seeing related to monitoring Oracle PMONs with XI. We are trying to monitor 4 different active PMONs, but the names are very similar and XI appears to be adding a wildcard at the end of PMON name when it passes the argument.
For example, we have a PMON named ora_pmon_npasdevcdb and we use the command 'check_procs' in XI to monitor that it's active:
For some reason 'check_procs' uses 'check_nrpe' to invoke 'check_active_procs' in order to pass the same variables that 'check_procs' would use:
/usr/local/nagios/etc/nrpe.cfg
Now the issue we're seeing is one of the DBAs has created a new PMON named ora_pmon_npasdevcdb19 and both PMONs need to be running on the host. But, for some reason, the service check for ora_pmon_npasdevcdb is seeing two PMONs running with that name even though one of them is appended with 19. It's almost as if XI is adding a wildcard to the end of the PMON name.
I need to know how to add a stop or a break to the end of the PMON name, so XI stops seeing ora_pmon_npasdevcdb19 as part of this check. It has it's own PMON check which works fine.
The service check will not allow me to use any of the switches in this document: https://nagios-plugins.org/doc/man/check_procs.html
It fails and says such and such variable needs to be an integer... which it is.
Thank you.
I've been attempting to troubleshoot an issue I'm seeing related to monitoring Oracle PMONs with XI. We are trying to monitor 4 different active PMONs, but the names are very similar and XI appears to be adding a wildcard at the end of PMON name when it passes the argument.
For example, we have a PMON named ora_pmon_npasdevcdb and we use the command 'check_procs' in XI to monitor that it's active:
Code: Select all
$USER1$/check_nrpe -2 -H $HOSTADDRESS$ -t 30 -c check_active_procs -a "$ARG1$"/usr/local/nagios/etc/nrpe.cfg
Code: Select all
command[check_active_procs]=/usr/local/nagios/libexec/check_procs -c $ARG1$ -a $ARG2$Code: Select all
[nagios@nagiosxiserver ~]$ /usr/local/nagios/libexec/check_nrpe -2 -H xx.xx.xx.xx -t 30 -c check_active_procs -a ""1:1 ora_pmon_npasdevcdb""
PROCS CRITICAL: 2 processes with args 'ora_pmon_npasdevcdb'The service check will not allow me to use any of the switches in this document: https://nagios-plugins.org/doc/man/check_procs.html
It fails and says such and such variable needs to be an integer... which it is.
Thank you.