monitoring with NRPE
I am using RHEL 6.1. I downloaded NRPE (linux-nrpe-agent.tar.gz) from the Nagios site to the remote monitored server. I am trying to use NRPE to monitor that server, but most if not all NRPE remote commands come back as "not defined" or "could not parse arguments". I have found only one remote command that works: "check_procs".
-
slansing
- Posts: 7698
- Joined: Mon Apr 23, 2012 4:28 pm
- Location: Travelling through time and space...
Re: monitoring with NRPE
Can you show us an example of a command that is failing, along with the console's full output?
check_procs most likely works because it does not require any arguments to return a value. It can take them, but by default it will just report the number of processes running on the system. We do know from this, though, that the NRPE agent is reachable by Nagios.
Re: monitoring with NRPE
Which remote commands have you tried? The linux-server-agent from the XI wizard should have the following check commands in the file "common.cfg":
Code: Select all
### GENERIC SERVICES ###
command[check_init_service]=sudo /usr/local/nagios/libexec/check_init_service $ARG1$
command[check_services]=/usr/local/nagios/libexec/check_services -p $ARG1$
### MISC SYSTEM METRICS ###
#command[check_users]=/usr/local/nagios/libexec/check_users -w 5 -c 10
command[check_users]=/usr/local/nagios/libexec/check_users $ARG1$
command[check_load]=/usr/local/nagios/libexec/check_load $ARG1$
command[check_swap]=/usr/local/nagios/libexec/check_swap $ARG1$
command[check_cpu_stats]=/usr/local/nagios/libexec/check_cpu_stats.sh $ARG1$
command[check_mem]=/usr/local/nagios/libexec/custom_check_mem -n $ARG1$
### YUM UPDATES ###
command[check_yum]=/usr/local/nagios/libexec/check_yum
### DISK ###
command[check_disk]=/usr/local/nagios/libexec/check_disk $ARG1$
command[check_ide_smart]=/usr/local/nagios/libexec/check_ide_smart $ARG1$
### PROCESSES ###
command[check_all_procs]=/usr/local/nagios/libexec/custom_check_procs
command[check_procs]=/usr/local/nagios/libexec/check_procs $ARG1$
### OPEN FILES ###
command[check_open_files]=/usr/local/nagios/libexec/check_open_files.pl $ARG1$
### NETWORK CONNECTIONS ###
command[check_netstat]=/usr/local/nagios/libexec/check_netstat.pl -p $ARG1$ $ARG2$
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
Re: monitoring with NRPE
slansing wrote: Can you show us an example of a command that is failing, along with the console's full output?
check_procs most likely works because it does not require any arguments to return a value. It can take them, but by default it will just report the number of processes running on the system. We do know from this, though, that the NRPE agent is reachable by Nagios.
Below are examples of some commands that are failing, along with the console's full output. I show the commands run from the Nagios XI server (generated by the wizard) and the command-line commands run on the remote monitored server.
Example 1:
--Nagios XI server:
COMMAND: /usr/local/nagios/libexec/check_nrpe -H myserver -t 30 -c check_users
OUTPUT:
Usage:check_users -w -c
--remote monitored server (myserver): tested in both /opt/nagios-3.2.2/libexec/ and /opt/nagios/libexec/
./check_users -w 5 -c 10
USERS OK - 4 users currently logged in |users=4;5;10;0
Example 2:
--Nagios XI server:
COMMAND: /usr/local/nagios/libexec/check_nrpe -H myserver -t 30 -c 5.0,4.0,3.0 10.0,6.0,4.0
OUTPUT: NRPE: Command '5.0,4.0,3.0' not defined
--remote monitored server (myserver): tested in both /opt/nagios-3.2.2/libexec/ and /opt/nagios/libexec/
./check_load -w 15,10,5 -c 30,25,20
OK - load average: 0.27, 0.23, 0.17|load1=0.270;15.000;30.000;0; load5=0.230;10.000;25.000;0; load15=0.170;5.000;20.000;0;
Example 3 (THIS ONE WORKS):
--Nagios XI server:
COMMAND: /usr/local/nagios/libexec/check_nrpe -H myserver -t 30 -c check_procs
OUTPUT: PROCS OK: 298 processes
--remote monitored server (myserver): tested in both /opt/nagios-3.2.2/libexec/ and /opt/nagios/libexec/
./check_procs
PROCS OK: 296 processes
Re: monitoring with NRPE
I cannot find or locate the file "common.cfg" on the server.
Some of the remote commands I have tried are:
check_apt check_dns check_mailq check_swap check_breeze check_ifoperstatus check_mem.pl check_by_ssh check_ntp_time check_time
check_clamd check_cluster check_fping check_nagios check_dhcp check_ftp check_users check_dig check_ping check_disk check_http check_load check_nrpe check_ssh check_procs
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
Re: monitoring with NRPE
jtorre08 wrote: Can not find /locate file "common.cfg" on server.
It should be in /usr/local/nagios/etc/nrpe/
Re: monitoring with NRPE
scottwilkerson wrote:
jtorre08 wrote: Can not find /locate file "common.cfg" on server.
It should be in /usr/local/nagios/etc/nrpe/
I found the file "common.cfg" on the remote monitored server. Yes, it has exactly the same commands as listed in your first response. The file does not exist on the Nagios XI server.
Re: monitoring with NRPE
jtorre08 wrote: Example 1:
--Nagios XI server:
COMMAND: /usr/local/nagios/libexec/check_nrpe -H myserver -t 30 -c check_users
OUTPUT:
Usage:check_users -w -c
This command expects warning and critical thresholds to be passed to it. You will need to add the warning and critical thresholds to the check in XI for $ARG2$:
Code: Select all
$ARG2$: -a '-w 5 -c 10'
jtorre08 wrote: Example 2:
--Nagios XI server:
COMMAND: /usr/local/nagios/libexec/check_nrpe -H myserver -t 30 -c 5.0,4.0,3.0 10.0,6.0,4.0
OUTPUT: NRPE: Command '5.0,4.0,3.0' not defined
This command is missing the actual command name. Try:
Code: Select all
COMMAND: /usr/local/nagios/libexec/check_nrpe -H myserver -t 30 -c check_load
$ARG2$: -a '-w 15,10,5 -c 30,25,20'
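To make the role of the -a option more concrete, here is a minimal sketch of how the $ARG1$, $ARG2$, ... macros in a "common.cfg" command definition get filled from the values given after -a. The function expand_nrpe_command is a hypothetical helper written only for illustration; it is not part of NRPE, and the real agent's parsing may differ in its details.

```shell
#!/bin/sh
# expand_nrpe_command TEMPLATE [ARG...]
# Hypothetical helper (not part of NRPE): replaces $ARG1$, $ARG2$, ... in
# TEMPLATE with the given arguments, in order, and prints the result.
# The function body is a subshell, so its variables do not leak out.
expand_nrpe_command() (
    template=$1
    shift
    i=1
    for arg in "$@"; do
        macro="\$ARG${i}\$"          # literal text such as $ARG1$
        out=''
        rest=$template
        # Copy everything before each occurrence of the macro, then the value.
        while case $rest in *"$macro"*) true ;; *) false ;; esac; do
            out=$out${rest%%"$macro"*}$arg
            rest=${rest#*"$macro"}
        done
        template=$out$rest
        i=$((i + 1))
    done
    printf '%s\n' "$template"
)

# Definition as listed in common.cfg earlier in this thread.
definition='/usr/local/nagios/libexec/check_users $ARG1$'

# Thresholds supplied: everything inside the quotes becomes $ARG1$.
expand_nrpe_command "$definition" '-w 5 -c 10'

# Assumption for illustration: an empty $ARG1$ leaves the plugin with no
# thresholds, which would explain the "Usage:" output in Example 1.
expand_nrpe_command "$definition" ''
```

The point of the sketch is that the quoted string after -a is a single argument, so '-w 5 -c 10' lands in $ARG1$ as a whole rather than being split across several macros.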
Re: monitoring with NRPE
Yes, the "common.cfg" must be on the client machine. Open it, and see what commands you have defined there. For example, you might have:
Code: Select all
command[check_users]=/usr/local/nagios/libexec/check_users $ARG1$
From the Nagios server, you can use this command by running:
Code: Select all
cd /usr/local/nagios/libexec/
./check_nrpe -H <client_IP> -t 30 -c check_users -a '-w 2 -c 5'
Build your other checks in a similar way. You can view usage by running the plugin with the "-h" flag, for example:
Code: Select all
./check_users -h
check_users v1.4.16 (nagios-plugins 1.4.16)
Copyright (c) 1999 Ethan Galstad
Copyright (c) 2000-2007 Nagios Plugin Development Team
<[email protected]>
This plugin checks the number of users currently logged in on the local
system and generates an error if the number exceeds the thresholds specified.
Usage:
check_users -w <users> -c <users>
Options:
-h, --help
Print detailed help screen
-V, --version
Print version information
-w, --warning=INTEGER
Set WARNING status if more than INTEGER users are logged in
-c, --critical=INTEGER
Set CRITICAL status if more than INTEGER users are logged in
Is there a command that is defined in the "common.cfg" file on the client, but that you are not able to set up a check in XI with?
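As a quick way to see which command names the agent actually knows about, something like the following sketch could be run on the client. The function names list_nrpe_commands and nrpe_command_defined are made up for this example, and the sample file stands in for the real "common.cfg" (reported earlier in the thread to live under /usr/local/nagios/etc/nrpe/). Any name that is not listed would be expected to come back from check_nrpe as "not defined".

```shell
#!/bin/sh
# list_nrpe_commands FILE
# Hypothetical helper: prints the name of every active (uncommented)
# command[...] definition in an NRPE config file, one per line.
list_nrpe_commands() {
    sed -n 's/^[[:space:]]*command\[\([^]]*\)][[:space:]]*=.*/\1/p' "$1"
}

# nrpe_command_defined FILE NAME
# Succeeds only if NAME is defined in FILE (exact, literal match).
nrpe_command_defined() {
    list_nrpe_commands "$1" | grep -qxF -- "$2"
}

# Demo against a small sample file; point the functions at the real
# common.cfg on the monitored server instead.
sample=$(mktemp)
cat > "$sample" <<'EOF'
#command[check_users]=/usr/local/nagios/libexec/check_users -w 5 -c 10
command[check_users]=/usr/local/nagios/libexec/check_users $ARG1$
command[check_load]=/usr/local/nagios/libexec/check_load $ARG1$
EOF

list_nrpe_commands "$sample"
nrpe_command_defined "$sample" check_dns || echo "check_dns is not defined"
rm -f "$sample"
```

Commented-out definitions are skipped on purpose, since only active lines are available to the agent.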