Monitoring Wizard for solaris 10

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
qcmonitor
Posts: 18
Joined: Wed Feb 03, 2010 12:14 pm

Monitoring Wizard for solaris 10

Post by qcmonitor »

Hello

Is there a way to have a monitoring wizard to monitor a Solaris 10 box as we do for a Windows server?
NRPE is installed on the solaris box and we would like to monitor disk space, running services / processes...

Thanks

Chris
tonyyarusso
Posts: 1128
Joined: Wed Mar 03, 2010 12:38 pm
Location: St. Paul, MN, USA
Contact:

Re: Monitoring Wizard for solaris 10

Post by tonyyarusso »

Probably, although we've only just started trying to work with Solaris. There's a decent chance that the Linux wizard ( http://exchange.nagios.org/directory/Ad ... rd/details ) will also work with Solaris (at least mostly), so what I would suggest is giving that a shot and letting us know what does or doesn't work.
Tony Yarusso
Technical Services
___
TIES
Web: http://ties.k12.mn.us/
qcmonitor
Posts: 18
Joined: Wed Feb 03, 2010 12:14 pm

Re: Monitoring Wizard for solaris 10

Post by qcmonitor »

I will then try to use the Linux template and let you know.

Thanks

Chris
qcmonitor
Posts: 18
Joined: Wed Feb 03, 2010 12:14 pm

Re: Monitoring Wizard for solaris 10

Post by qcmonitor »

Hello

So I downloaded the wizard for Linux. I first try to install the agent on a linux machine, then use this wizard to monitor the linux box. Everything is working well to monitor this linux box.
So I installed NRPE on the Solaris box according to this procedure: http://www.askarali.org/nrpe_solaris10.html
I modified the nrpe.cfg file and put our Nagios server IP address: allowed_hosts=192.168.1.10

I tried to use the linux wizard but each monitored items, except ping, give the following error message:
CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages

When I try to run the command manually:
# ./check_nrpe -H 192.168.1.26 -t 30 -n -c check_disk -a '-w 20% -c 10% -p /u01'
CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages.

I try with and without -n in the command.
Also try the following without success:
# /usr/local/nagios/bin/nrpe -n -c /usr/local/nagios/etc/nrpe.cfg -d

Any help will be appreciated to monitor a Solaris box with Nagios xi (Disk space, CPU, Memory, processes, Oracle...)

Best Regards

Chris
tonyyarusso
Posts: 1128
Joined: Wed Mar 03, 2010 12:38 pm
Location: St. Paul, MN, USA
Contact:

Re: Monitoring Wizard for solaris 10

Post by tonyyarusso »

The wizard also requires that the following option be set in addition to allowed_hosts:

dont_blame_nrpe = 1
Tony Yarusso
Technical Services
___
TIES
Web: http://ties.k12.mn.us/
qcmonitor
Posts: 18
Joined: Wed Feb 03, 2010 12:14 pm

Re: Monitoring Wizard for solaris 10

Post by qcmonitor »

Thanks for this information.
So I changed this parameter in the nrpe,cfg file and restarted the nrpe service.

Now the previous error message has disappeared, but I have new ones:

For most of the commands (check_disk, check_cpu_stats, check_swap, check_procs, check_open_files, check_mem, check_open_files, check_init_service) it returns the following error:
NRPE: Command 'check_disk' not defined

Only three monitored parameters are OK: Load, Ping and Users.

Any idea on how to proceed to monitor all failed checks?

Thanks

Chris
tonyyarusso
Posts: 1128
Joined: Wed Mar 03, 2010 12:38 pm
Location: St. Paul, MN, USA
Contact:

Re: Monitoring Wizard for solaris 10

Post by tonyyarusso »

These definitions can be found in one of the files shipped with the Agent tarball:

linux-nrpe-agent/subcomponents/nrpe/mods/cfg/nrpe/common.cfg

Place that in /etc/nagios/nrpe.d/ (or whatever is appropriate for Solaris - probably the same), restart NRPE, and your checks should work.
Tony Yarusso
Technical Services
___
TIES
Web: http://ties.k12.mn.us/
qcmonitor
Posts: 18
Joined: Wed Feb 03, 2010 12:14 pm

Re: Monitoring Wizard for solaris 10

Post by qcmonitor »

Hello

With your help and a few articles on the Internet, I finally managed to monitor my Solaris server.
So, I have copied the common.cfg file to a new directory (I created /opt/csw/etc/nrpe), gave permission to user nagios on this directory and file.
Then I needed to edit this file and put the right path for each scripts. I my case it was /opt/csw/libexec/nagios-plugins/<scriptname>
Some scripts for Linux are not available on Solaris so I just commented out the script in the common.cfg and have added those made for Solaris (scripts installed with the nrpe plugin and those downloaded and manually copied).

I aslo needed to edit the services from the Nagios interface and changed the name of the service ($ARG1) and in some cases, the $ARG2 line also according to the plugin.

Many thanks again for your help

Chris
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Monitoring Wizard for solaris 10

Post by mguthrie »

Thanks qc for posting your steps for this. I'm sure you're not the last person who will need to do this, and it helps us out a lot for future references.
Locked