Best practices: Monitoring multiple windows services
Posted: Wed Jun 24, 2015 10:59 pm
I have successfully set up monitoring multiple Windows services with NRPE using the command below. We have several server "types" we group our servers in (i.e. Citrix, Web, Database, etc.) which will each have their own bag of services we want to watch. Easy enough to configure and manage with services applied to host groups for each type.
My questions are:
1. Is there a better way to do this? We want to stick to NRPE strictly based on what we've read and been told about it's flexibility, but we're open to suggestions. We're new! So we are probably doing things wrong! We have an environment that is a mix of Linux and Windows servers, which I believe was another reason NRPE was recommended.
2. My impression is it's better to cram all of the individual Windows services we can into one Nagios service to ultimately keep the Nagios service count down. Possibly being more efficient? Does that make sense?
3. If that does make sense, is there any way to alarm on Windows services differently (i.e. Critical vs. Warning) if all of the services are being watched by the same Nagios service? So in my command example above, can I alarm as a warning for a stopped state for service1, but as a critical for a stopped state for services 2 and 3? All while only using one Nagios service definition?
Thanks!
Code: Select all
./check_nrpe -H <hostname or IP> -t 30 -c check_service -a service=<service1> service=<service2> service=<service3> 'crit=not state_is_ok()'
OK: All 3 service(s) are ok.1. Is there a better way to do this? We want to stick to NRPE strictly based on what we've read and been told about it's flexibility, but we're open to suggestions. We're new! So we are probably doing things wrong! We have an environment that is a mix of Linux and Windows servers, which I believe was another reason NRPE was recommended.
2. My impression is it's better to cram all of the individual Windows services we can into one Nagios service to ultimately keep the Nagios service count down. Possibly being more efficient? Does that make sense?
3. If that does make sense, is there any way to alarm on Windows services differently (i.e. Critical vs. Warning) if all of the services are being watched by the same Nagios service? So in my command example above, can I alarm as a warning for a stopped state for service1, but as a critical for a stopped state for services 2 and 3? All while only using one Nagios service definition?
Thanks!