NRPE: Check drive size in active/passive cluster
Posted: Tue Aug 04, 2015 4:02 pm
Hi guys!
Environment:
Nagios XI 2014R2.7
NSClient++ 0.4.3.143
Mostly NRPE checks
We have several active/passive Windows clusters that serve various roles (file shares, MS SQL, etc.). I am setting up NRPE checks per what I've read online, which is to direct any check for a shared service to the cluster hostname/VIP instead of either of the nodes. Here are my questions/issues:
1. Is this the only way to follow shared services between nodes? Scrap that - what is the best way to do this?
2. I have no issues with check_cpu or check_service when pointed at the cluster, but check_drivesize is giving me trouble. I should actually be good to go if I can get this to work. Here is the command I'm running on the Nagios server to test the check:
Code:
$ ./check_nrpe -H [cluster hostname] -t 30 -c check_drivesize
Pretty simple. Here's what it returns:
Code:
CHECK_NRPE: Invalid packet type received from server.
Googling led me to suggestions for enabling arguments or making other changes in nsclient.ini, but all of that is already enabled. The same nsclient.ini config and this check work fine on any other server.
If I try
Code:
$ ./check_nrpe -H [cluster hostname] -t 30 -c check_drivesize -a drive=*
Same Invalid packet type error.
If I try (local OS drive)
Code:
$ ./check_nrpe -H [cluster hostname] -t 30 -c check_drivesize -a drive=c:
Code:
OK All 1 drive(s) are ok|'c: used'=130.36176GB;223.41249;251.33905;0;279.26562 'c: used %'=46%;79;89;0;100
Success!
If I try (cluster drive presented from SAN)
Code:
$ ./check_nrpe -H [cluster hostname] -t 30 -c check_drivesize -a drive=o:
Code:
OK All 1 drive(s) are ok|'o: used'=32.76171MB;791.99687;890.99648;0;989.99609 'o: used %'=3%;79;89;0;100
Success!
I'm trying to avoid defining all drives by name, as the drive letters change across clusters. Our other check_drivesize definitions check all local drives with the command below, but it also returns the Invalid packet type error when run against a cluster:
Code:
$ ./check_nrpe -H [cluster hostname] -t 30 -c check_drivesize -a "drive regexp '.*[C-Z].*'" 'warn=free lt 20%' 'crit=free lt 10%'
Any ideas?
Thanks!
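In case it helps with reproducing this, the single-argument tests above could be wrapped in a small script along these lines. This is only a sketch: `CHECK_NRPE` and `run_drive_checks` are names made up for the example, and it passes just one `-a` argument per run, so the regexp test with warn/crit thresholds would still have to be run by hand.

```shell
#!/bin/sh
# Sketch only: CHECK_NRPE and run_drive_checks are hypothetical names.
# Runs check_drivesize once with no arguments and then once per filter,
# printing the exit code next to whatever check_nrpe returned.
#
# Example call (hostname is a placeholder):
#   run_drive_checks my-cluster-name 'drive=*' 'drive=c:' 'drive=o:'

CHECK_NRPE="${CHECK_NRPE:-./check_nrpe}"   # path to check_nrpe on the Nagios server

run_drive_checks() {
    host="$1"
    shift
    # The empty first item stands for the "no -a arguments" test.
    for filter in "" "$@"; do
        if [ -z "$filter" ]; then
            if output=$("$CHECK_NRPE" -H "$host" -t 30 -c check_drivesize 2>&1); then
                status=0
            else
                status=$?
            fi
        else
            if output=$("$CHECK_NRPE" -H "$host" -t 30 -c check_drivesize -a "$filter" 2>&1); then
                status=0
            else
                status=$?
            fi
        fi
        # One line per test: [filter] exit=<code> <plugin output>
        printf '[%s] exit=%d %s\n' "${filter:-no args}" "$status" "$output"
    done
}
```

Running it against a standalone server and against the cluster hostname should make it easy to compare which filters return the packet type error and which return normal output.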