I should clarify my previous response...was on the train w. BB.
We have a mysql HA cluster service called mysql_service that runs on two nodes. I know that's what it is called because running 'clustat' returns that under the service name field (and that's the configured name in luci, too).
From NagiosXI the service is defined as 'check_nrpe$hostname$check_mysql' (not exact, but you get the idea).
On the cluster nodes I have a command defined:
Code: Select all
command[check_mysql]=/usr/lib64/nagios/plugins/check_cluster -s -c 0:0 -d mysql_service
That outputs an OK, because the service is running, making the total 1 - only go critical if the total is 0, right?
Well this command does the exact same thing:
Code: Select all
command[check_mysql]=/usr/lib64/nagios/plugins/check_cluster -s -c 0:0 -d 0
See what I mean? The '-d' portion only cares that SOMETHING is separated by commas, doesn't matter what.
To go back to my original question, the following two commands provide identical results:
Code: Select all
command[check_mysql]=/usr/lib64/nagios/plugins/check_cluster -s -c 0:0 -d mysql_service,goober_service,test_service
Code: Select all
command[check_mysql]=/usr/lib64/nagios/plugins/check_cluster -s -c 0:0 -d 0,0,0
It actually doesn't even matter if the former services are REAL, just requires something present separated by commas. Further, if I have those three services running on a single cluster, this means there is no way to just monitor one. I suppose there is a valid argument for 'you want to know if ANY cluster service goes down', but you get what I'm asking.
So, am I completely missing the point of this plugin, missing some syntax, or is there an issue here?
