Let me explain what I have and what I want to achieve.
I have installed nagios on opensuse ... works like a charm.
We have 200 computer labs with varying numbers of desktops from 7 to 50.
We have setup a group for each lab. (Eg Lab2064, Lab 2065 etc) .
This creates a long list of groups.
Is it possible to create a single group which would list the status of each lab,
eg if 1 or 2 machines down OK,
if 3 or 4 down warning,
and 5 or more critical, these figures would suit the majority of our labs which would have a quantity of 25 student desktops.
Labs with more or less PCs will need some tweaking of these reporting levels
with status information showing quantity down.
Im not sure if this is a capability current available or could possibly be a future enhancement.
If its not currently available, is there a way to access the information through scripting or otherwsie to achieve the same goal.
eg. I presume nagios stores all the recent results of tests in a log file(s) of sorts. Are there tools to manipulate these files?
i.e. If I have a group called Lab2064 which has within it 25 lab PCs, the summary for the group shows say 4 critical (host checks) (ie not accessable by simple ping).
Where would this info be stored, how could it be retrieved? and in what format? ... are there plugins available which could access this ... similarly for service checks.
Any pointers/ assistance would be appreciated.
Regards
Liam