Hi Please could you help,
We monitor about 80 file systems that are shared out from a NAS. The NAS is an EMC celerra which has two Head Nodes, both Head Hodes are up on the network at the sametime but only one is the Primary and is used as the hostname in our Nagios service checks. Recently a software problem caused the Primary to failover to the second Head Node, because the file systems are not defined as a service on the second Head Node Nagios reports them all as critical, even though they were still available.
Can a service be defined to belong to two hosts? or, if at least one of the hosts is available the services stay in an OK state.
The Head Nodes have different IP address and there isn't an IP that fails over between them.
All help much appreciated!
Many thanks
Steve
How to monitor SAN file systems after Head Node failure
-
petronagios
- Posts: 28
- Joined: Tue Aug 16, 2011 8:02 am
Re: How to monitor SAN file systems after Head Node failure
Nagios BPI might be a handy tool for a situation like this. You can create a business process group, set rules to determine it's state, and then run checks against that group as a whole.
http://exchange.nagios.org/directory/Ad ... 29/details
http://exchange.nagios.org/directory/Ad ... 29/details
-
petronagios
- Posts: 28
- Joined: Tue Aug 16, 2011 8:02 am
Re: How to monitor SAN file systems after Head Node failure
Thanks for the quick reply, I'll have a look at BPI.
Cheers
Steve.
Cheers
Steve.
-
slansing
- Posts: 7698
- Joined: Mon Apr 23, 2012 4:28 pm
- Location: Travelling through time and space...
Re: How to monitor SAN file systems after Head Node failure
Closing and marking as resolved.