How to monitor SAN file systems after Head Node failure

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
Locked
petronagios
Posts: 28
Joined: Tue Aug 16, 2011 8:02 am

How to monitor SAN file systems after Head Node failure

Post by petronagios »

Hi Please could you help,

We monitor about 80 file systems that are shared out from a NAS. The NAS is an EMC celerra which has two Head Nodes, both Head Hodes are up on the network at the sametime but only one is the Primary and is used as the hostname in our Nagios service checks. Recently a software problem caused the Primary to failover to the second Head Node, because the file systems are not defined as a service on the second Head Node Nagios reports them all as critical, even though they were still available.

Can a service be defined to belong to two hosts? or, if at least one of the hosts is available the services stay in an OK state.

The Head Nodes have different IP address and there isn't an IP that fails over between them.

All help much appreciated!

Many thanks
Steve
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: How to monitor SAN file systems after Head Node failure

Post by mguthrie »

Nagios BPI might be a handy tool for a situation like this. You can create a business process group, set rules to determine it's state, and then run checks against that group as a whole.
http://exchange.nagios.org/directory/Ad ... 29/details
petronagios
Posts: 28
Joined: Tue Aug 16, 2011 8:02 am

Re: How to monitor SAN file systems after Head Node failure

Post by petronagios »

Thanks for the quick reply, I'll have a look at BPI.

Cheers
Steve.
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: How to monitor SAN file systems after Head Node failure

Post by slansing »

Closing and marking as resolved.
Locked