Page 1 of 1

How to monitor servers in HA mode?

Posted: Tue Aug 21, 2012 5:58 am
by xpertech
It seems I can't successful monitor servers in HA mode(cluster)?

Is it the same way to monitor single server and servers in HA mode?

Re: How to monitor servers in HA mode?

Posted: Tue Aug 21, 2012 9:16 am
by gwakem
We monitor the VIP, that way when failover occurs and the VIP follows, monitoring of the service is maintained.

Re: How to monitor servers in HA mode?

Posted: Tue Aug 21, 2012 11:48 am
by xpertech
gwakem wrote:We monitor the VIP, that way when failover occurs and the VIP follows, monitoring of the service is maintained.
I did monitor via the VIP, but it seems I can't monitor through in the cluster?!
should I notice something?

Re: How to monitor servers in HA mode?

Posted: Tue Aug 21, 2012 11:55 am
by gwakem
You should be able to monitor each individual node in the cluster, but you will get a crit on the passive node, since the application or process you're monitoring for isn't active until HA fail over. In most cases, I have found that monitoring the VIP only for whatever application is enough to verify up status and graph perfdata. For everything else (ie. OS level checking like disk, cpu, etc,) checking the individual nodes should work as intended.

There are a wealth of documents in the library section (http://library.nagios.com/library/produ ... umentation) that should help cover individual situations like those you've posted about.

Re: How to monitor servers in HA mode?

Posted: Tue Aug 21, 2012 2:36 pm
by CGraham
I've accomplished this by writing a custom check that ran 2 separate checks and compared the results. Critical on Both UP or Down, OK if only one is UP.

Alternately you could use check_multi:
http://exchange.nagios.org/directory/Pl ... ti/details

But it takes a little bit of configuration....

Re: How to monitor servers in HA mode?

Posted: Tue Aug 21, 2012 2:43 pm
by scottwilkerson
CGraham & gwakem,

Thank you both for sharing your experiences.

Re: How to monitor servers in HA mode?

Posted: Thu Aug 23, 2012 12:23 am
by xpertech
CGraham wrote:I've accomplished this by writing a custom check that ran 2 separate checks and compared the results. Critical on Both UP or Down, OK if only one is UP.

Alternately you could use check_multi:
http://exchange.nagios.org/directory/Pl ... ti/details

But it takes a little bit of configuration....
Thanks, I will tr......y hard and see if succeed. 8-)