How to monitor vMA appliance and deal with timeouts

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
dlukinski
Posts: 1130
Joined: Tue Oct 06, 2015 9:42 am

How to monitor vMA appliance and deal with timeouts

Post by dlukinski »

Hello XI support

This question is to Troy Lea or anyone else how configured his VMWARE monitoring plugin & vMA:

We've got few clusters (5-6 depending on how to count) and 60+ hosts

vMA was created with 2CPU/4RAM/32 GB DISK (50% from recommended by Troy)
We added clusters (vCenter checks) - no issues, added smaller host groups (w/o storage), no issues, added mid-size host groups, again no issues.
Yesterday added 10 host from the large group: OK
and 10 more (everything exploded with timeouts and no responses) - "(Service check timed out after 60.00 seconds)"

We had to disable all host checks, which made vCenter checks clear.

One mistake made: we had to make vMA parent for all checks, including vCenter
Another question, since we still do not know what have happened, how to monitor vMA appliance to make sure it works?
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Re: How to monitor vMA appliance and deal with timeouts

Post by rkennedy »

I presume the resources are hitting a threshold, can you add more resources to the VMA appliance?

Hmm, when it hits a timeout - do you still receive a ping response from the machine or is it unresponsive?
Former Nagios Employee
dlukinski
Posts: 1130
Joined: Tue Oct 06, 2015 9:42 am

Re: How to monitor vMA appliance and deal with timeouts

Post by dlukinski »

rkennedy wrote:I presume the resources are hitting a threshold, can you add more resources to the VMA appliance?

Hmm, when it hits a timeout - do you still receive a ping response from the machine or is it unresponsive?
We added more resources and it works better now.
Please close this thread
Locked