Flap Detection Question

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
tbmay
Posts: 67
Joined: Wed Jan 22, 2014 6:23 pm

Flap Detection Question

Post by tbmay »

Good morning,

I have a host that we got alerts saying it was flapping twice, which I can't see any evidence of state changes. All the services and the ping to the host looks good. No notifications. The graphs are well within parameters. Any idea how to proceed with the issue?

Thanks a million.
User avatar
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: Flap Detection Question

Post by lmiltchev »

I have a host that we got alerts saying it was flapping twice, which I can't see any evidence of state changes.
Did you check state change in Reports->State History (select the appopriate timeperiod)?
Is it possible you have timeout issues? Can you increase the timeout on the check?
You can also disable temporarily flapping on the host by going to: Home->Host Detail-><your host>->Advanced->Disable Flap Detection
...or you can try playing with different "Low/High flap threshold" values to see if this is going to resolve your issue (CCM->Hosts-><your host>->Modify->Check Settings).
Be sure to check out our Knowledgebase for helpful articles and solutions!
tbmay
Posts: 67
Joined: Wed Jan 22, 2014 6:23 pm

Re: Flap Detection Question

Post by tbmay »

Thank you. It was a Windows box and apparently the "Uptime" service is in some sort of funk. It says there was a "Socket timeout after 10 seconds."
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Flap Detection Question

Post by slansing »

Flapping occurs when a object rapidly changes between states, this is done so that you are not spammed with notifications that only tell you it is going through state changes with no indication of why. Perhaps the check was timing out, and then going into an Ok state when it was able to route to your server properly, and then going back to a timeout. Flapping can be adjusted, but it is possible that there is an issue in the network, or on the windows server. Are your other services on this host reporting fine? You should be able to get a good idea through the state history report of if they were doing well prior to the flapping occurring, and that may help troubleshoot.
Locked