I know this question is kind of site specific and also depends on the priority of the machines.
But in general, for the monitoring frequency, how often do most people out there run their checks?
Monitoring Settings
Specify the parameters that determine how the service should be monitored.
Under normal circumstances...
Monitor the service every XXX minutes.
When a potential problem is first detected ...
Re-check the service every XXX minutes up to XXX times before generating an alert.
Monitoring Frequency
-
sreinhardt
- -fno-stack-protector
- Posts: 4366
- Joined: Mon Nov 19, 2012 12:10 pm
Re: Monitoring Frequency
Most people, will stick with the defaults of 5 minute check intervals, 5 retries at 1 minute intervals. There are definitely people that have other defaults, or select systems that need to be monitored more or less frequently, but 5-1-5 is about the norm.
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
- Box293
- Too Basu
- Posts: 5126
- Joined: Sun Feb 07, 2010 10:55 pm
- Location: Deniliquin, Australia
- Contact:
Re: Monitoring Frequency
Some checks might not need to be run every 5 minutes. A disk space check might be OK every 20 minutes.
Ask yourself, how important is the check. This will help determine the frequency.
Ask yourself, how important is the check. This will help determine the frequency.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Re: Monitoring Frequency
I've created templates for hosts and services with prio1, prio2 and prio3 suffixes
interval - retries - retry interval
*prio1 = 05 - 02 - 02
*prio2 = 10 - 02 - 05
*prio3 = 15 - 02 - 05
Some checks run only once each hour because they take so long to run and consume too much resources. If you have performance data I noticed 1 hour interval is the maximum, otherwise the graphs stop working sometimes...
Got an old thread somewhere where Andy said they were working on a solution for that issue. Anyone know if this issue is still there?
Grtz
Willem
interval - retries - retry interval
*prio1 = 05 - 02 - 02
*prio2 = 10 - 02 - 05
*prio3 = 15 - 02 - 05
Some checks run only once each hour because they take so long to run and consume too much resources. If you have performance data I noticed 1 hour interval is the maximum, otherwise the graphs stop working sometimes...
Got an old thread somewhere where Andy said they were working on a solution for that issue. Anyone know if this issue is still there?
Grtz
Willem
Nagios XI 5.8.1
https://outsideit.net
https://outsideit.net
Re: Monitoring Frequency
@EnvBroker1
Let us know if you have any more questions.
Let us know if you have any more questions.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Re: Monitoring Frequency
I think at this point a month after the topic started we can close it. Over time you'll get a feel for what your environment needs.
Former Nagios employee