Hello,
I am running Nagios XI 5.4.0 with Mod_Gearman installed. Currently there are no external workers.
Attached is a screenshot of the current stats on the server. Also attached is a screenshot of the gearman_top2 Output.
I am monitoring an all Windows environment with Check_WMI_Plus.
The issues I am experiencing are:
1. as I add more hosts and apply configuration the monitoring engine does not come back up unless I set the retain_state_information value to 0 in the nagios.cfg. (problematic as all services and hosts go to pending / unkown for about 45 minutes while the service checks chug through with results.)
2. now that the monitoring engine comes back up, the hostgroups summary / overview / Grid are all blank. Also when attempting to view Metrics, there are no hostgroups to select. In the Core Config Manager, the host groups are still defined, and I still see overall service stats. (Host Group Status screenshot attached.)
The main focus of this question is to get the host group status back, but I think these are related.
Host Group Summary blank
Host Group Summary blank
You do not have the required permissions to view the files attached to this post.
- Box293
- Too Basu
- Posts: 5126
- Joined: Sun Feb 07, 2010 10:55 pm
- Location: Deniliquin, Australia
- Contact:
Re: Host Group Summary blank
Strange, possibly the steps in the KB article I reference for the next issue might shed some light on it.kgugerty wrote:1. as I add more hosts and apply configuration the monitoring engine does not come back up unless I set the retain_state_information value to 0 in the nagios.cfg. (problematic as all services and hosts go to pending / unkown for about 45 minutes while the service checks chug through with results.)
If there are issues related that occur during/after apply config, the steps in this KB article should help identify a problem:
https://support.nagios.com/kb/article.php?id=34
This feels message queue related perhaps. Do any of the steps in this KB article help:kgugerty wrote:2. now that the monitoring engine comes back up, the hostgroups summary / overview / Grid are all blank. Also when attempting to view Metrics, there are no hostgroups to select. In the Core Config Manager, the host groups are still defined, and I still see overall service stats. (Host Group Status screenshot attached.)
https://support.nagios.com/kb/article.php?id=139
Let us know if any of this helps or if strange output is produced.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Re: Host Group Summary blank
Thank you very much!
Those were insightful articles. Here is specifically what we did:
1. We ran the upgrade to 5.4.2 - Per the KB Article https://support.nagios.com/kb/article.p ... ategory=44 , it appears there was a bug in 5.4.0 and 5.4.1. Instead of applying the fix in the article, we attempted the upgrade first.
2. Per this support forum topic https://support.nagios.com/forum/viewto ... 4&start=10 , we increased our load threshold:
3. we also followed the instructions in the KB you provided:
With all of that in place we are now up to 1300 Hosts and 6000 services and running smoothly.
Those were insightful articles. Here is specifically what we did:
1. We ran the upgrade to 5.4.2 - Per the KB Article https://support.nagios.com/kb/article.p ... ategory=44 , it appears there was a bug in 5.4.0 and 5.4.1. Instead of applying the fix in the article, we attempted the upgrade first.
2. Per this support forum topic https://support.nagios.com/forum/viewto ... 4&start=10 , we increased our load threshold:
We have 6 cores so we set the threshold at 50.0. Currently, our max load peaks at just above 8.0 with an avg of 4.0Yes, it seemed like your load was exceeding the default (10.0) value, and npcd was not running. Depending on the number of CPUs that you have, you can increase the threshold in the npcd.cfg (x number of CPUs):
single core -> load_threshold = 10.0
dual core -> load_threshold = 20.0
quad core -> load_threshold = 40.0
3. we also followed the instructions in the KB you provided:
Which suggested to double the values of these entries in /etc/sysctl.confIf there are issues related that occur during/after apply config, the steps in this KB article should help identify a problem:
https://support.nagios.com/kb/article.php?id=34
Code: Select all
kernel.msgmnb = 131072000
kernel.msgmax = 131072000
kernel.msgmni = 256000-
avandemore
- Posts: 1597
- Joined: Tue Sep 27, 2016 4:57 pm
Re: Host Group Summary blank
Great, it looks like that cleared it up for you. Are we good to mark this thread as resolved then?
Previous Nagios employee
Re: Host Group Summary blank
Yes, this issue is resolved. Thanks!