Page 1 of 1

Host Group Summary blank

Posted: Thu Mar 09, 2017 2:10 pm
by kgugerty
Hello,

I am running Nagios XI 5.4.0 with Mod_Gearman installed. Currently there are no external workers.
Attached is a screenshot of the current stats on the server. Also attached is a screenshot of the gearman_top2 Output.
I am monitoring an all Windows environment with Check_WMI_Plus.

The issues I am experiencing are:
1. as I add more hosts and apply configuration the monitoring engine does not come back up unless I set the retain_state_information value to 0 in the nagios.cfg. (problematic as all services and hosts go to pending / unkown for about 45 minutes while the service checks chug through with results.)
2. now that the monitoring engine comes back up, the hostgroups summary / overview / Grid are all blank. Also when attempting to view Metrics, there are no hostgroups to select. In the Core Config Manager, the host groups are still defined, and I still see overall service stats. (Host Group Status screenshot attached.)

The main focus of this question is to get the host group status back, but I think these are related.

Re: Host Group Summary blank

Posted: Thu Mar 09, 2017 6:27 pm
by Box293
kgugerty wrote:1. as I add more hosts and apply configuration the monitoring engine does not come back up unless I set the retain_state_information value to 0 in the nagios.cfg. (problematic as all services and hosts go to pending / unkown for about 45 minutes while the service checks chug through with results.)
Strange, possibly the steps in the KB article I reference for the next issue might shed some light on it.

If there are issues related that occur during/after apply config, the steps in this KB article should help identify a problem:
https://support.nagios.com/kb/article.php?id=34

kgugerty wrote:2. now that the monitoring engine comes back up, the hostgroups summary / overview / Grid are all blank. Also when attempting to view Metrics, there are no hostgroups to select. In the Core Config Manager, the host groups are still defined, and I still see overall service stats. (Host Group Status screenshot attached.)
This feels message queue related perhaps. Do any of the steps in this KB article help:
https://support.nagios.com/kb/article.php?id=139

Let us know if any of this helps or if strange output is produced.

Re: Host Group Summary blank

Posted: Mon Mar 13, 2017 10:09 am
by kgugerty
Thank you very much!

Those were insightful articles. Here is specifically what we did:

1. We ran the upgrade to 5.4.2 - Per the KB Article https://support.nagios.com/kb/article.p ... ategory=44 , it appears there was a bug in 5.4.0 and 5.4.1. Instead of applying the fix in the article, we attempted the upgrade first.
2. Per this support forum topic https://support.nagios.com/forum/viewto ... 4&start=10 , we increased our load threshold:
Yes, it seemed like your load was exceeding the default (10.0) value, and npcd was not running. Depending on the number of CPUs that you have, you can increase the threshold in the npcd.cfg (x number of CPUs):

single core -> load_threshold = 10.0
dual core -> load_threshold = 20.0
quad core -> load_threshold = 40.0
We have 6 cores so we set the threshold at 50.0. Currently, our max load peaks at just above 8.0 with an avg of 4.0

3. we also followed the instructions in the KB you provided:
If there are issues related that occur during/after apply config, the steps in this KB article should help identify a problem:
https://support.nagios.com/kb/article.php?id=34
Which suggested to double the values of these entries in /etc/sysctl.conf

Code: Select all

kernel.msgmnb = 131072000
kernel.msgmax = 131072000
kernel.msgmni = 256000
With all of that in place we are now up to 1300 Hosts and 6000 services and running smoothly.

Re: Host Group Summary blank

Posted: Mon Mar 13, 2017 3:44 pm
by avandemore
Great, it looks like that cleared it up for you. Are we good to mark this thread as resolved then?

Re: Host Group Summary blank

Posted: Tue Mar 14, 2017 1:19 pm
by kgugerty
Yes, this issue is resolved. Thanks!