Nagios Portal Issue
Nagios Portal Issue
Hi Team,
We are getting below blank details on Nagios dashboard. Attached screenshot for reference.
We have tried the below steps but the issue still persists.
============================================================
systemctl stop crond
systemctl stop npcd
systemctl stop nagios
systemctl stop ndo2db
killall -9 ndo2db
pkill -9 -u nagios
for i in $(ipcs -q | grep nagios |awk '{print $2}'); do ipcrm -q $i; done
rm -rf /usr/local/nagiosxi/var/dbmaint.lock
rm -rf /usr/local/nagiosxi/var/event_handler.lock
rm -rf /usr/local/nagiosxi/scripts/reconfigure_nagios.lock
systemctl start ndo2db
systemctl start nagios
systemctl start npcd
systemctl start crond
===============================================
Note: If we refresh multiple times, some host/service summary will show, and during the issue time, the specific host/hostgroup details also not showing.
Thanks,
We are getting below blank details on Nagios dashboard. Attached screenshot for reference.
We have tried the below steps but the issue still persists.
============================================================
systemctl stop crond
systemctl stop npcd
systemctl stop nagios
systemctl stop ndo2db
killall -9 ndo2db
pkill -9 -u nagios
for i in $(ipcs -q | grep nagios |awk '{print $2}'); do ipcrm -q $i; done
rm -rf /usr/local/nagiosxi/var/dbmaint.lock
rm -rf /usr/local/nagiosxi/var/event_handler.lock
rm -rf /usr/local/nagiosxi/scripts/reconfigure_nagios.lock
systemctl start ndo2db
systemctl start nagios
systemctl start npcd
systemctl start crond
===============================================
Note: If we refresh multiple times, some host/service summary will show, and during the issue time, the specific host/hostgroup details also not showing.
Thanks,
You do not have the required permissions to view the files attached to this post.
Re: Nagios Portal Issue
Removed
Last edited by mejokj on Tue Feb 02, 2021 4:37 am, edited 1 time in total.
Re: Nagios Portal Issue
If you recently added a bunch of servers being monitored, ndo2db might be choking on the amount of traffic in the form of perf data it's sending to the database.
This was in your system profile (output from "ipcs") - the numbers are a little high:
If you upgrade Nagios to at least version 5.7.0, this problem may go away. ndo2db, (sometimes called the Database Backend) is no longer needed in Nagios XI.
ndo2db is our older technology that basically listens on a UNIX socket for database inserts, then handles the actual insertion into the database. It has limits, being that it runs into issues when it tries to insert more than the database can handle. In newer versions (Nagios XI 5.7.0 and later), this was replaced by just writing directly to the database from the Nagios worker threads. In addition to being able to handle more database inserts, this resulted in an overall performance boost, too.
If you absolutely don't want to, or can't upgrade, another solution would be increasing the message queue size via a sysctl.conf entry.
This was in your system profile (output from "ipcs") - the numbers are a little high:
Code: Select all
------ Message Queues --------
key msqid owner perms used-bytes messages
0x94000080 294957 nagios 600 12527616 12234
0xa8000080 294966 nagios 600 13968384 13641ndo2db is our older technology that basically listens on a UNIX socket for database inserts, then handles the actual insertion into the database. It has limits, being that it runs into issues when it tries to insert more than the database can handle. In newer versions (Nagios XI 5.7.0 and later), this was replaced by just writing directly to the database from the Nagios worker threads. In addition to being able to handle more database inserts, this resulted in an overall performance boost, too.
If you absolutely don't want to, or can't upgrade, another solution would be increasing the message queue size via a sysctl.conf entry.
If you didn't get an 8% raise over the course of the pandemic, you took a pay cut.
Discussion of wages is protected speech under the National Labor Relations Act, and no employer can tell you you can't disclose your pay with your fellow employees.
Discussion of wages is protected speech under the National Labor Relations Act, and no employer can tell you you can't disclose your pay with your fellow employees.
Re: Nagios Portal Issue
Thanks for the reply.
We will plan for an upgrade.
We could see that kernel parameters kernel.msgmnb and kernel.msgmax is already set to the recommended value of 262144000
Shall we increase the value more as temporary to fix the issue and what should be the next recommended value?
Thanks
We will plan for an upgrade.
We could see that kernel parameters kernel.msgmnb and kernel.msgmax is already set to the recommended value of 262144000
Shall we increase the value more as temporary to fix the issue and what should be the next recommended value?
Thanks
-
benjaminsmith
- Posts: 5324
- Joined: Wed Aug 22, 2018 4:39 pm
- Location: saint paul
Re: Nagios Portal Issue
HI,
One of the best things you can do, however, is to increase the check intervals so the Nagios process has to schedule fewer checks every minute. Increasing the interval from 5 minutes to 10 minutes will really help.
--Benjamin
I would try doubling those values, for example (Cent 7 commands).We could see that kernel parameters kernel.msgmnb and kernel.msgmax is already set to the recommended value of 262144000
Code: Select all
sed -i 's/^kernel\.msgmnb.*/kernel\.msgmnb = 524288000 /g' /etc/sysctl.conf
sed -i 's/^kernel\.msgmax.*/kernel\.msgmax = 524288000 /g' /etc/sysctl.conf
--Benjamin
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Be sure to check out our Knowledgebase for helpful articles and solutions!
Re: Nagios Portal Issue
Hi benjaminsmith,
We have updated kernel parameters kernel.msgmnb and kernel.msgmax to 524288000.
Nagios looks better now. However, the apply configuration is taking too much time and it did not show any error. Attached screenshot for your reference.
Did not see any configuration error
Is this because of ndo2db?
Thanks
We have updated kernel parameters kernel.msgmnb and kernel.msgmax to 524288000.
Nagios looks better now. However, the apply configuration is taking too much time and it did not show any error. Attached screenshot for your reference.
Did not see any configuration error
Is this because of ndo2db?
Thanks
You do not have the required permissions to view the files attached to this post.
-
benjaminsmith
- Posts: 5324
- Joined: Wed Aug 22, 2018 4:39 pm
- Location: saint paul
Re: Nagios Portal Issue
Hi,
Go ahead and run the database repair script, and then try to Apply Configuration again.
If it's still taking a long time, please run the following tail command, apply configuration and post the full output to the thread.
Thanks,
Benjamin
Go ahead and run the database repair script, and then try to Apply Configuration again.
Code: Select all
/usr/local/nagiosxi/scripts/repair_databases.sh
Code: Select all
tail -f /usr/local/nagiosxi/var/cmdsubsys.log
Benjamin
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Be sure to check out our Knowledgebase for helpful articles and solutions!
Re: Nagios Portal Issue
Hi Benjaminsmith,
We have updated the message queue size via a sysctl.conf entry to the highest value 524288000. But the issue still persists (blank details on Nagios dashboard). There is no error found during the database repair
So We have decided to upgrade Nagios as suggested. We have few questions about the upgrade.
1) Do we need to upgrade to the latest version or exactly 5.7.0? The existing version is Nagios 5.6.11
2) Please provide the steps for downgrade in case if any issues occur during the upgrade.
3) Any common issue and fixes are there during the upgrades from the 5.6 to 5.7 version
Thanks
We have updated the message queue size via a sysctl.conf entry to the highest value 524288000. But the issue still persists (blank details on Nagios dashboard). There is no error found during the database repair
So We have decided to upgrade Nagios as suggested. We have few questions about the upgrade.
1) Do we need to upgrade to the latest version or exactly 5.7.0? The existing version is Nagios 5.6.11
2) Please provide the steps for downgrade in case if any issues occur during the upgrade.
3) Any common issue and fixes are there during the upgrades from the 5.6 to 5.7 version
Thanks
-
benjaminsmith
- Posts: 5324
- Joined: Wed Aug 22, 2018 4:39 pm
- Location: saint paul
Re: Nagios Portal Issue
Hi,
Upgrading Nagios XI
Then edit your /usr/local/nagios/etc/nagios.cfg and make sure this line is uncommented:
Make sure this line is commented:
Then start the nagios service:
See: https://support.nagios.com/kb/article/m ... r-892.html
Be sure to take full backup or upgrade your test server first, before proceeding with the upgrade on the production server.
Backing Up And Restoring Your Nagios XI System
I would recommend upgrading to the latest version, 5.8.2, as we continue to make improvements to ndo3, and that version will have the latest updates.) Do we need to upgrade to the latest version or exactly 5.7.0? The existing version is Nagios 5.6.11
Upgrading Nagios XI
Here are the steps for a standard downgrade (i.e. local database host) of NDO. If the database is offloaded, let me know and I will provide the additional steps for that setup.Please provide the steps for downgrade in case if any issues occur during the upgrade.
Code: Select all
systemctl stop nagios
cd /tmp
rm -rf /tmp/nagiosxi
wget https://assets.nagios.com/downloads/nagiosxi/5/xi-5.6.14.tar.gz
tar zxf xi-5.6.14.tar.gz
cd /tmp/nagiosxi/subcomponents/ndoutils
./install
systemctl enable ndo2db
Code: Select all
broker_module=/usr/local/nagios/bin/ndomod.o config_file=/usr/local/nagios/etc/ndomod.cfg
Code: Select all
#broker_module=/usr/local/nagios/bin/ndo.so /usr/local/nagios/etc/ndo.cfg
Code: Select all
systemctl start nagios
If you're running Cent 6, then upgrading is not going to be possible. You'll need to migrate to a newer, supported distribution.Any common issue and fixes are there during the upgrades from the 5.6 to 5.7 version
See: https://support.nagios.com/kb/article/m ... r-892.html
Be sure to take full backup or upgrade your test server first, before proceeding with the upgrade on the production server.
Backing Up And Restoring Your Nagios XI System
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Be sure to check out our Knowledgebase for helpful articles and solutions!
Re: Nagios Portal Issue
Hi Benjamin,
We have upgraded the Nagiosxi to the latest version 5.8.2. But the issue still persists.
Sent you the system profile file as PM to you.
root@nagiosxi store]# cat /etc/sysctl.conf
# System default settings live in /usr/lib/sysctl.d/00-system.conf.
# To override those settings, enter new settings here, or in an /etc/sysctl.d/<name>.conf file
#
# For more information, see sysctl.conf(5) and sysctl.d(5).
kernel.msgmnb = 524288000
kernel.msgmax = 524288000
kernel.shmmax = 4294967295
kernel.shmall = 268435456
[root@nagiosxi store]#
Please help us with this as this issue is pending for a long time.
Thanks
We have upgraded the Nagiosxi to the latest version 5.8.2. But the issue still persists.
Sent you the system profile file as PM to you.
root@nagiosxi store]# cat /etc/sysctl.conf
# System default settings live in /usr/lib/sysctl.d/00-system.conf.
# To override those settings, enter new settings here, or in an /etc/sysctl.d/<name>.conf file
#
# For more information, see sysctl.conf(5) and sysctl.d(5).
kernel.msgmnb = 524288000
kernel.msgmax = 524288000
kernel.shmmax = 4294967295
kernel.shmall = 268435456
[root@nagiosxi store]#
Please help us with this as this issue is pending for a long time.
Thanks
You do not have the required permissions to view the files attached to this post.