Nagios XI jobs showing warning

Bala.Mutyam
Posts: 97
Joined: Wed Apr 29, 2020 12:18 pm

Nagios XI jobs showing warning

Post by Bala.Mutyam »

Hi,

Our Nagios XI Jobs monitoring is showing a warning, but System Component Status is all green. I also restarted all the Nagios XI components using the commands below (https://support.nagios.com/forum/viewto ... =6&t=33494), but the issue persists. Can you please have a look?

Code: Select all

service nagios stop
service npcd stop
service ndo2db stop (service doesn't exist)
service mysqld stop (service doesn't exist)
service postgresql stop (service doesn't exist)
service httpd stop
service crond stop
service crond start
service httpd start
service postgresql start
service mysqld start
service ndo2db start
service npcd start
service mariadb restart
Thanks
vtrac
Posts: 903
Joined: Tue Oct 27, 2020 1:35 pm

Re: Nagios XI jobs showing warning

Post by vtrac »

Hi,
How are you doing?
It is a good sign that your System Component Status is all green; it means all the monitoring engines are up and running.

A warning in the jobs monitoring for a particular job means that job's value has fallen into the defined "w" (warning) or "c" (critical) range.
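
As a rough illustration of how "w" and "c" thresholds translate into states, here is a minimal sketch. The classify function name and the integer, higher-is-worse comparison are assumptions made for illustration; only the exit codes 0, 1 and 2 follow the usual Nagios plugin convention. The actual jobs check may compare its values differently.

```shell
#!/bin/sh
# Hypothetical helper: map a numeric value to a Nagios-style state.
# Assumes integer inputs and "higher is worse"; real plugins accept
# richer threshold ranges than this.
classify() {
    value=$1 warn=$2 crit=$3
    if [ "$value" -ge "$crit" ]; then
        echo "CRITICAL"; return 2   # plugin exit code 2 = CRITICAL
    elif [ "$value" -ge "$warn" ]; then
        echo "WARNING"; return 1    # plugin exit code 1 = WARNING
    fi
    echo "OK"; return 0             # plugin exit code 0 = OK
}
```

For example, with a warning threshold of 5 and a critical threshold of 10, running classify 7 5 10 reports WARNING, so a job can sit in the warning state while every component is still running.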

Can you please share (upload) screenshots of the jobs being in the "warning" state?

To help further investigate the issue, please upload (share) the "profile.zip".
Log in to the Nagios XI GUI using a web browser.
Click the "Admin" > "System Config" > "System Profile" Menu
Click the "Download Profile" button

If you're unable to generate the profile through the web interface,
please try generating it from the command line by running these commands as root:

Code: Select all

rm -rf /usr/local/nagiosxi/var/components/profile*

/usr/local/nagiosxi/scripts/components/getprofile.sh SUPPORT
Best Regards,
Vinh
Bala.Mutyam
Posts: 97
Joined: Wed Apr 29, 2020 12:18 pm

Re: Nagios XI jobs showing warning

Post by Bala.Mutyam »

@ vtrac : Hi

I'm good, thanks. Yourself?

I've PM'd the system profile.

Here is the screenshot of the warning.
vtrac
Posts: 903
Joined: Tue Oct 27, 2020 1:35 pm

Re: Nagios XI jobs showing warning

Post by vtrac »

Hi Bala,
Please resend the "profile.zip", as I did not see any attachment in the PM.

Also, did you have a chance to try the steps suggested in my last reply?


Best Regards,
Vinh
Bala.Mutyam
Posts: 97
Joined: Wed Apr 29, 2020 12:18 pm

Re: Nagios XI jobs showing warning

Post by Bala.Mutyam »

Hi Vinh,

Not sure what happened, but I've PM'd it now.
"Also, did you have a chance to try the steps suggested in my last reply?"
Did you mean the suggestion for generating the profile?

Thanks
Bala
vtrac
Posts: 903
Joined: Tue Oct 27, 2020 1:35 pm

Re: Nagios XI jobs showing warning

Post by vtrac »

Hi Bala,
Hope you are having a good day!!

It looks like you have a crashed database table. Please run the command below to repair the databases:

Code: Select all

/usr/local/nagiosxi/scripts/repair_databases.sh
Please also run the command below to check the "nagios" account, and post the output if possible:

Code: Select all

chage -l nagios

Also, please run the commands below and post the output or screenshots if possible.

Code: Select all

systemctl stop crond
systemctl stop npcd
systemctl stop nagios
pkill -9 -u nagios

for i in $(ipcs -q | grep nagios |awk '{print $2}'); do ipcrm -q $i; done

rm -rf /usr/local/nagiosxi/var/dbmaint.lock
rm -rf /usr/local/nagiosxi/var/event_handler.lock
rm -rf /usr/local/nagiosxi/scripts/reconfigure_nagios.lock
systemctl start nagios
systemctl start npcd
systemctl start crond
systemctl restart httpd
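
Before removing message queues, it can help to see which IDs the loop above would pick up. Here is a minimal sketch, assuming the usual Linux (util-linux) ipcs -q column layout of key, msqid, owner, perms and so on. The queue_ids_for_owner name is hypothetical. It matches on the owner column only, which is a little stricter than grep nagios matching anywhere on the line.

```shell
#!/bin/sh
# Hypothetical helper: read `ipcs -q` output on stdin and print the IDs of
# message queues owned by the given user (default: nagios).
# Assumes columns: key, msqid, owner, perms, used-bytes, messages.
queue_ids_for_owner() {
    awk -v owner="${1:-nagios}" '$3 == owner { print $2 }'
}

# Dry run (nothing is removed), shown as a comment:
#   ipcs -q | queue_ids_for_owner nagios
```

If the list looks right, the same IDs can be passed to ipcrm -q as in the loop above.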

Best Regards,
Vinh
Bala.Mutyam
Posts: 97
Joined: Wed Apr 29, 2020 12:18 pm

Re: Nagios XI jobs showing warning

Post by Bala.Mutyam »

Hi Vinh,

Thanks for the update. Here are the outputs of all the commands.

Code: Select all

===============
REPAIR COMPLETE
===============
DATABASE: nagiosql
TABLE:
/var/lib/mysql/nagiosql ~
DATABASE: nagiosxi
TABLE:
/var/lib/mysql/nagiosxi ~

=======================
nagios database repair succeeded

Code: Select all

[root@localhost ~]# chage -l nagios
Last password change                                    : Apr 21, 2020
Password expires                                        : never
Password inactive                                       : never
Account expires                                         : never
Minimum number of days between password change          : 0
Maximum number of days between password change          : 99999
Number of days of warning before password expires       : 7
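
For anyone repeating this check on several servers, here is a minimal sketch that flags any expiry field not set to "never". The expiry_fields_set name is hypothetical, and the field labels are taken from the chage -l output above. An expired password or account on the "nagios" user is presumably what this check is meant to rule out.

```shell
#!/bin/sh
# Hypothetical helper: read `chage -l <user>` output on stdin and print any
# expiry-related line whose value is something other than "never".
# Prints nothing when no expiry is configured.
expiry_fields_set() {
    awk -F ':' '
        /^(Password expires|Password inactive|Account expires)/ {
            value = $2
            gsub(/^[ \t]+|[ \t]+$/, "", value)   # trim surrounding whitespace
            if (value != "never") print
        }'
}
```

It could be used as chage -l nagios | expiry_fields_set; with the output shown above it prints nothing, since all three fields are "never".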

Code: Select all

[root@localhost ~]# systemctl stop crond
[root@localhost ~]# systemctl stop npcd
[root@localhost ~]# systemctl stop nagios
[root@localhost ~]# ps -ef | grep nagios
nagios     9395      1  0 Apr06 ?        00:05:00 /usr/local/nagios/bin/npcd -d -f /usr/local/nagios/etc/pnp/npcd.cfg
nagios    10675  10666  0 Feb19 ?        00:00:00 /bin/sh -c /usr/bin/php -q /usr/local/nagiosxi/cron/cmdsubsys.php >> /usr/local/nagiosxi/var/cmdsubsys.log 2>&1
nagios    10685  10675  0 Feb19 ?        00:00:00 /usr/bin/php -q /usr/local/nagiosxi/cron/cmdsubsys.php
nagios    18727      1  0 Jun21 ?        00:00:03 /usr/local/nagios/bin/npcd -d -f /usr/local/nagios/etc/pnp/npcd.cfg
root     103196 127075  0 17:41 pts/0    00:00:00 grep --color=auto nagios
[root@localhost ~]# pkill -9 -u nagios
[root@localhost ~]#
[root@localhost ~]# ps -ef | grep nagios
root     103214 127075  0 17:41 pts/0    00:00:00 grep --color=auto nagios
[root@localhost ~]#
[root@localhost ~]#
[root@localhost ~]# ps -ef | grep nagios
root     103218 127075  0 17:41 pts/0    00:00:00 grep --color=auto nagios
[root@localhost ~]# for i in $(ipcs -q | grep nagios |awk '{print $2}'); do ipcrm -q $i; done
[root@localhost ~]# rm -rf /usr/local/nagiosxi/var/dbmaint.lock
[root@localhost ~]# rm -rf /usr/local/nagiosxi/var/event_handler.lock
[root@localhost ~]# rm -rf /usr/local/nagiosxi/scripts/reconfigure_nagios.lock
[root@localhost ~]# systemctl start nagios
[root@localhost ~]# systemctl start npcd
[root@localhost ~]#
[root@localhost ~]# systemctl start crond
[root@localhost ~]# systemctl restart httpd
[root@localhost ~]#
It's still showing a warning.

Thanks
Bala
vtrac
Posts: 903
Joined: Tue Oct 27, 2020 1:35 pm

Re: Nagios XI jobs showing warning

Post by vtrac »

Hi,
May I ask whether there were any changes to the system prior to this issue?

Please provide a new system profile ("profile.zip").

Also, please run the following and post the output:

Code: Select all

cat /etc/cron.deny

cat /etc/cron.allow
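
For context, these two files decide whether the "nagios" user may use crontab at all. Here is a minimal sketch of the rules as described in cronie's crontab(1) man page; other cron implementations may behave differently. The cron_access name is hypothetical, and the file paths are parameters so the logic can be tried on copies.

```shell
#!/bin/sh
# Hypothetical helper mirroring the access rules described in cronie's
# crontab(1): cron.allow wins if present, otherwise cron.deny is consulted,
# and with neither file only the superuser may use crontab.
cron_access() {
    user=$1 allow=${2:-/etc/cron.allow} deny=${3:-/etc/cron.deny}
    if [ -e "$allow" ]; then
        # -x matches whole lines only, -F treats the name as a fixed string
        grep -qxF -- "$user" "$allow" && echo allowed || echo denied
    elif [ -e "$deny" ]; then
        grep -qxF -- "$user" "$deny" && echo denied || echo allowed
    else
        echo "root-only"
    fi
}
```

Running cron_access nagios on the server itself would print allowed, denied or root-only for the default file locations.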

Best Regards,
Vinh
vtrac
Posts: 903
Joined: Tue Oct 27, 2020 1:35 pm

Re: Nagios XI jobs showing warning

Post by vtrac »

Also, I could not find any host or service named "rmfv170nxi1b" in the "objects.cache" file, as shown in the screenshot.

Please tell me which host or service is having the issue.

What is the output of this command?

Code: Select all

/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg
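
The verification output can be long, so here is a minimal sketch for pulling out just the totals. It assumes the summary lines look like "Total Warnings: 0" and "Total Errors:   0". The preflight_summary name is hypothetical, and -1 means the corresponding line was not found.

```shell
#!/bin/sh
# Hypothetical helper: read the config verification output on stdin and
# print "warnings=N errors=M". Assumes summary lines of the form
# "Total Warnings: N" and "Total Errors: M"; -1 means "line not found".
preflight_summary() {
    awk -F ':' '
        BEGIN { w = -1; e = -1 }
        /^Total Warnings:/ { w = $2 + 0 }
        /^Total Errors:/   { e = $2 + 0 }
        END { printf "warnings=%d errors=%d\n", w, e }'
}
```

It could be used by piping the command above into preflight_summary; the full output is still the thing to post here.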

Best Regards,
Vinh
Bala.Mutyam
Posts: 97
Joined: Wed Apr 29, 2020 12:18 pm

Re: Nagios XI jobs showing warning

Post by Bala.Mutyam »

Hi,
"May I ask whether there were any changes to the system prior to this issue?"
Answered via PM.

Code: Select all

[root@localhost ~]# cat /etc/cron.deny
[root@localhost ~]# cat /etc/cron.allow
cat: /etc/cron.allow: No such file or directory
[root@localhost ~]#
Output of

Code: Select all

/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg
PM'd

Thanks
Bala