Mass acknowledge or scheduled downtime

nagios_aws
Posts: 76
Joined: Wed May 18, 2016 3:34 am

Mass acknowledge or scheduled downtime

Post by nagios_aws »

Hello everyone,

Since I have seen this a couple of times, I will explain my problem:

Sometimes, I or other admin users can't acknowledge a problem or schedule downtime because of errors (shown in red in the Nagios web portal).

I don't have the error right now, but is there a way to find it in the logs? What could be causing this?

Thank you
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Re: Mass acknowledge or scheduled downtime

Post by rkennedy »

You could probably scan through apache2 logs, but these will not tell us much unless it's related to an external resource.

Can you post a screenshot of the error you receive, so we can better help you with the issue at hand?
Former Nagios Employee
nagios_aws
Posts: 76
Joined: Wed May 18, 2016 3:34 am

Re: Mass acknowledge or scheduled downtime

Post by nagios_aws »

Hello,

I finally managed to get a screenshot of this; please find it attached.

There is absolutely no information about what went wrong. Is that normal?

Thank you
dwhitfield
Former Nagios Staff
Posts: 4583
Joined: Wed Sep 21, 2016 10:29 am
Location: NoLo, Minneapolis, MN

Re: Mass acknowledge or scheduled downtime

Post by dwhitfield »

It's likely we'll need more than 30 lines by the time you see this, but try

Code: Select all

tail -30 /var/log/messages
tail -30 /var/log/httpd/error_log
If you don't see anything relevant, up it to 100.

Also, what version of XI and the component are you using?

One last question...the screenshot is cut off on the right. I'm assuming there's no pull down menu, links, etc. over on that side?
nagios_aws
Posts: 76
Joined: Wed May 18, 2016 3:34 am

Re: Mass acknowledge or scheduled downtime

Post by nagios_aws »

Hello,

I got this problem again this morning; nobody was able to acknowledge problems.

I checked "/var/log/httpd/error_log" and here is the result:

Code: Select all

[Thu Feb 23 10:10:16 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:10:17 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:10:20 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/dashboard.php?&=
[Thu Feb 23 10:10:25 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:10:25 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:10:28 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/dashboard.php?&=
sh: /usr/local/nagios/var/rw/nagios.cmd: Permission denied
sh: /usr/local/nagios/var/rw/nagios.cmd: Permission denied
sh: /usr/local/nagios/var/rw/nagios.cmd: Permission denied
[Thu Feb 23 10:10:40 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:10:41 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:22:46 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/dashboard.php?&=
I don't know if it's related, since it sometimes works without any changes.

I'm using version 5.2.7 of Nagios XI.

The screenshot shows everything there is; I have attached another one from this morning's problem.

Any ideas?
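
In the log excerpt above, the "Permission denied" lines for nagios.cmd are buried among repeated PHP notices. A minimal sketch for filtering those notices out so the remaining lines stand out; the function name show_non_notice_lines is hypothetical, and the log path in the usage comment is the one quoted in this thread:

```shell
# Hypothetical helper: print every line of a log file that is NOT a PHP notice,
# so other messages (such as "Permission denied") are easier to spot.
show_non_notice_lines() {
    log_file="$1"
    grep -v 'PHP Notice' "$log_file"
}

# Usage (path as quoted in this thread):
#   show_non_notice_lines /var/log/httpd/error_log
```

The filter only hides the noise; it does not by itself establish that the remaining lines are the cause of the failed acknowledgements.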
dwhitfield
Former Nagios Staff
Posts: 4583
Joined: Wed Sep 21, 2016 10:29 am
Location: NoLo, Minneapolis, MN

Re: Mass acknowledge or scheduled downtime

Post by dwhitfield »

Is it the same host or service each time by chance?

What's the output of the following:

Code: Select all

ll /usr/local/nagios/var/rw/
ll /usr/local/nagiosxi/html/includes/components/massacknowledge
sestatus
cat /etc/hosts
Also, in Admin > System Config > Manage System Config, what are your "Program URL" and "External URL" fields?

I suspect this would happen all the time if this were the case, but do any of your usernames have spaces in them?
nagios_aws
Posts: 76
Joined: Wed May 18, 2016 3:34 am

Re: Mass acknowledge or scheduled downtime

Post by nagios_aws »

Hello, here are the results of the commands:

Code: Select all

ll /usr/local/nagios/var/rw/
total 0
prw-rw---- 1 nagios nagios 0 Feb 24 07:34 nagios.cmd
srw-rw---- 1 nagios nagios 0 Feb 24 07:34 nagios.qh

Code: Select all

ll /usr/local/nagiosxi/html/includes/components/massacknowledge
total 40
-rwxr-xr-x 1 nagios nagios  2384 Jun  6  2016 CHANGELOG.txt
-rwxr-xr-x 1 nagios nagios 25916 Jun  6  2016 index.php
-rwxr-xr-x 1 nagios nagios  2630 Jun  6  2016 massacknowledge.inc.php
-rwxr-xr-x 1 nagios nagios   576 Jun  6  2016 sanity.sh

Code: Select all

sestatus
SELinux status:                 disabled

Code: Select all

more /etc/hosts
# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1               localhost.localdomain localhost
::1                     localhost6.localdomain6 localhost6



MY-IP   MY-FQDN MY-SHORTNAME
The "Program URL" and "External URL" values are in the attached screenshot:
2017-02-24_09h57_10.png
I don't have any spaces in the "username" field, but I do have some in the "Name" field.

Any ideas?
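
The listing above shows nagios.cmd as prw-rw---- owned by nagios:nagios, so only the nagios user and members of the nagios group can write to it. A rough sketch for checking write access from a given account; the function name check_writable is hypothetical, and the "apache" account in the usage comment is an assumption about which user the web server runs as:

```shell
# Hypothetical helper: report whether the user running this shell
# can write to the given path.
check_writable() {
    if [ -w "$1" ]; then
        echo "writable: $1"
    else
        echo "NOT writable: $1"
    fi
}

# Usage as the web server account ("apache" is an assumption; adjust as needed):
#   sudo -u apache test -w /usr/local/nagios/var/rw/nagios.cmd \
#       && echo "writable" || echo "NOT writable"
```

Running the check at the moment acknowledgements fail is more useful than running it while everything works, since the problem is intermittent.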
ssax
Dreams In Code
Posts: 7682
Joined: Wed Feb 11, 2015 12:54 pm

Re: Mass acknowledge or scheduled downtime

Post by ssax »

Please run these commands and let me know if it resolves the issue:

Code: Select all

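# Stop Nagios and ndo2db, and kill any stray nagios processes
service nagios stop
killall -9 nagios
service ndo2db stop
# Remove leftover message queues that belong to nagios
for i in $(ipcs -q | grep nagios | awk '{print $2}'); do ipcrm -q "$i"; done
# Clear the command directory and reset its ownership and permissions
rm -rf /usr/local/nagios/var/rw/*
chown nagios:nagcmd /usr/local/nagios/var/rw
chmod 775 /usr/local/nagios/var/rw
chmod g+s /usr/local/nagios/var/rw
service ndo2db start
service nagios start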
Do you use SNMP traps? If so, please send me your /usr/local/bin/snmptraphandling.py.
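
To confirm the chmod steps took effect, the directory's permission string can be inspected afterwards. A small sketch, with dir_mode_string as a hypothetical helper name; after chmod 775 followed by chmod g+s the string should read drwxrwsr-x:

```shell
# Hypothetical helper: print the 10-character permission string of a directory.
# A 775 directory with the setgid bit set shows as "drwxrwsr-x".
dir_mode_string() {
    ls -ld "$1" | cut -c1-10
}

# Usage (path as used in this thread):
#   dir_mode_string /usr/local/nagios/var/rw
```

The full ls -ld output also shows the owner and group, which should match what the chown command set.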
ssax
Dreams In Code
Posts: 7682
Joined: Wed Feb 11, 2015 12:54 pm

Re: Mass acknowledge or scheduled downtime

Post by ssax »

I added another command in there (the chown one); if you already ran it, please run it again.

Thank you