
Mass acknowledge or scheduled downtime

Posted: Thu Feb 16, 2017 3:12 am
by nagios_aws
Hello everyone,

Since I have seen this a couple of times now, I will explain my problem:

Sometimes I or other admin users can't acknowledge a problem or schedule downtime because of errors (shown in red in the Nagios web portal).

I don't have the error in front of me right now, but is there a way to find it in the logs? What could be causing this?

Thank you

Re: Mass acknowledge or scheduled downtime

Posted: Thu Feb 16, 2017 10:00 am
by rkennedy
You could probably scan through apache2 logs, but these will not tell us much unless it's related to an external resource.

Can you post a screenshot of the error you receive, so we can better help you with the issue at hand?

Re: Mass acknowledge or scheduled downtime

Posted: Mon Feb 20, 2017 8:39 am
by nagios_aws
Hello,

I finally managed to get a screenshot of this; please find it attached.

There is absolutely no information about what went wrong. Is that normal?

Thank you

Re: Mass acknowledge or scheduled downtime

Posted: Mon Feb 20, 2017 10:48 am
by dwhitfield
It's likely we'll need more than 30 lines by the time you see this, but try

Code:

tail -30 /var/log/messages
tail -30 /var/log/httpd/error_log
If you don't see anything relevant, up it to 100.

Also, what version of XI and the component are you using?

One last question...the screenshot is cut off on the right. I'm assuming there's no pull down menu, links, etc. over on that side?
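
If the tail output is noisy, something like the following might help narrow it down. This is only a sketch: `scan_log` is a made-up helper name, and the default pattern is a guess at what a failed command submission might look like, not a list of known Nagios XI messages.

```shell
#!/bin/sh
# Hypothetical helper: show the lines among the last N of a log that match a pattern.
# Usage: scan_log FILE [LINES] [PATTERN]
scan_log() {
    file=$1
    lines=${2:-100}
    # The default pattern is an assumption; adjust it to whatever you are chasing.
    pattern=${3:-'denied|failed|cannot'}
    tail -n "$lines" "$file" | grep -iE "$pattern"
}

# Example, using the two log paths from the commands above:
# scan_log /var/log/messages 100
# scan_log /var/log/httpd/error_log 100
```

If nothing matches, grep prints nothing and returns a non-zero status, so an empty result just means the pattern was not found in that window.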

Re: Mass acknowledge or scheduled downtime

Posted: Thu Feb 23, 2017 4:49 am
by nagios_aws
Hello,

I ran into this problem again this morning; nobody was able to acknowledge problems.

I checked /var/log/httpd/error_log and here is the result:

Code:

[Thu Feb 23 10:10:16 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:10:17 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:10:20 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/dashboard.php?&=
[Thu Feb 23 10:10:25 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:10:25 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:10:28 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/dashboard.php?&=
sh: /usr/local/nagios/var/rw/nagios.cmd: Permission denied
sh: /usr/local/nagios/var/rw/nagios.cmd: Permission denied
sh: /usr/local/nagios/var/rw/nagios.cmd: Permission denied
[Thu Feb 23 10:10:40 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:10:41 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/
[Thu Feb 23 10:22:46 2017] [error] [client XX.XX.XX.XX] PHP Notice:  Undefined offset: 1000 in /usr/local/nagiosxi/html/includes/components/helpsystem/helpsystem.inc.php on line 252, referer: http://ServerFQDN/nagiosxi/dashboards/dashboard.php?&=
I don't know if it's related, since it sometimes works without any modifications.

I'm using version 5.2.7 of Nagios XI.

I have nothing more than what is on the screenshot. I attached one more from this morning's problem.

Any ideas?

Re: Mass acknowledge or scheduled downtime

Posted: Thu Feb 23, 2017 3:51 pm
by dwhitfield
Is it the same host or service each time by chance?

What's the output of the following:

Code:

ll /usr/local/nagios/var/rw/
ll /usr/local/nagiosxi/html/includes/components/massacknowledge
sestatus
cat /etc/hosts
Also, in Admin > System Config > Manage System Config, what are your "Program URL" and "External URL" fields?

I suspect this would happen all the time if this were the case, but do any of your usernames contain spaces?
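
For the first listing, what matters is the owner and group on nagios.cmd, since the "Permission denied" lines in your error_log suggest the web server could not write to that pipe. A rough sketch for pulling those two fields out of an `ls -l` line is below; `owner_group` is a made-up helper name, and the web server user name `apache` in the comment is an assumption about your install.

```shell
#!/bin/sh
# Hypothetical helper: print "owner group" from one line of `ls -l` output
# read on standard input (fields 3 and 4 of the long listing).
owner_group() {
    awk '{ print $3, $4 }'
}

# Example:
# ls -l /usr/local/nagios/var/rw/nagios.cmd | owner_group
#
# To compare against the groups of the web server user
# (the user name "apache" is an assumption):
# id -nG apache
```

If the group printed for the pipe is not one the web server user belongs to, and the pipe has no permissions for others, writes from the web interface would be expected to fail.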

Re: Mass acknowledge or scheduled downtime

Posted: Fri Feb 24, 2017 4:03 am
by nagios_aws
Hello, here are the results of the commands:

Code:

ll /usr/local/nagios/var/rw/
total 0
prw-rw---- 1 nagios nagios 0 Feb 24 07:34 nagios.cmd
srw-rw---- 1 nagios nagios 0 Feb 24 07:34 nagios.qh

Code:

ll /usr/local/nagiosxi/html/includes/components/massacknowledge
total 40
-rwxr-xr-x 1 nagios nagios  2384 Jun  6  2016 CHANGELOG.txt
-rwxr-xr-x 1 nagios nagios 25916 Jun  6  2016 index.php
-rwxr-xr-x 1 nagios nagios  2630 Jun  6  2016 massacknowledge.inc.php
-rwxr-xr-x 1 nagios nagios   576 Jun  6  2016 sanity.sh

Code:

sestatus
SELinux status:                 disabled

Code:

more /etc/hosts
# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1               localhost.localdomain localhost
::1                     localhost6.localdomain6 localhost6



MY-IP   MY-FQDN MY-SHORTNAME
The "Program URL" and "External URL" fields are shown in the attached screenshot:
2017-02-24_09h57_10.png
I don't have any spaces in the "username" field, but I do have some in the "Name" field.

Any ideas?

Re: Mass acknowledge or scheduled downtime

Posted: Fri Feb 24, 2017 2:51 pm
by ssax
Please run these commands and let me know if it resolves the issue:

Code:

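# Stop Nagios and make sure no stray processes are left behind
service nagios stop
killall -9 nagios
service ndo2db stop
# Remove leftover Nagios IPC message queues
for i in $(ipcs -q | grep nagios | awk '{print $2}'); do ipcrm -q "$i"; done
# Clear the old command pipe and socket, then reset ownership and permissions on the directory
rm -rf /usr/local/nagios/var/rw/*
chown nagios:nagcmd /usr/local/nagios/var/rw
chmod 775 /usr/local/nagios/var/rw
chmod g+s /usr/local/nagios/var/rw
# Start the services again
service ndo2db start
service nagios start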
Do you use SNMP traps? If so, please send me your /usr/local/bin/snmptraphandling.py.
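
To double-check the result afterwards, a small sketch like this could confirm that the setgid bit stuck on the directory. `is_setgid_mode` is a made-up helper name, and the example assumes GNU stat. With setgid on the directory, files created in it normally inherit the directory's group, which is why the bit matters here.

```shell
#!/bin/sh
# Hypothetical helper: succeed if a symbolic mode string (as printed by
# `stat -c %A` or the first column of `ls -ld`) has the setgid bit set.
# The group-execute slot is the 7th character and shows "s" or "S" when setgid is on.
is_setgid_mode() {
    case $1 in
        ??????[sS]*) return 0 ;;
        *) return 1 ;;
    esac
}

# Example, assuming GNU stat:
# is_setgid_mode "$(stat -c %A /usr/local/nagios/var/rw)" && echo "setgid is set"
# stat -c '%A %U %G' /usr/local/nagios/var/rw
```

After the chown and chmod commands above, the second example should show drwxrwsr-x with owner nagios and group nagcmd.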

Re: Mass acknowledge or scheduled downtime

Posted: Fri Feb 24, 2017 2:53 pm
by ssax
I added another command in there (the chown one). If you already ran the commands, please run them again.

Thank you