Currently we get sometimes notified by services that are in scheduled downtime. I am a bit puzzled. Also on the GUI does not always show that a service has scheduled downtime.
We use Nagios XI 5.3.2
I added Host and all its services to scheduled downtime using the command file.
[1479728291] EXTERNAL COMMAND: SCHEDULE_HOST_DOWNTIME;lbnss22;1479728291;1479731891;1;0;3600;ch002854;Schedule Downtime
[1479728291] EXTERNAL COMMAND: SCHEDULE_HOST_SVC_DOWNTIME;lbnss22;1479728291;1479731891;1;0;3600;ch002854;Scheduled Downtime
[1479728291] HOST DOWNTIME ALERT: lbnss22;STARTED; Host has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;Worker choked;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;Total Service Problems;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;Total Host Problems;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;PassiveServiceChecks 5min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;PassiveServiceChecks 1min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;PassiveServiceChecks 15min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;PassiveHostChecks 5min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;PassiveHostChecks 1min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;PassiveHostChecks 15min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;NRPE Service;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;Local Filesystems;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;Load Average;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;HighCommandBufferUsage;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;FS corruption;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;ExternalCommandsUsed 5min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;ExternalCommandsUsed 1min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;Check Nagios command file;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;Check Nagios API Service;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;Check Linux API Service;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;AvgServiceExecTime;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;AvgPassiveHostLatency;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;AvgHostExecTime;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;AvgCommandBufferUsage;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;AvgActiveServiceLatency;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;AvgActiveHostLatency;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;ActiveServiceChecks 5min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;ActiveServiceChecks 1min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;ActiveServiceChecks 15min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;ActiveHostChecks 5min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;ActiveHostChecks 1min;STARTED; Service has entered a period of scheduled downtime
[1479728291] SERVICE DOWNTIME ALERT: lbnss22;ActiveHostChecks 15min;STARTED; Service has entered a period of scheduled downtime
The log file states that the checks have entered scheduled downtime period.
[1479728579] SERVICE ALERT: lbnss22;Check Nagios API Service;CRITICAL;HARD;1;CRITICAL: Status CRITICAL.URLError[[Errno 111] Connection refused], url:http://10.32.30.22:8080
[1479728579] SERVICE NOTIFICATION: Linux Team E-Mail;lbnss22;Check Nagios API Service;CRITICAL;notify-service-by-email;CRITICAL: Status CRITICAL.URLError[[Errno 111] Connection refused], url:http://10.32.30.22:8080
[1479728579] SERVICE NOTIFICATION: Linux Team SMS;lbnss22;Check Nagios API Service;CRITICAL;notify-service-by-inConsole-group;CRITICAL: Status CRITICAL.URLError[[Errno 111] Connection refused], url:http://10.32.30.22:8080
Also looking at the GUI sometimes the downtime icon is there. Sometimes not. Looking at the screenshot all services should have the scheduled downtime icon.
downtime.PNG
You do not have the required permissions to view the files attached to this post.
Last edited by dwhitfield on Tue Nov 22, 2016 3:40 pm, edited 1 time in total.
Reason:marking with green check mark
Could you PM a profile over? This will have quite a few files to look at that should have some interesting information. (Admin -> System Profile -> Download Profile)
Also, a few things to verify -
1. Can you show us a screenshot of the scheduled downtime on the XI interface?
2. Is it applying properly in the Core interface? Could you post a screenshot of the Core interface - http://ip.of.nagios/nagios/ (replace ip.of.nagios with the IP/hostname of the machine) - then login with an admin account.
3. Please run the following commands and post the output, to verify the timezone set on your XI machine.
rkennedy wrote:Could you PM a profile over? This will have quite a few files to look at that should have some interesting information. (Admin -> System Profile -> Download Profile)
Also, a few things to verify -
1. Can you show us a screenshot of the scheduled downtime on the XI interface?
2. Is it applying properly in the Core interface? Could you post a screenshot of the Core interface - http://ip.of.nagios/nagios/ (replace ip.of.nagios with the IP/hostname of the machine) - then login with an admin account.
3. Please run the following commands and post the output, to verify the timezone set on your XI machine.
[root][lbnss22][/usr/local/nagios/libexec][git::master][0]
# ls -l /etc/localtime
lrwxrwxrwx 1 root root 33 Mar 2 2015 /etc/localtime -> /usr/share/zoneinfo/Europe/Zurich
[root][lbnss22][/usr/local/nagios/libexec][git::master][0]
# php -r 'echo date("D M j G:i:s T Y")."\n";'
No log handling enabled - turning on stderr logging
/usr/local/nagioslogserver/mibs/NAGIOS-ROOT-MIB.txt: No such file or directory
/usr/local/nagioslogserver/mibs/NAGIOS-NOTIFY-MIB.txt: No such file or directory
/usr/local/nagioslogserver/mibs/NAGIOS-ROOT-MIB.txt: No such file or directory
/usr/local/nagioslogserver/mibs/NAGIOS-NOTIFY-MIB.txt: No such file or directory
Mon Nov 21 17:05:54 CET 2016
[root][lbnss22][/usr/local/nagios/libexec][git::master][0]
# date
Mon Nov 21 17:05:55 CET 2016
[root][lbnss22][/usr/local/nagios/libexec][git::master][0]
# mysql -unagiosxi -pn@gweb -e "SELECT NOW();"
ERROR 1045 (28000): Access denied for user 'nagiosxi'@'localhost' (using password: YES)
service nagios stop
killall -9 nagios
service ndo2db stop
service mysqld restart
for i in `ipcs -q | grep nagios |awk '{print $2}'`; do ipcrm -q $i; done
service ndo2db start
service nagios start
And you should only have two nagios daemon processes, any more any you have a problem: