Page 2 of 4

Re: Getting "Error: Could not read host and service status".

Posted: Thu Nov 05, 2015 9:57 am
by rkennedy
Sounds good, let us know your results once you look into httpd.

Re: Getting "Error: Could not read host and service status".

Posted: Tue Dec 08, 2015 7:19 pm
by gavinh
Finally was able to circle back, changing script execution in my php.ini's did not resolve this. Replacing the HD isn't a realistic option, any other ideas?

Re: Getting "Error: Could not read host and service status".

Posted: Tue Dec 08, 2015 7:58 pm
by Box293
What Operating System is Nagios installed on?

Re: Getting "Error: Could not read host and service status".

Posted: Thu Apr 28, 2016 1:05 pm
by gavinh
Following up on this thread. I made changes to httpd.conf and also added a MySQL backend, hoping things would clear up. But I still get these errors and have a serious need to make them go away for the sake of keeping Nagios as our monitoring solution. Any other ideas?

Re: Getting "Error: Could not read host and service status".

Posted: Thu Apr 28, 2016 2:02 pm
by rkennedy
What OS do you have Nagios installed on? Additionally, what other addons / components are you using in conjunction with Nagios?

As you've made changes to httpd.conf, and added a SQL backend, it really becomes harder to further troubleshoot where the issue is.

While we made Core, we can't provide support for external products such as Core running a SQL backend.

Please post your httpd.conf, and entire conf.d folder for us to look at. Also post your apache error and access log for us to look at.

Re: Getting "Error: Could not read host and service status".

Posted: Mon May 02, 2016 5:04 pm
by gavinh
This is on Ubuntu 14.04.

Only change I made in Apache was to the timeouts. This issue started months before I added MySQL and its still a problem. So I do not think MySQL is related. I added in the graphing plugin, pnp4nagios as well. Here are the entries to my configs.

commands.cfg

Code: Select all

# 'process-host-perfdata' command definition
define command{
        command_name    process-host-perfdata
        command_line    /usr/bin/printf "%b" "$LASTHOSTCHECK$\t$HOSTNAME$\t$HOSTSTATE$\t$HOSTATTEMPT$\t$HOSTSTATETYPE$\t$HOSTEXECUTIONTIME$\t$HOSTOUTPUT$\t$HOSTPERFDATA$\n" >> /var/lib/nagios3/host-perfdata.out
        }


# 'process-service-perfdata' command definition
define command{
        command_name    process-service-perfdata
        command_line    /usr/bin/printf "%b" "$LASTSERVICECHECK$\t$HOSTNAME$\t$SERVICEDESC$\t$SERVICESTATE$\t$SERVICEATTEMPT$\t$SERVICESTATETYPE$\t$SERVICEEXECUTIONTIME$\t$SERVICELATENCY$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$\n" >> /var/lib/nagios3/service-perfdata.out
        }

define command {
command_name    process-service-perfdata-file
command_line    /usr/lib/pnp4nagios/libexec/process_perfdata.pl --bulk=/var/lib/pnp4nagios/perfdata/hosts
}

define command {
command_name    process-host-perfdata-file
command_line    /usr/lib/pnp4nagios/libexec/process_perfdata.pl --bulk=/var/lib/pnp4nagios/perfdata/services
}
nagios.cfg

Code: Select all

broker_module=/usr/lib/check_mk/livestatus.o /usr/lib/nagios/mk-livestatus/live
broker_module=/usr/lib/ndoutils/ndomod-mysql-3x.o config_file=/etc/nagios3/ndomod.cfg
event_broker_options=-1
Again this timeout issue existed before all of this was added.

Re: Getting "Error: Could not read host and service status".

Posted: Mon May 02, 2016 5:10 pm
by gavinh
I have 920 hosts and 7200 service tests.

Re: Getting "Error: Could not read host and service status".

Posted: Mon May 02, 2016 5:20 pm
by gavinh
I enabled use_large_installation_tweaks=1 as well just now.

Re: Getting "Error: Could not read host and service status".

Posted: Tue May 03, 2016 12:33 pm
by gavinh
Enabling tweaks did not work. Once the error presents itself, hitting refresh 5-6 time usually loads the page. Mgmt is starting to frown on this though. ;/

Re: Getting "Error: Could not read host and service status".

Posted: Tue May 03, 2016 12:38 pm
by gavinh
I was watching the nagios.log and what I am seeing is this;

Services keep stopping and restarting themselves.

Code: Select all

[1462273898] Warning: Return code of 255 for check of service 'Check IOstat' on host 'MOBILE-U64-926' was out of bounds.
[1462273898] Warning: Return code of 255 for check of service 'Check IOstat' on host 'MOBILE-U64-141' was out of bounds.
[1462273898] Warning: Return code of 255 for check of service 'Check System Load' on host 'MOBILE-U64-141' was out of bounds.
[1462273898] Warning: Return code of 255 for check of service 'Git Sync Delay' on host 'MOBILE-U64-141' was out of bounds.
[1462273899] Warning: Return code of 255 for check of service 'Check System Load' on host 'MOBILE-U64-82' was out of bounds.
[1462273899] Warning: Return code of 255 for check of service 'Disk Usage - /' on host 'MOBILE-U64-82' was out of bounds.
[1462273899] Warning: Return code of 255 for check of service 'Builder Java' on host 'MOBILE-U64-83' was out of bounds.
[1462273899] Warning: Return code of 255 for check of service 'Puppet Last Run' on host 'MOBILE-U64-83' was out of bounds.
[1462273899] Warning: Return code of 255 for check of service 'Git Sync Delay' on host 'MOBILE-U64-926' was out of bounds.
[1462273899] Warning: Return code of 255 for check of service 'Check Disk' on host 'MOBILE-WIN-46' was out of bounds.
[1462273908] Warning: Return code of 255 for check of service 'Git Sync Delay' on host 'MOBILE-U64-82' was out of bounds.
[1462273918] Warning: Return code of 255 for check of service 'Builder Java' on host 'MOBILE-U64-519' was out of bounds.
[1462273918] Warning: Return code of 255 for check of service 'Check Memory Usage' on host 'MOBILE-U64-83' was out of bounds.
[1462273918] Warning: Return code of 255 for check of service 'Check IOstat' on host 'MOBILE-U64-874' was out of bounds.
[1462273918] Warning: Return code of 255 for check of service 'Puppet Last Run' on host 'MOBILE-U64-926' was out of bounds.
[1462273918] Warning: Return code of 255 for check of service 'Check System Load' on host 'MOBILE-U64-83' was out of bounds.
[1462273922] Caught SIGTERM, shutting down...
[1462273922] Successfully shutdown... (PID=24279)
[1462273922] ndomod: Shutdown complete.
[1462273922] Event broker module '/usr/lib/ndoutils/ndomod-mysql-3x.o' deinitialized successfully.
[1462273923] livestatus: Socket thread has terminated
[1462273923] Event broker module '/usr/lib/check_mk/livestatus.o' deinitialized successfully.
[1462273939] Nagios 3.5.1 starting... (PID=32651)
[1462273939] Local time is Tue May 03 04:12:19 PDT 2016
[1462273939] LOG VERSION: 2.0
[1462273939] ndomod: NDOMOD 1.4b9 (10-27-2009) Copyright (c) 2009 Nagios Core Development Team and Community Contributors
[1462273939] ndomod: Successfully connected to data sink.  0 queued items to flush.
[1462273939] Event broker module '/usr/lib/ndoutils/ndomod-mysql-3x.o' initialized successfully.
[1462273939] livestatus: Livestatus 1.2.2p3 by Mathias Kettner. Socket: '/usr/lib/nagios/mk-livestatus/live'
[1462273939] livestatus: Please visit us at http://mathias-kettner.de/
[1462273939] livestatus: Hint: please try out OMD - the Open Monitoring Distribution
[1462273939] livestatus: Please visit OMD at http://omdistro.org
[1462273939] livestatus: Warning: environment_macros are enabled. This might decrease the overall nagios performance
[1462273939] livestatus: archive path /var/log/nagios3/archives
[1462273939] livestatus: Finished initialization. Further log messages go to /var/log/nagios3/livestatus.log
[1462273939] Event broker module '/usr/lib/check_mk/livestatus.o' initialized successfully.
[1462273939] Finished daemonizing... (New PID=32654)