I have a problem with the process monitor, it seems to be behind schedule?
As well as receiving this warning on all services in the eventlog Warning: The check of service 'nagiosxi-64 VM Status' on host 'tsamarvca.tharisa.com' looks like it was orphaned (results never came back; last_check=1409025921; next_check=1409026579). I'm scheduling an immediate check of the service...
And after applying the configuration, when adding or editing hosts / services, it reports that the active host and service checks and notifications are disabled, this corrects after an hour
This issue started after the disk filled up with backups.
You do not have the required permissions to view the files attached to this post.
I have run the command and removed the retention file, this had no effect, the time of the events in the eventlog are still 8+ hours behind
Restarting the server has the same effect as restarting the nagios service, active checks and notifications are disabled.
I have to start the monitoring engine or the processing manually after restarting the appliance/ nagios service or when adding hosts/services to enable the checks, or wait 30+ minutes for it to start automatically.
Here is the status of the monitoring process.
monitor.PNG
You do not have the required permissions to view the files attached to this post.
What version of nagios XI are you presently running? Any other neb modules, such as mod_gearman or livestatus? Just to clarify, you did a full server reboot and time did not correct itself? We may need to clear the retention.dat file, so that scheduling will not be so far in the future since it is likely partially if not fully off by the previous time issues. Simply moving the file like below, and restarting the nagios daemon would recreate it.
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
No custom configurations or modules, except for the install of the vmware perl sdk and yum updates.
VMware 64bit appliance
XI 2014R1.4
CentOS release 6.5 (Final)
cpe:/o:centos:linux:6:GA
Full server reboot(s) or nagios service restart(s) does not correct the time.
Renaming or removing the retention.dat does reset the checks to the correct time, but it doesnt keep up and ends up begind and the warning about orphaned ckecks are present again
I do get the following when restarting the nagios service: Warning - nagios did not exit in a timely manner
After restarting the nagios service the monitoring engine stops, I have to start this manually. Or the process state is stoped and i have to start it manually.