2017-07-07 05:18:31 95296 Nagios XI INFO localhost User submitted a command to the subsystem (ID=1119)
2017-07-07 05:01:02 95295 Nagios XI INFO localhost User submitted a command to the subsystem (ID=1117)
At 5:01 Nagios runs an SSH backup. I am not sure what the command at 5:18 is.
ID=1119 is COMMAND_DELETE_SYSTEM_BACKUP, so it deleted a previous backup.
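If you want to pull the command IDs out of the audit log in bulk, here is a rough Python sketch. The line layout is assumed from the two entries quoted above only, and `parse_audit_line` and `KNOWN_COMMANDS` are hypothetical names for illustration, not part of Nagios XI. The lookup table holds only the one ID-to-name mapping mentioned in this thread; any other ID is reported as unknown rather than guessed.

```python
import re
from datetime import datetime

# Assumed layout, inferred only from the two entries quoted in this thread:
# "<date> <time> <entry number> <source/type/host/message> (ID=<n>)"
LINE_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\s+"
    r"(?P<entry>\d+)\s+.*\(ID=(?P<cmd>\d+)\)\s*$"
)

# Only the mapping stated in this thread; other IDs are deliberately left out.
KNOWN_COMMANDS = {1119: "COMMAND_DELETE_SYSTEM_BACKUP"}


def parse_audit_line(line):
    """Return a dict describing one audit log entry, or None if it does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    command_id = int(match.group("cmd"))
    return {
        "time": datetime.strptime(match.group("ts"), "%Y-%m-%d %H:%M:%S"),
        "entry": int(match.group("entry")),
        "command_id": command_id,
        "command_name": KNOWN_COMMANDS.get(command_id, "unknown"),
    }
```

Feeding it each line of an exported audit log gives you the timestamp and command ID side by side, which makes it easier to line them up against the time the problem started.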
This shouldn't cause your setup to fail at all. Before you killed off the processes, did you notice whether multiple nagios processes were running?
I have seen this happen on a Nagios restart when it has to wait too long for the mod_gearman workers to return their results.
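To check for multiple nagios processes the next time this happens, before killing anything, something like the following Python sketch may help. It assumes a Linux host where `ps -eo pid,ppid,comm` is available and that the daemon's command name is `nagios`. The function names are hypothetical. It also assumes that any worker children are forked from the main daemon, so only entries whose parent is not itself a nagios process are treated as separate daemons.

```python
import subprocess


def parse_ps(text, name="nagios"):
    """Return (pid, ppid, command) tuples for ps rows whose command equals `name`."""
    rows = []
    for line in text.splitlines():
        parts = line.split(None, 2)
        # Skip the header row and anything malformed.
        if len(parts) < 3 or not parts[0].isdigit() or not parts[1].isdigit():
            continue
        command = parts[2].strip()
        if command == name:
            rows.append((int(parts[0]), int(parts[1]), command))
    return rows


def top_level(rows):
    """Keep only processes whose parent is not another process in `rows`."""
    pids = {pid for pid, _, _ in rows}
    return [row for row in rows if row[1] not in pids]


def list_nagios_daemons():
    """Run ps and return the nagios processes that look like independent daemons."""
    result = subprocess.run(
        ["ps", "-eo", "pid,ppid,comm"],
        capture_output=True, text=True, check=True,
    )
    return top_level(parse_ps(result.stdout))
```

Calling `list_nagios_daemons()` and getting back more than one entry would suggest that a second daemon was started while the old one was still running, which matches the restart scenario described above.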
There were multiple nagios processes. No one had made any change to cause Nagios to restart at ~5:25AM. No one logged into the server until about 5:40 and commands were not submitted.
Ah! That's right. I forgot to re-implement this! I believe this will help resolve the issue. I am assuming that if I edit this file, the change will take effect the next time a host or service is added or changed and someone hits Apply?
Yes. The next time the Nagios process is restarted (Apply Config), it will wait longer before restarting the process, and hopefully the issue will not happen again.