Ok, in the profile from the NAGIOS GUI, the primary shows the DB logs, while the secondary does not. I am not sure why, even though the server itself shows "/var/log/mysqld.log".
However, the errors you saw are common on our servers. We monitor so many devices that at some point NAGIOS is not able to handle all of the passive checks because of load, and then the DB crashes.
This DB issue is not related to what I am describing in this thread: the "passive check not received" issue happens on just a few devices on the primary servers, at times when there are no database errors.
To me it seems that the primary NAGIOS, for some reason, does not like the passive checks from these devices, so it does not process them. Or it could be a device issue, where the device is not able to send all of the passive check packets to NAGIOS. We checked the devices, and to us there seems to be no issue.
=================
The nagios.log is also very large, as we monitor so many devices, but I will try to send you the logs for the primary and secondary by PM.
==============================
Here is the version on both primary and secondary:
cat /usr/local/nrdp/server/config.inc.php | grep product_version
$cfg['product_version']='1.3.1';
Thanks,
Nabi
Getting check results for service are stale by
-
kyang
Re: Getting check results for service are stale by
Since the stale passive checks are only occurring on a specific set of devices, could you post or PM the names of those devices so I can look at the cfg settings?
UPDATE:
Received your message, and shared with the team.
Last edited by kyang on Wed Apr 25, 2018 1:00 pm, edited 1 time in total.
Reason: Shared with techs!
Re: Getting check results for service are stale by
I sent you the device name with the screenshots of NAGIOS GUI to the PM.
Thanks,
Nabi
-
kyang
Re: Getting check results for service are stale by
Thanks, I have received this information.
It's hard to tell. If you could provide more of the logs from /var/log/messages, covering a few minutes before and after the issue happens, that could help us.
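Since the logs are large, one way to cut out just the minutes around an incident is a small filter like the one below. This is only a rough sketch: it assumes the traditional syslog timestamp at the start of each line (for example "Apr 25 13:00:01"), English month names, and that the year is the same as the incident's. The function name and the sample lines are made up for illustration and are not real output from your servers.

```python
from datetime import datetime, timedelta


def lines_in_window(lines, incident, minutes=5):
    """Return the log lines stamped within +/- `minutes` of `incident`.

    Assumes each line starts with a 15-character syslog timestamp,
    which carries no year, so the incident's year is borrowed.
    """
    earliest = incident - timedelta(minutes=minutes)
    latest = incident + timedelta(minutes=minutes)
    selected = []
    for line in lines:
        try:
            stamp = datetime.strptime(
                f"{incident.year} {line[:15]}", "%Y %b %d %H:%M:%S"
            )
        except ValueError:
            continue  # skip lines without a parsable timestamp
        if earliest <= stamp <= latest:
            selected.append(line)
    return selected


# Hypothetical sample lines, only to show the shape of the input.
sample = [
    "Apr 25 12:50:00 host nagios: a",
    "Apr 25 12:58:30 host nagios: b",
    "Apr 25 13:04:59 host nagios: c",
    "Apr 25 13:10:00 host nagios: d",
    "not a log line",
]
incident = datetime(2018, 4, 25, 13, 0, 0)
window = lines_in_window(sample, incident, minutes=5)
```

To use it on a real file, pass the open file object as `lines`, for example `lines_in_window(open("/var/log/messages"), incident)`.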
I see you have most passive service checks set to 720. How often are the NRDP passive checks being done?
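The reason for asking: if the 720 is a freshness threshold in seconds (an assumption on my part, as is the 300-second send interval used below), then how often the devices send decides how many lost results it takes before a check is flagged stale. A quick way to reason about it, as a simplified sketch that ignores any extra latency allowance Nagios may apply:

```python
def max_missed_sends(send_interval, freshness_threshold):
    """Consecutive lost results tolerated before a check goes stale.

    Both arguments are in seconds. A result is treated as stale once
    its age exceeds the threshold. Returns -1 when the interval is
    already longer than the threshold, i.e. the check goes stale even
    if nothing is lost.
    """
    if send_interval <= 0:
        raise ValueError("send_interval must be positive")
    return int(freshness_threshold // send_interval) - 1


# Hypothetical numbers: a device sending every 300 s against a 720 s threshold.
tolerated = max_missed_sends(300, 720)
```

With those numbers a single lost result is survivable (the next one arrives at 600 seconds), but two in a row push the age to 900 seconds and the check is reported stale. If the devices send at an interval close to or above the threshold, even one delayed result would be enough.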