I'm having a strange problem that I haven't seen before. I have two identical remote Nagios installs on the same remote network, monitoring only local devices (one is monitoring RTD's, the other is monitoring CANBus). The hardware and software is identical (they were built at the same time) with the exception that one has RTD interfaces and the other has CANx. On the RTD one, passive checks are sent back to XI very reliably without any hiccups or problems. On the CANx one, some passive checks are sent seemingly reliably, some unreliably, and some not at all. Concerningly, it's apparent even for the preconfigured localhost checks. On the remote Nagios, all active checks are appearing completely fine. I have tried putting in a Freshness check but it has not resolved the problem
Example:
Nagios XI shows this:
Code: Select all
Memory Usage
Passive Only Check
Ok
14h 33m 50s
1/1
2022-02-23 08:32:05
OK: No data received yet.Code: Select all
Memory Usage
Ok
15h 39m 30s
1/5
2022-02-23 08:33:21
OK - 7282 / 7812 MB (93%) Free Memory, Used: 497 MB, Shared: 26 MB, Buffers + Cached: 343 MBAnother oddity - while the "Service Status - mysqld" check is running happily and successfully on the remote Nagios, the check has never managed to show up on Nagios XI (it has never appeared in Unconfigured Objects).
These checks all run fine on the other remote Nagios install and show up on Nagios XI without a problem.
Any thoughts? I was considering deleting the host and all checks from Nagios XI and allowing them to come in again as Unconfigured Objects. I should also note that the checkresults folder is empty. Things appear there for brief seconds but disappear immediately. It's acting like the remote Nagios is just not sending some of the NDRP data to Nagios XI, but I haven't found a place to watch for the sends to see if it's missing any.
Thank you so much in advance for any suggestions!