Nagios reported false load values
Posted: Mon Aug 11, 2014 10:16 am
Hello, all!
We had an interesting thing happen with Nagios recently. We have Nagios Core 3.2.3 monitoring a number of Linux servers distributed between virtual and physical servers, and in two separate datacenters.
After doing some maintenance on the server, Nagios suddenly reported critically high load values for all the servers it tracks - physical and virtual, both datacenters. We checked the actual servers and there were no problems with server loads. The loads came back down as Nagios polled the servers and replaced the bogus loads with real ones.
What would cause Nagios to show all these values? Since they clearly did not exist on the actual machines, I can only conclude that the maintenance in some way affected Nagios. However, the maintenance was in relation to a completely different user and application.
Any ideas on what would cause Nagios to panic like this?
Thanks to all!
We had an interesting thing happen with Nagios recently. We have Nagios Core 3.2.3 monitoring a number of Linux servers distributed between virtual and physical servers, and in two separate datacenters.
After doing some maintenance on the server, Nagios suddenly reported critically high load values for all the servers it tracks - physical and virtual, both datacenters. We checked the actual servers and there were no problems with server loads. The loads came back down as Nagios polled the servers and replaced the bogus loads with real ones.
What would cause Nagios to show all these values? Since they clearly did not exist on the actual machines, I can only conclude that the maintenance in some way affected Nagios. However, the maintenance was in relation to a completely different user and application.
Any ideas on what would cause Nagios to panic like this?
Thanks to all!