NRPE and alerts
Posted: Mon Mar 19, 2012 6:38 am
Dear Support team,
We are planning to roll out NRPE on Linux servers in our infrastructure to monitor disks, hardware and system load.
I installed NRPE on a few servers for testing, and everything seems fine. When I shut down a server, Nagios raises a host-down ticket, but sometimes the server takes a while to stop all its services before it halts. In that case Nagios first raises a service ticket stating that NRPE timed out, and only later a host-down ticket. This behaviour is technically correct, but for us the service ticket is an unwanted duplicate alert. Is there a way to avoid it?
I understand one way is to increase the value of max_check_attempts for the NRPE service check, but we don't want to risk setting it too high.
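For reference, the workaround I mean would look roughly like the sketch below. The host name, service description, template name and the check_nrpe command definition are placeholders I am assuming here, not our real configuration, and the numbers are only illustrative.

```
# Hypothetical service definition, for illustration only.
# Assumes a "generic-service" template and a "check_nrpe" command
# that takes the remote command name as $ARG1$.
define service {
    use                   generic-service
    host_name             linux-server01        ; placeholder host
    service_description   NRPE Load             ; placeholder name
    check_command         check_nrpe!check_load
    check_interval        5                     ; normal checks, in time units (minutes by default)
    retry_interval        1                     ; re-check interval while in a soft state
    max_check_attempts    5                     ; failed checks needed before a hard state and notification
}
```

With values like these, the service would only reach a hard state a few minutes after the first failed check, which might give the host check time to mark the host down first. The drawback is that real NRPE failures would also be reported that much later, which is why we don't want to raise it too far.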
Thanks in advance.