need critical_max_check_attempts
Posted: Fri Jan 15, 2016 11:29 am
One of the problems with the CPU check is that sometimes 100% CPU utilization is OK. Generally LOADAVE is a much better indicator of a saturated system, and thus a more worthy plugin for sending notifications.
CPU checks can cause too many notifications because of the ceiling. You can't set a threshold greater than 100% utilization (or less than 0% idle, as it may also be measured).
So I'm thinking it would be good to have a critical_max_check_attempts for CPU. This way, we could have a state change based on the number of consecutive non-OK checks, and not necessarily on the value of the return result.
I guess this could be achieved with a smart plugin that tracked its own attempt count, returning 2 when it reached critical_max_check_attempts. But it may also be useful as a server config. What do you think?
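To make the smart plugin idea concrete, here is a rough sketch in Python of what I have in mind. Everything in it is an assumption for illustration: the evaluate function, the JSON state file, and the choice to return 1 (WARNING) while the count is still building up are all made up, and only the 0/1/2 return codes follow the usual plugin convention. The CPU reading itself is passed in rather than measured, so it shows only the counting logic.

```python
import json

# Conventional plugin return codes.
OK, WARNING, CRITICAL = 0, 1, 2


def evaluate(cpu_percent, threshold, critical_max_check_attempts, state_path):
    """Return a plugin exit code based on consecutive non-OK checks.

    Hypothetical helper: the counter is kept in a small JSON file at
    state_path so that it survives between plugin runs.
    """
    # Load the previous consecutive non-OK count; start at 0 if the
    # state file is missing or unreadable.
    try:
        with open(state_path) as f:
            count = int(json.load(f).get("count", 0))
    except (OSError, ValueError, TypeError, AttributeError):
        count = 0

    # A reading at or above the threshold extends the streak;
    # anything below it resets the streak.
    count = count + 1 if cpu_percent >= threshold else 0

    with open(state_path, "w") as f:
        json.dump({"count": count}, f)

    if count == 0:
        return OK
    if count >= critical_max_check_attempts:
        return CRITICAL
    # Assumption: report WARNING while the streak is still short.
    return WARNING
```

With critical_max_check_attempts set to 3, a single check at 100% would only return 1, and the plugin would return 2 on the third consecutive check at or above the threshold. One check below the threshold resets the count.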