Measuring Website Reliability
Posted: Tue May 15, 2018 1:24 pm
Hey Guys,
We recently had an issue where a webserver was denying approx 25% of requests. This was very bad, but Nagios didn't alert us, since it requires 4 consecutive failures. Even requiring only 2 consecutive failures would mean it would only have a 1 in 16 chance of alerting us.
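To put numbers on that, here is a quick sketch of the arithmetic. It assumes each check fails independently with the same probability, which is a simplification; the function name is just for illustration.

```python
def chance_of_alert(failure_rate: float, consecutive_needed: int) -> float:
    """Probability that `consecutive_needed` checks in a row all fail,
    assuming every check fails independently at `failure_rate`."""
    return failure_rate ** consecutive_needed


print(chance_of_alert(0.25, 2))  # 0.0625, i.e. 1 in 16
print(chance_of_alert(0.25, 4))  # 0.00390625, i.e. 1 in 256
```

So with 4 consecutive failures required, a server dropping a quarter of its requests would trip the alert only about once in 256 check sequences.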
So, I'd like a plugin that will detect low reliability. I was thinking of one that, if the website appeared down (couldn't connect), would then do a series of checks to see if the website's reliability is too low. For example, it could repeatedly load a URL, say 1k times over a 60-second period, and alert us if the failure rate is more than 0.1%.
There are lots of "gotchas" to this: the URL would need to put little or no load on the server, and 1k requests over a 60-second period would be ideal but probably isn't reachable without forking or multiple threads, etc. But the idea is what I'm looking for.
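Roughly what I have in mind, as a minimal sketch using only the Python standard library. This is not an existing plugin: the function names, the 20-thread pool, and the 5% critical threshold are all assumptions for illustration. Only the 0.1% warning level and the 1k-requests-in-60-seconds target come from the idea above, and the exit codes follow the usual Nagios plugin convention (0 OK, 1 WARNING, 2 CRITICAL).

```python
import time
import urllib.request
from concurrent.futures import ThreadPoolExecutor

OK, WARNING, CRITICAL = 0, 1, 2  # usual Nagios plugin exit codes


def fetch_ok(url, timeout=5.0):
    """Return True if the URL answers with a 2xx/3xx status, else False."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 400
    except Exception:
        # Connection errors, timeouts, and HTTP 4xx/5xx all count as failures.
        return False


def failure_rate(url, requests=1000, period=60.0, workers=20, probe=fetch_ok):
    """Spread `requests` probes over roughly `period` seconds using a
    thread pool and return the fraction of probes that failed."""
    interval = period / requests
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = []
        for _ in range(requests):
            futures.append(pool.submit(probe, url))
            time.sleep(interval)  # pace submissions so the server isn't hit in one burst
        results = [f.result() for f in futures]
    return results.count(False) / len(results)


def evaluate(rate, warn=0.001, crit=0.05):
    """Map a failure rate to an exit code and a status line.
    warn=0.001 is the 0.1% figure; crit=0.05 is a made-up example value."""
    if rate >= crit:
        return CRITICAL, f"CRITICAL - failure rate {rate:.2%}"
    if rate >= warn:
        return WARNING, f"WARNING - failure rate {rate:.2%}"
    return OK, f"OK - failure rate {rate:.2%}"


def main(url):
    """Run the probes against `url`, print the status line, return the exit code."""
    code, message = evaluate(failure_rate(url))
    print(message)
    return code
```

To run it as a check, something like `sys.exit(main(sys.argv[1]))` at the bottom would do. A full 60-second run would probably exceed the default check timeout, so the request count and period would need tuning, and the URL should point at something cheap such as a small static file.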
I searched the forums and the Exchange with no luck. Is anyone aware of something that will do this?
Thanks!
Joey