[Nagios-devel] BUG HostStateFlapping calculation - RFC
Posted: Wed Feb 15, 2006 1:29 am
This is a multi-part MIME message
--Boundary-00=_GRv8DLNWc6FA9BE
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline
Hello,
IMHO at the moment, hoststateflapping is calculated at a very weird way.
A "no-state-change" event is added to the flapping history if wait_threshold
has elapsed.
Actual calculation:
wait_threshold=(hst->total_service_check_interval*interval_length)/hst->total_services;
In other words: if a host with 10 services, all in a 5 minutes interval
checked is flapping, a "no-state-change" event is added every 50 Minutes, but
hostchecks (for a down host) are executed every 30 seconds.
In our installation, we use a service called "config_backup" which makes a
backup of our routers/switches and is called every 1440 Minutes. The
wait_threshold raises very high, with only one config service attached to the
host. So a very long time without state change must be passed for a
non-flapping state.
The attached patch calculates the wait_threshold as a
"average_service_check_interval" (W):
(1/N(1)) + (1/N(2)) + ... + (1/N(n)) = (1/W)
In other words: if a host with 10 services, all in a 5 minute interval checked
is flapping, a "no-state-change" event is added every 30 seconds.
RFC
Best regards
Percy Jahn
--Boundary-00=_GRv8DLNWc6FA9BE
Content-Type: application/octet-stream
Content-Transfer-Encoding: base64
Content-Disposition: attachment;
filename="hoststateflapping.patch"
LS0tIG5hZ2lvcy1jdnMvaW5jbHVkZS9vYmplY3RzX29yZy5oCTIwMDYtMDItMTQgMTY6NTA6
MzAuMDAwMDAwMDAwICswMTAwCisrKyBuYWdpb3MtY3ZzL2luY2x1ZGUvb2JqZWN0cy5oCTIw
MDYtMDItMTQgMTY6NTE6MDIuMDAwMDAwMDAwICswMTAwCkBAIC0yMDEsNiArMjAxLDcgQEAg
dHlwZWRlZiBzdHJ1Y3QgaG9zdF9zdHJ1Y3R7CiAJZG91YmxlICBwZXJjZW50X3N0YXRlX2No
YW5nZTsKIAlpbnQgICAgIHRvdGFsX3NlcnZpY2VzOwogCXVuc2lnbmVkIGxvbmcgdG90YWxf
c2VydmljZV9jaGVja19pbnRlcnZhbDsKKwl1bnNpZ25lZCBsb25nIGF2Z19zZXJ2aWNlX2No
ZWNrX2ludGVydmFsOwogCXVuc2lnbmVkIGxvbmcgbW9kaWZpZWRfYXR0cmlidXRlczsKIAlp
bnQgICAgIGNpcmN1bGFyX3BhdGhfY2hlY2tlZDsKIAlpbnQgICAgIGNvbnRhaW5zX2NpcmN1
bGFyX3BhdGg7Ci0tLSBuYWdpb3MtY3ZzL2Jhc2UvZmxhcHBpbmdfb3JnLmMJMjAwNi0wMi0x
NCAxNjo0OTozOS4wMDAwMDAwMDAgKzAxMDAKKysrIG5hZ2lvcy1jdnMvYmFzZS9mbGFwcGlu
Zy5jCTIwMDYtMDItMTQgMTY6NTI6MDUuMDAwMDAwMDAwICswMTAwCkBAIC0xNzUsNyArMTc1
LDcgQEAgdm9pZCBjaGVja19mb3JfaG9zdF9mbGFwcGluZyhob3N0ICpoc3QsIAogCQl3YWl0
X3RocmVzaG9sZD1oc3QtPm5vdGlmaWNhdGlvbl9pbnRlcnZhbCppbnRlcnZhbF9sZW5ndGg7
CiAJCiAJZWxzZQotCQl3YWl0X3RocmVzaG9sZD0oaHN0LT50b3RhbF9zZXJ2aWNlX2NoZWNr
X2ludGVydmFsKmludGVydmFsX2xlbmd0aCkvaHN0LT50b3RhbF9zZXJ2aWNlczsKKwkJd2Fp
dF90aHJlc2hvbGQ9aHN0LT5hdmdfc2VydmljZV9jaGVja19pbnRlcnZhbCppbnRlcnZhbF9s
ZW5ndGg7CiAKIAkvKiBpZiB3ZSBoYXZlbid0IHdhaXRlZCBsb25nIGVub3VnaCBzaW5jZSBs
YXN0IHJlY29yZCwgb25seSB1cGRhdGUgaWYgd2UndmUgaGFkIGEgc3RhdGUgY2hhbmdlICov
CiAJaWYoKGN1cnJlbnRfdGltZS1oc3QtPmxhc3Rfc3RhdGVfaGlzdG9yeV91cGRhdGUpPHdh
aXRfdGhyZXNob2xkKXsKLS0tIG5hZ2lvcy1jdnMvYmFzZS9jb25maWdfb3JnLmMJMjAwNi0w
Mi0xNCAxNjo1MjoyMS4wMDAwMDAwMDAgKzAxMDAKKysrIG5hZ2lvcy1jdnMvYmFzZS9jb25m
aWcuYwkyMDA2LTAyLTE0IDE3OjAyOjEwLjAwMDAwMDAwMCArMDEwMApAQCAtMTYzNiw3ICsx
NjM2LDcgQEAgaW50IHByZV9mbGlnaHRfY2hlY2sodm9pZCl7CiAJaW50IHRvdGFsX29iamVj
dHM9MDsKIAlpbnQgd2FybmluZ3M9MDsKIAlpbnQgZXJyb3JzPTA7Ci0KKwl1bnNpZ25lZCBs
b25nIGF2Z19pbnRlcnZhbF9jb3VudGVyPTA7CiAjaWZkZWYgREVCVUcwCiAJcHJpbnRmKCJw
cmVfZmxpZ2h0X2NoZWNrKCkgc3RhcnRcbiIpOwogI2VuZGlmCkBAIC0yODEyLDcgKzI4MTIs
MTggQEAgaW50IHByZV9mbGlnaHRfY2hlY2sodm9pZCl7CiAJCQl0ZW1wX2hvc3QtPnRvdGFs
X3NlcnZpY2VfY2hlY2tfaW50ZXJ2YWwrPXRlbXBfc2VydmljZS0+Y2hlY2tfaW50ZXJ2YWw7
CiAJCSAgICAgICAgfQogCSAgICAgICAgfQotCisJLyogY2FsY3VsYXRlIGF2ZXJhZ2Ugc2Vy
dmljZSBjaGVjayBpbnRlcnZhbCAqLworCWZvcih0ZW1wX3NlcnZpY2U9c2VydmljZV9saXN0
O3RlbXBfc2VydmljZSE9TlVMTDt0ZW1wX3NlcnZpY2U9dGVtcF9zZXJ2aWNlLT5uZXh0KXsK
KwkJaWYoKHRlbXBfaG9zdD1maW5kX2hvc3QodGVtcF9zZXJ2aWNlLT5ob3N0X25hbWUpKSl7
CisJCQlpZih0ZW1wX3NlcnZpY2UtPmNoZWNrX2ludGVydmFsICE9IDApIHsKKwkJCQlhdmdf
aW50ZXJ2YWxfY291bnRlciArPSB0ZW1wX2hvc3QtPnRvdGFsX3NlcnZpY2VfY2hlY2tfaW50
ZXJ2YWwgLyB0ZW1wX3NlcnZpY2UtPmNoZWNrX2ludGVydmFsOworCQkJCX0KKwkJCX0KKwkJ
fQorCWlmKGF2Z19pbnRlcnZhbF9jb3VudGVyID4gMCkgeworCQl0ZW1wX2hvc3QtPmF2Z19z
ZXJ2a
...[email truncated]...
This post was automatically imported from historical nagios-devel mailing list archives
Original poster: [email protected]
--Boundary-00=_GRv8DLNWc6FA9BE
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline
Hello,
IMHO at the moment, hoststateflapping is calculated at a very weird way.
A "no-state-change" event is added to the flapping history if wait_threshold
has elapsed.
Actual calculation:
wait_threshold=(hst->total_service_check_interval*interval_length)/hst->total_services;
In other words: if a host with 10 services, all in a 5 minutes interval
checked is flapping, a "no-state-change" event is added every 50 Minutes, but
hostchecks (for a down host) are executed every 30 seconds.
In our installation, we use a service called "config_backup" which makes a
backup of our routers/switches and is called every 1440 Minutes. The
wait_threshold raises very high, with only one config service attached to the
host. So a very long time without state change must be passed for a
non-flapping state.
The attached patch calculates the wait_threshold as a
"average_service_check_interval" (W):
(1/N(1)) + (1/N(2)) + ... + (1/N(n)) = (1/W)
In other words: if a host with 10 services, all in a 5 minute interval checked
is flapping, a "no-state-change" event is added every 30 seconds.
RFC
Best regards
Percy Jahn
--Boundary-00=_GRv8DLNWc6FA9BE
Content-Type: application/octet-stream
Content-Transfer-Encoding: base64
Content-Disposition: attachment;
filename="hoststateflapping.patch"
LS0tIG5hZ2lvcy1jdnMvaW5jbHVkZS9vYmplY3RzX29yZy5oCTIwMDYtMDItMTQgMTY6NTA6
MzAuMDAwMDAwMDAwICswMTAwCisrKyBuYWdpb3MtY3ZzL2luY2x1ZGUvb2JqZWN0cy5oCTIw
MDYtMDItMTQgMTY6NTE6MDIuMDAwMDAwMDAwICswMTAwCkBAIC0yMDEsNiArMjAxLDcgQEAg
dHlwZWRlZiBzdHJ1Y3QgaG9zdF9zdHJ1Y3R7CiAJZG91YmxlICBwZXJjZW50X3N0YXRlX2No
YW5nZTsKIAlpbnQgICAgIHRvdGFsX3NlcnZpY2VzOwogCXVuc2lnbmVkIGxvbmcgdG90YWxf
c2VydmljZV9jaGVja19pbnRlcnZhbDsKKwl1bnNpZ25lZCBsb25nIGF2Z19zZXJ2aWNlX2No
ZWNrX2ludGVydmFsOwogCXVuc2lnbmVkIGxvbmcgbW9kaWZpZWRfYXR0cmlidXRlczsKIAlp
bnQgICAgIGNpcmN1bGFyX3BhdGhfY2hlY2tlZDsKIAlpbnQgICAgIGNvbnRhaW5zX2NpcmN1
bGFyX3BhdGg7Ci0tLSBuYWdpb3MtY3ZzL2Jhc2UvZmxhcHBpbmdfb3JnLmMJMjAwNi0wMi0x
NCAxNjo0OTozOS4wMDAwMDAwMDAgKzAxMDAKKysrIG5hZ2lvcy1jdnMvYmFzZS9mbGFwcGlu
Zy5jCTIwMDYtMDItMTQgMTY6NTI6MDUuMDAwMDAwMDAwICswMTAwCkBAIC0xNzUsNyArMTc1
LDcgQEAgdm9pZCBjaGVja19mb3JfaG9zdF9mbGFwcGluZyhob3N0ICpoc3QsIAogCQl3YWl0
X3RocmVzaG9sZD1oc3QtPm5vdGlmaWNhdGlvbl9pbnRlcnZhbCppbnRlcnZhbF9sZW5ndGg7
CiAJCiAJZWxzZQotCQl3YWl0X3RocmVzaG9sZD0oaHN0LT50b3RhbF9zZXJ2aWNlX2NoZWNr
X2ludGVydmFsKmludGVydmFsX2xlbmd0aCkvaHN0LT50b3RhbF9zZXJ2aWNlczsKKwkJd2Fp
dF90aHJlc2hvbGQ9aHN0LT5hdmdfc2VydmljZV9jaGVja19pbnRlcnZhbCppbnRlcnZhbF9s
ZW5ndGg7CiAKIAkvKiBpZiB3ZSBoYXZlbid0IHdhaXRlZCBsb25nIGVub3VnaCBzaW5jZSBs
YXN0IHJlY29yZCwgb25seSB1cGRhdGUgaWYgd2UndmUgaGFkIGEgc3RhdGUgY2hhbmdlICov
CiAJaWYoKGN1cnJlbnRfdGltZS1oc3QtPmxhc3Rfc3RhdGVfaGlzdG9yeV91cGRhdGUpPHdh
aXRfdGhyZXNob2xkKXsKLS0tIG5hZ2lvcy1jdnMvYmFzZS9jb25maWdfb3JnLmMJMjAwNi0w
Mi0xNCAxNjo1MjoyMS4wMDAwMDAwMDAgKzAxMDAKKysrIG5hZ2lvcy1jdnMvYmFzZS9jb25m
aWcuYwkyMDA2LTAyLTE0IDE3OjAyOjEwLjAwMDAwMDAwMCArMDEwMApAQCAtMTYzNiw3ICsx
NjM2LDcgQEAgaW50IHByZV9mbGlnaHRfY2hlY2sodm9pZCl7CiAJaW50IHRvdGFsX29iamVj
dHM9MDsKIAlpbnQgd2FybmluZ3M9MDsKIAlpbnQgZXJyb3JzPTA7Ci0KKwl1bnNpZ25lZCBs
b25nIGF2Z19pbnRlcnZhbF9jb3VudGVyPTA7CiAjaWZkZWYgREVCVUcwCiAJcHJpbnRmKCJw
cmVfZmxpZ2h0X2NoZWNrKCkgc3RhcnRcbiIpOwogI2VuZGlmCkBAIC0yODEyLDcgKzI4MTIs
MTggQEAgaW50IHByZV9mbGlnaHRfY2hlY2sodm9pZCl7CiAJCQl0ZW1wX2hvc3QtPnRvdGFs
X3NlcnZpY2VfY2hlY2tfaW50ZXJ2YWwrPXRlbXBfc2VydmljZS0+Y2hlY2tfaW50ZXJ2YWw7
CiAJCSAgICAgICAgfQogCSAgICAgICAgfQotCisJLyogY2FsY3VsYXRlIGF2ZXJhZ2Ugc2Vy
dmljZSBjaGVjayBpbnRlcnZhbCAqLworCWZvcih0ZW1wX3NlcnZpY2U9c2VydmljZV9saXN0
O3RlbXBfc2VydmljZSE9TlVMTDt0ZW1wX3NlcnZpY2U9dGVtcF9zZXJ2aWNlLT5uZXh0KXsK
KwkJaWYoKHRlbXBfaG9zdD1maW5kX2hvc3QodGVtcF9zZXJ2aWNlLT5ob3N0X25hbWUpKSl7
CisJCQlpZih0ZW1wX3NlcnZpY2UtPmNoZWNrX2ludGVydmFsICE9IDApIHsKKwkJCQlhdmdf
aW50ZXJ2YWxfY291bnRlciArPSB0ZW1wX2hvc3QtPnRvdGFsX3NlcnZpY2VfY2hlY2tfaW50
ZXJ2YWwgLyB0ZW1wX3NlcnZpY2UtPmNoZWNrX2ludGVydmFsOworCQkJCX0KKwkJCX0KKwkJ
fQorCWlmKGF2Z19pbnRlcnZhbF9jb3VudGVyID4gMCkgeworCQl0ZW1wX2hvc3QtPmF2Z19z
ZXJ2a
...[email truncated]...
This post was automatically imported from historical nagios-devel mailing list archives
Original poster: [email protected]