Some checks are being duplicated over time
Posted: Wed Dec 04, 2013 3:17 am
Good morning
* Symptoms: Some service check are being duplicated over time (can be trigger by reschedule + reload) which lead to more load on nagios, and trigger false alarm when using check that calculate rates
* Questions
1) Is there some other people coping with this problem, I am running several nagios version 3.4.1, 3.5.1 and problem can be reproduced everywhere ?
* Problem:
- After upgrading from nagios 3.2 to 3.4 to solve some bugs, we did cope with the fact that some check are being duplicated over time until nagios is restarted
- I did test nagios 3.5.1 and problem is still there, I did answer to bug tracker n°0000388 too quickly
* Steps to reproduce
1) Have a check running periodically (5 min or 60s)
2) Reschedule the check manually (from web interface or nagios.cmd pipe file)
3) Reload nagios
=> The check will then occur twice during the timeperiod
Ex: Check is supposed to occur each 60s only, but after reschedule + reload, it runs more often
[Tue Dec 3 16:42:36 2013.099199] [008.0] [pid=17504] ** Service Check Event ==> Host: 'localhost', Service: 'Current Users', Options: 0, Latency: 0.099000 sec
[Tue Dec 3 16:43:07 2013.131875] [008.0] [pid=17504] ** Service Check Event ==> Host: 'localhost', Service: 'Current Users', Options: 0, Latency: 0.131000 sec
[Tue Dec 3 16:43:36 2013.155755] [008.0] [pid=17504] ** Service Check Event ==> Host: 'localhost', Service: 'Current Users', Options: 0, Latency: 0.155000 sec
[Tue Dec 3 16:44:07 2013.186475] [008.0] [pid=17504] ** Service Check Event ==> Host: 'localhost', Service: 'Current Users', Options: 0, Latency: 0.186000 sec
* Related Bug Tracker
- "0000388: multiple checks on a same service after reception of an external command PROCESS_SERVICE_CHECK_RESULT". http://tracker.nagios.org/view.php?id=388
=> I have it closed by mistake, problem is still there on nagios 3.5.1 but need a nagios reload
=> I am trying to get it reopened to correct it
* Topic that might be Related on this forum
- "Duplicate alert emails." http://support.nagios.com/forum/viewtop ... =7&t=22192
=> already posted an update there, no hint
- "dupliate host and service checks" http://support.nagios.com/forum/viewtop ... eck#p76338
=> problem seems different
* Next Steps:
- Test nagios 4.x
- Have mantis bug tracker reopened if possible
Thanks & Best Regards
* Symptoms: Some service check are being duplicated over time (can be trigger by reschedule + reload) which lead to more load on nagios, and trigger false alarm when using check that calculate rates
* Questions
1) Is there some other people coping with this problem, I am running several nagios version 3.4.1, 3.5.1 and problem can be reproduced everywhere ?
* Problem:
- After upgrading from nagios 3.2 to 3.4 to solve some bugs, we did cope with the fact that some check are being duplicated over time until nagios is restarted
- I did test nagios 3.5.1 and problem is still there, I did answer to bug tracker n°0000388 too quickly
* Steps to reproduce
1) Have a check running periodically (5 min or 60s)
2) Reschedule the check manually (from web interface or nagios.cmd pipe file)
3) Reload nagios
=> The check will then occur twice during the timeperiod
Ex: Check is supposed to occur each 60s only, but after reschedule + reload, it runs more often
[Tue Dec 3 16:42:36 2013.099199] [008.0] [pid=17504] ** Service Check Event ==> Host: 'localhost', Service: 'Current Users', Options: 0, Latency: 0.099000 sec
[Tue Dec 3 16:43:07 2013.131875] [008.0] [pid=17504] ** Service Check Event ==> Host: 'localhost', Service: 'Current Users', Options: 0, Latency: 0.131000 sec
[Tue Dec 3 16:43:36 2013.155755] [008.0] [pid=17504] ** Service Check Event ==> Host: 'localhost', Service: 'Current Users', Options: 0, Latency: 0.155000 sec
[Tue Dec 3 16:44:07 2013.186475] [008.0] [pid=17504] ** Service Check Event ==> Host: 'localhost', Service: 'Current Users', Options: 0, Latency: 0.186000 sec
* Related Bug Tracker
- "0000388: multiple checks on a same service after reception of an external command PROCESS_SERVICE_CHECK_RESULT". http://tracker.nagios.org/view.php?id=388
=> I have it closed by mistake, problem is still there on nagios 3.5.1 but need a nagios reload
=> I am trying to get it reopened to correct it
* Topic that might be Related on this forum
- "Duplicate alert emails." http://support.nagios.com/forum/viewtop ... =7&t=22192
=> already posted an update there, no hint
- "dupliate host and service checks" http://support.nagios.com/forum/viewtop ... eck#p76338
=> problem seems different
* Next Steps:
- Test nagios 4.x
- Have mantis bug tracker reopened if possible
Thanks & Best Regards