Long apply configurations
Posted: Thu Sep 29, 2016 2:11 am
Hello,
Our apply configurations are taking longer and longer. It can take up to 40 seconds before Nagios XI is back online. Looking around the web I see competing Nagios clones which implemented a system where a parent process is spawned which takes over the monitoring etc. The new configuration is loaded in a duplicate child process. When the new configuration is loaded compeletely, the parent process with the old configuration is killed and the new process takes over resulting in a supposed 'downtime' of only 3-5 seconds.
Is this a feature which can be implemented in Nagios XI? Honestly, the long apply configurations are one of the most annoying features of Nagios XI. During the apply configuration process, there is a Window of 15-20 seconds where the Nagios hosts and services are no longer visible. Then there is a window of 10 seconds where hosts and services which were in downtime / acknowledged are visible in the open service problems views. This results in very confusing situations with duplicate calls and frustrated colleagues.
I understand my Nagios XI instance is bigger then the average, but we really need a better and more consistent solution for the apply configuration process. Please realize about 10 - 20 apply's are done each day resulting in 10-20 timeframes of 40 seconds where our views and dashboards are flashing or not showing anything at all or showing problems that already have been acknowledged.
Thanks for looking into this.
Willem
Our apply configurations are taking longer and longer. It can take up to 40 seconds before Nagios XI is back online. Looking around the web I see competing Nagios clones which implemented a system where a parent process is spawned which takes over the monitoring etc. The new configuration is loaded in a duplicate child process. When the new configuration is loaded compeletely, the parent process with the old configuration is killed and the new process takes over resulting in a supposed 'downtime' of only 3-5 seconds.
Is this a feature which can be implemented in Nagios XI? Honestly, the long apply configurations are one of the most annoying features of Nagios XI. During the apply configuration process, there is a Window of 15-20 seconds where the Nagios hosts and services are no longer visible. Then there is a window of 10 seconds where hosts and services which were in downtime / acknowledged are visible in the open service problems views. This results in very confusing situations with duplicate calls and frustrated colleagues.
I understand my Nagios XI instance is bigger then the average, but we really need a better and more consistent solution for the apply configuration process. Please realize about 10 - 20 apply's are done each day resulting in 10-20 timeframes of 40 seconds where our views and dashboards are flashing or not showing anything at all or showing problems that already have been acknowledged.
Thanks for looking into this.
Willem