We have a large Nagios3 + Puppet setup. We use a Python script that queries PuppetDB for Nagios resources and generates Nagios config files. A few times a day we restart Nagios to pick up the new files. Since we have ~100k service checks, Nagios goes blind for 1 to 2 minutes during a restart. That's bad.
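To illustrate the generation step, here is a minimal sketch (not the actual script). It assumes the resources have already been fetched from PuppetDB as a list of dicts with `type`, `title` and `parameters` keys. The function names, and the choice to skip the Puppet-only parameters `ensure` and `target`, are assumptions made for this example.

```python
# Hypothetical sketch: turn PuppetDB-style resource dicts into Nagios
# "define service" blocks. Fetching from PuppetDB is deliberately left out.

# Parameters assumed to be Puppet-side only, not Nagios directives.
PUPPET_ONLY = {"ensure", "target"}


def render_service(resource):
    """Render one 'define service' block from a resource dict."""
    params = resource.get("parameters", {})
    lines = ["define service {"]
    for key in sorted(params):
        if key in PUPPET_ONLY:
            continue
        # Left-align the directive name, then a space, then its value.
        lines.append("    {:<24} {}".format(key, params[key]))
    lines.append("}")
    return "\n".join(lines) + "\n"


def render_config(resources):
    """Render all Nagios_service resources; other types are ignored."""
    services = [r for r in resources if r.get("type") == "Nagios_service"]
    return "\n".join(render_service(r) for r in services)


if __name__ == "__main__":
    sample = [
        {
            "type": "Nagios_service",
            "title": "web01_http",
            "parameters": {
                "host_name": "web01",
                "service_description": "HTTP",
                "check_command": "check_http",
                "ensure": "present",
            },
        }
    ]
    print(render_config(sample))
```

The output of `render_config` would be written to a file under the Nagios config directory, and that is the point where Nagios3 needs a restart to notice the change.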
We are using Docker / Mesos and lots of dynamic stuff, which we want to start monitoring as it happens. So we want to be able to change the Nagios config once a minute or, preferably, in real time. It seems to me that Nagios3 will not cut it. People have advised us to migrate to Nagios4 or other monitoring solutions. That's probably what we will do, but I wanted to try something different.
So I have created a NEB module that adds a new service check without requiring a restart.
The proof-of-concept code is here: https://gist.github.com/rtoma/3fb1464de ... 5d9e3c0ad5
Is this something other Nagios developers would like to improve upon? It would be great to have a NEB module offering an HTTP API to add hosts / services at runtime.
I am wondering why no one (afaik) has tried something similar.
Regards,
Renzo