abrist wrote: Looks like it was a file lock issue then. This is good to know, as others with large installs may experience the same problems. There was some chat around the office about modernizing the MRTG approach in XI. We will see what happens.
It may be worthwhile to open up a bug report about this at: tracker.nagios.com
Yeah, the problem definitely was that the previous MRTG run did not finish within the 5-minute time frame, so the following MRTG instance could not run because of the lock file. We have a 4 vCPU setup with modern Xeon processors underneath it, so it's not the slowest hardware.
After splitting up the config file and running multiple instances with the "Forks:" option, it now finishes within 15-30 seconds.
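For anyone who wants to try the same split, here is a rough Python sketch of the idea, not the exact procedure we used. It assumes the usual mrtg.cfg layout: global lines such as "WorkDir:", per-target lines of the form "Keyword[name]: value", and continuation lines that start with whitespace. The function name split_mrtg_config and its parameters are made up for illustration. It copies all global lines and the [_], [^] and [$] defaults into every part, and deals the real targets out round-robin. As far as I know MRTG derives its lock file name from the config file name, so separate config files should not block each other, but please verify that on your own install.

```python
import re

# Per-target lines look like "Target[sw1_port1]: ..."; capture the target name.
TARGET_LINE = re.compile(r"^[A-Za-z][\w-]*\[([^\]]+)\]\s*:")
# MRTG pseudo-targets (default, prepend, append) apply to all targets.
SPECIAL_NAMES = {"_", "^", "$"}

def split_mrtg_config(text, parts, forks=None):
    """Return `parts` config strings, each holding all global lines
    plus a round-robin share of the targets (hypothetical helper)."""
    global_lines = []
    targets = {}              # target name -> its lines, in first-seen order
    current = global_lines    # list that continuation lines are attached to
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue          # blank lines and comments are dropped in this sketch
        if line[0].isspace():
            current.append(line)   # continuation of the previous keyword
            continue
        m = TARGET_LINE.match(line)
        if m and m.group(1) not in SPECIAL_NAMES:
            current = targets.setdefault(m.group(1), [])
        else:
            current = global_lines
        current.append(line)
    # Assumption: add a Forks line only if the file does not already set one.
    if forks is not None and not any(
        l.lower().startswith("forks:") for l in global_lines
    ):
        global_lines.append(f"Forks: {forks}")
    chunks = [[] for _ in range(parts)]
    for i, lines in enumerate(targets.values()):
        chunks[i % parts].extend(lines)
    return ["\n".join(global_lines + chunk) + "\n" for chunk in chunks]
```

One caveat: the sketch moves all global and default lines to the top of each part, so a config that changes its [_] defaults midway through the file would need more careful handling. Each resulting file would then get its own MRTG cron entry.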
I'd like to give some input regarding the bandwidth monitoring approach of Nagios XI.
1. When using the switch wizard, it creates an mrtg.cfg with all the ports it discovered. So even if you deselect ports in the web frontend because you don't want
to monitor them, MRTG polls them nonetheless. This is suboptimal, especially for big environments.
2. More of a general problem: bandwidth monitoring doubles the I/O, because MRTG saves the results in its RRD files and then Nagios reads these files and
saves the values in its own RRD files.
3. I have noticed that with the default Nagios XI install, MRTG is set up without the "Forks:" option in the mrtg.cfg. The right value may be difficult to tune depending on
the hardware, but it speeds up the polling jobs in large environments because MRTG forks itself to run them in parallel.
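To illustrate point 1, here is a minimal Python sketch of how unmonitored ports could be pruned from a generated mrtg.cfg. This is not something Nagios XI does today, as far as I can tell. The helper name prune_targets and the target names in the usage note are hypothetical, and it assumes per-target lines of the form "Keyword[name]: value" with continuation lines starting with whitespace.

```python
import re

# Per-target lines look like "Keyword[target_name]: value".
TARGET_LINE = re.compile(r"^[A-Za-z][\w-]*\[([^\]]+)\]\s*:")
# MRTG pseudo-targets (default, prepend, append) are always kept.
SPECIAL_NAMES = {"_", "^", "$"}

def prune_targets(text, keep):
    """Drop every per-target line whose target name is not in `keep`."""
    kept_lines = []
    keeping = True   # whether the most recent keyword line was kept
    for line in text.splitlines():
        if line[:1].isspace() and line.strip():
            # Continuation line: shares the fate of the keyword line above it.
            if keeping:
                kept_lines.append(line)
            continue
        m = TARGET_LINE.match(line)
        keeping = (not m) or m.group(1) in SPECIAL_NAMES or m.group(1) in keep
        if keeping:
            kept_lines.append(line)
    return "\n".join(kept_lines) + "\n"
```

Calling prune_targets(cfg_text, {"sw1_1"}) would keep the global lines and the sw1_1 target and drop everything else. The keep set would have to match the ports that actually have a Nagios service, otherwise those checks would read stale RRD data.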
Kind regards
Dominik