I recently upgraded nagios from 3.5.1 to 4.1.1.
Those "core workers" are a great thing for sure, many thanks for that great feature!
The only problem is that a lot of informational messages about workers are logged in the "nagios.log" file and now that file is like 50x the size it was with nagios 3.x.
This makes the browser hangs for a few seconds when accessing the "showlog.cgi". that's very annoying.
I'm currently monitoring a few thousands hosts with a few ten-thousands services, so it happens that sometimes a few of them are down/critical.
In that case I get a lot of these msgs every second:
Code: Select all
Informational Message[18-04-2016 15:16:48] wproc: early_timeout=1; exited_ok=0; wait_status=0; error_code=62;
Informational Message[18-04-2016 15:16:48] wproc: host=V000CAM012; service=(null);
Informational Message[18-04-2016 15:16:48] wproc: CHECK job 8 from worker Core Worker 306 timed out after 30.00s
Informational Message[18-04-2016 15:16:48] wproc: Core Worker 306: job 8 (pid=365) timed out. Killing itI've started to change a few things in the source files...
-lib/worker.c
-base/workers.c
commenting out "wlog" strings and recompiling the program reduced the logging about wprocs, but I don't like doing that.
The best way would be an option in the nagios.cfg file, something like "workers_logging_level = [none,info,warning,critical,debug]..."
please let me know if there's a solution for that. your help is very appreciated.
regards,
Federico