Monitoring Event Engine Queue
Posted: Tue Sep 26, 2017 1:28 pm
HI
we are seeing issues with the Monitoring Event Engine queue.
I am not seeing issues in the logs related to ndo or too much messages
Even though I am not seeing this "NDOUtils - Message Queue Exceeded", I followed the recommendations in the link below
https://support.nagios.com/kb/article.php?id=139
This is what I see
# ipcs -q
------ Message Queues --------
key msqid owner perms used-bytes messages
0x50020080 327680 nagios 600 361473024 353001
I dont see issues in mysql. The database is healthy, we dont seem to have corruption in any of the tables or anything like that
I do see JOBs being process, when I run gearman_top2 I see jobs running and being process. I also check the mod_gearman_worker logs and I see jobs being process there as well
I have also tried disableing mod_gearman in nagios.cfg, but didn't make difference
I have also tried restarting nagios, but again it doesnt make any difference
[1506450666] Nagios 4.2.4 starting... (PID=68547)
[1506450666] Local time is Tue Sep 26 14:31:06 EDT 2017
[1506450666] LOG VERSION: 2.0
[1506450666] qh: Socket '/usr/local/nagios/var/rw/nagios.qh' successfully initialized
[1506450666] qh: core query handler registered
[1506450666] nerd: Channel hostchecks registered successfully
[1506450666] nerd: Channel servicechecks registered successfully
[1506450666] nerd: Channel opathchecks registered successfully
[1506450666] nerd: Fully initialized and ready to rock!
[1506450666] wproc: Successfully registered manager as @wproc with query handler
[1506450666] wproc: Registry request: name=Core Worker 68549;pid=68549
[1506450666] wproc: Registry request: name=Core Worker 68550;pid=68550
[1506450666] wproc: Registry request: name=Core Worker 68551;pid=68551
[1506450666] wproc: Registry request: name=Core Worker 68552;pid=68552
[1506450666] mod_gearman: initialized version 2.1.1 (libgearman 0.33)
[1506450666] Event broker module '/usr/lib64/mod_gearman2/mod_gearman2.o' initialized successfully.
[1506450666] ndomod: NDOMOD 2.1.2 (11-14-2016) Copyright (c) 2009 Nagios Core Development Team and Community Contributors
[1506450666] ndomod: Successfully connected to data sink. 0 queued items to flush.
[1506450666] ndomod registered for process data
[1506450666] ndomod registered for log data'
[1506450666] ndomod registered for system command data'
[1506450666] ndomod registered for event handler data'
[1506450666] ndomod registered for notification data'
[1506450666] ndomod registered for comment data'
[1506450666] ndomod registered for downtime data'
[1506450666] ndomod registered for flapping data'
[1506450666] ndomod registered for program status data'
[1506450666] ndomod registered for host status data'
[1506450666] ndomod registered for service status data'
[1506450666] ndomod registered for adaptive program data'
[1506450666] ndomod registered for adaptive host data'
[1506450666] ndomod registered for adaptive service data'
[1506450666] ndomod registered for external command data'
[1506450666] ndomod registered for aggregated status data'
[1506450666] ndomod registered for retention data'
[1506450666] ndomod registered for contact data'
[1506450666] ndomod registered for contact notification data'
[1506450666] ndomod registered for acknowledgement data'
[1506450666] ndomod registered for state change data'
[1506450666] ndomod registered for contact status data'
[1506450666] ndomod registered for adaptive contact data'
[1506450666] Event broker module '/usr/local/nagios/bin/ndomod.o' initialized successfully.
we are seeing issues with the Monitoring Event Engine queue.
I am not seeing issues in the logs related to ndo or too much messages
Even though I am not seeing this "NDOUtils - Message Queue Exceeded", I followed the recommendations in the link below
https://support.nagios.com/kb/article.php?id=139
This is what I see
# ipcs -q
------ Message Queues --------
key msqid owner perms used-bytes messages
0x50020080 327680 nagios 600 361473024 353001
I dont see issues in mysql. The database is healthy, we dont seem to have corruption in any of the tables or anything like that
I do see JOBs being process, when I run gearman_top2 I see jobs running and being process. I also check the mod_gearman_worker logs and I see jobs being process there as well
I have also tried disableing mod_gearman in nagios.cfg, but didn't make difference
I have also tried restarting nagios, but again it doesnt make any difference
[1506450666] Nagios 4.2.4 starting... (PID=68547)
[1506450666] Local time is Tue Sep 26 14:31:06 EDT 2017
[1506450666] LOG VERSION: 2.0
[1506450666] qh: Socket '/usr/local/nagios/var/rw/nagios.qh' successfully initialized
[1506450666] qh: core query handler registered
[1506450666] nerd: Channel hostchecks registered successfully
[1506450666] nerd: Channel servicechecks registered successfully
[1506450666] nerd: Channel opathchecks registered successfully
[1506450666] nerd: Fully initialized and ready to rock!
[1506450666] wproc: Successfully registered manager as @wproc with query handler
[1506450666] wproc: Registry request: name=Core Worker 68549;pid=68549
[1506450666] wproc: Registry request: name=Core Worker 68550;pid=68550
[1506450666] wproc: Registry request: name=Core Worker 68551;pid=68551
[1506450666] wproc: Registry request: name=Core Worker 68552;pid=68552
[1506450666] mod_gearman: initialized version 2.1.1 (libgearman 0.33)
[1506450666] Event broker module '/usr/lib64/mod_gearman2/mod_gearman2.o' initialized successfully.
[1506450666] ndomod: NDOMOD 2.1.2 (11-14-2016) Copyright (c) 2009 Nagios Core Development Team and Community Contributors
[1506450666] ndomod: Successfully connected to data sink. 0 queued items to flush.
[1506450666] ndomod registered for process data
[1506450666] ndomod registered for log data'
[1506450666] ndomod registered for system command data'
[1506450666] ndomod registered for event handler data'
[1506450666] ndomod registered for notification data'
[1506450666] ndomod registered for comment data'
[1506450666] ndomod registered for downtime data'
[1506450666] ndomod registered for flapping data'
[1506450666] ndomod registered for program status data'
[1506450666] ndomod registered for host status data'
[1506450666] ndomod registered for service status data'
[1506450666] ndomod registered for adaptive program data'
[1506450666] ndomod registered for adaptive host data'
[1506450666] ndomod registered for adaptive service data'
[1506450666] ndomod registered for external command data'
[1506450666] ndomod registered for aggregated status data'
[1506450666] ndomod registered for retention data'
[1506450666] ndomod registered for contact data'
[1506450666] ndomod registered for contact notification data'
[1506450666] ndomod registered for acknowledgement data'
[1506450666] ndomod registered for state change data'
[1506450666] ndomod registered for contact status data'
[1506450666] ndomod registered for adaptive contact data'
[1506450666] Event broker module '/usr/local/nagios/bin/ndomod.o' initialized successfully.