Search found 44 matches

by jpelley
Fri Jul 08, 2016 1:30 pm
Forum: Nagios XI
Topic: ndo2db & rsyslogd ERRORS
Replies: 10
Views: 1629

Re: ndo2db & rsyslogd ERRORS

I still have the NDO messages and the rsyslog messages, the only difference is Nagios isn't hung...
by jpelley
Fri Jul 08, 2016 7:00 am
Forum: Nagios XI
Topic: ndo2db & rsyslogd ERRORS
Replies: 10
Views: 1629

Re: ndo2db & rsyslogd ERRORS

Specifically what happened was that we only saw 1000 out of our 6000 hosts within the XI home screen and the Monitoring Engine Status was red, and no amount of restarting it would turn it on (nor did restarting nagios service). Troubleshooting the difference between core and XI is very interesting a...
by jpelley
Thu Jul 07, 2016 1:41 pm
Forum: Nagios XI
Topic: ndo2db & rsyslogd ERRORS
Replies: 10
Views: 1629

Re: ndo2db & rsyslogd ERRORS

INRE to mysql, yes, after reboot, logentries table was broken, ran the repair script just after 10AM to fix (not sure which came first, chicken or the egg) [root@usalfd0nagxi01 local]# ipcs -q ------ Message Queues -------- key msqid owner perms used-bytes messages 0xd5000002 32768 nagios 600 0 0 [r...
by jpelley
Thu Jul 07, 2016 12:40 pm
Forum: Nagios XI
Topic: ndo2db & rsyslogd ERRORS
Replies: 10
Views: 1629

Re: ndo2db & rsyslogd ERRORS

INRE to uploading the profile.zip "The file is too big, maximum allowed size is 1 MiB"

I've checked, and the my profile.zip is 1.30MB
by jpelley
Thu Jul 07, 2016 10:13 am
Forum: Nagios XI
Topic: ndo2db & rsyslogd ERRORS
Replies: 10
Views: 1629

ndo2db & rsyslogd ERRORS

Nagios environment has been less than stable lately, this morning Nagios was hung and the following were the prevailing errors in /var/log/messages: Jul 7 08:52:56 usalfd0nagxi01 ndo2db: Error: queue recv error. Jul 7 08:52:56 usalfd0nagxi01 rsyslogd-2177: imuxsock begins to drop messages from pid 2...
by jpelley
Tue May 03, 2016 2:22 pm
Forum: Nagios XI
Topic: Orphaned Host Checks
Replies: 2
Views: 234

Re: Orphaned Host Checks

reboot fixed it, just required more patience than I possess
by jpelley
Tue May 03, 2016 1:55 pm
Forum: Nagios XI
Topic: Orphaned Host Checks
Replies: 2
Views: 234

Orphaned Host Checks

using regional gearman workers and have been running pretty stable for a couple months now, as of today and for no apparent reason we are getting orphaned host checks. Gearman_top looks clean, no disk space issues, have restarted nagios, (have seen examples in the past of multiple nagios PIDs confli...
by jpelley
Wed Jan 27, 2016 10:56 am
Forum: Nagios XI
Topic: SNMP based scripts no longer work after installing SNMP Trap
Replies: 8
Views: 1635

Re: SNMP based scripts no longer work after installing SNMP

I tested this out yesterday with little to no change. I was able to pull the /mib/ directory from a restore point from a week prior. When adding the mibs back I noticed they were all the same dates as they were before when they worked, i.e. Dec 16th, etc. It is important to note: I did not install t...
by jpelley
Fri Jan 22, 2016 2:38 pm
Forum: Nagios XI
Topic: SNMP based scripts no longer work after installing SNMP Trap
Replies: 8
Views: 1635

Re: SNMP based scripts no longer work after installing SNMP

Ran the command, traps still aren't processing, and more importantly existing SNMP based service checks are still broken. I appreciate any and all help on this issue. Sample output of a check that was working 2 days ago: [root@usalfd0nagxi01 ~]# snmpget -v2c -c <community string> 10.132.10.101 1.3.6...
by jpelley
Fri Jan 22, 2016 8:14 am
Forum: Nagios XI
Topic: SNMP based scripts no longer work after installing SNMP Trap
Replies: 8
Views: 1635

Re: SNMP based scripts no longer work after installing SNMP

[root@usalfd0nagxi01 ~]# ls -l /usr/share/ total 492 drwxr-xr-x. 2 root root 4096 Oct 15 04:57 aclocal drwxr-xr-x. 2 root root 4096 Sep 29 15:30 aclocal-1.11 drwxr-xr-x. 2 root root 4096 Sep 29 15:38 ajaxterm drwxr-xr-x. 4 root root 4096 Sep 29 15:34 alsa drwxr-xr-x. 3 root root 4096 Sep 29 11:41 a...