Page 2 of 3
Re: Excessive notification emails
Posted: Wed Jul 06, 2016 1:54 pm
by bwallace
Thanks, there are 2 PPIDs so this can cause an array of problems....
this one:
nagios 12134 1 0 Jul05 ? 00:12:33 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
And this one:
nagios 21313 1 0 Jun01 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
To resolve, run these commands, wait awhile to see if the emails stop, and let us know of the outcome:
Code: Select all
service nagios stop
killall -9 nagios
service nagios start
Re: Excessive notification emails
Posted: Wed Jul 06, 2016 2:17 pm
by lee.krause
# ps -ef | grep bin/nagios | grep -v grep
nagios 13667 1 2 14:16 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 13669 13667 0 14:16 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 13670 13667 0 14:16 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 13671 13667 0 14:16 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 13672 13667 0 14:16 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
Only 1 process now. I'll keep an eye out for emails.
Re: Excessive notification emails
Posted: Wed Jul 06, 2016 2:20 pm
by bwallace
Yeah that looks good now - we'll stay tuned -
Re: Excessive notification emails
Posted: Wed Jul 06, 2016 2:41 pm
by lee.krause
The messages are still being sent.
# ps -ef | grep bin/nagios | grep -v grep
nagios 15319 1 17 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 15321 15319 0 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15322 15319 0 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15323 15319 0 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15324 15319 0 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15333 15319 0 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
# service nagios stop
Stopping nagios:. done.
# ps -ef | grep bin/nagios | grep -v grep
#
# service nagios start
Starting nagios: done.
# ps -ef | grep bin/nagios | grep -v grep
nagios 15741 1 21 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 15743 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15744 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15745 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15746 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15751 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
It seems to start 2 processes.
Re: Excessive notification emails
Posted: Wed Jul 06, 2016 3:01 pm
by tgriep
2 nagios processes are normal. 1 parent and one child.
The problem is when you get 2 parents and 2 children, when you login to the XI GUI, you don't know which one you are connecting to and also which one is running the checks and or sending notifications.
Re: Excessive notification emails
Posted: Wed Jul 06, 2016 3:40 pm
by lee.krause
I'm still getting emails with current time and date.
***** Nagios XI Alert *****
Nagios has detected a problem with this service.
Notification Type: PROBLEM
Service: Memory Usage
Host: xxxxxxxxxxx
State: CRITICAL
Info:
CHECK_NRPE: Error - Could not complete SSL handshake.
Date/Time: 2016-07-06 20:39:58
Just came in.
Re: Excessive notification emails
Posted: Wed Jul 06, 2016 3:45 pm
by tgriep
Are there 4 Nagios processes running now instead of 2?
Is that notification valid, does the status reflect it?
Re: Excessive notification emails
Posted: Wed Jul 06, 2016 3:55 pm
by lee.krause
Nope the same:
# ps -ef | grep bin/nagios | grep -v grep
nagios 15741 1 0 14:40 ? 00:00:44 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 15743 15741 0 14:40 ? 00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15744 15741 0 14:40 ? 00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15745 15741 0 14:40 ? 00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15746 15741 0 14:40 ? 00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15751 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
No the status is Green and not showing the error.
Re: Excessive notification emails
Posted: Wed Jul 06, 2016 4:33 pm
by tgriep
Try disabling notifications for that service. Make sure they stop sending, then re-enable notifications and see if it starts up again.
Re: Excessive notification emails
Posted: Wed Jul 06, 2016 4:41 pm
by lee.krause
This is happening for several services, not just one.
Since I restarted the Nagios service, I've gotten 316 emails. Each for a different service.