This support forum board is for support questions relating to
Nagios XI , our flagship commercial network monitoring solution.
bwallace
Posts: 1145 Joined: Tue Nov 17, 2015 1:57 pm
Post
by bwallace » Wed Jul 06, 2016 1:54 pm
Thanks, there are 2 PPIDs so this can cause an array of problems....
this one:
nagios 12134 1 0 Jul05 ? 00:12:33 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
And this one:
nagios 21313 1 0 Jun01 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
To resolve, run these commands, wait awhile to see if the emails stop, and let us know of the outcome:
Code: Select all
service nagios stop
killall -9 nagios
service nagios start
Be sure to check out the
Knowledgebase for helpful articles and solutions!
lee.krause
Posts: 86 Joined: Wed Jan 20, 2016 8:38 am
Post
by lee.krause » Wed Jul 06, 2016 2:17 pm
# ps -ef | grep bin/nagios | grep -v grep
nagios 13667 1 2 14:16 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 13669 13667 0 14:16 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 13670 13667 0 14:16 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 13671 13667 0 14:16 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 13672 13667 0 14:16 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
Only 1 process now. I'll keep an eye out for emails.
bwallace
Posts: 1145 Joined: Tue Nov 17, 2015 1:57 pm
Post
by bwallace » Wed Jul 06, 2016 2:20 pm
Yeah that looks good now - we'll stay tuned -
Be sure to check out the
Knowledgebase for helpful articles and solutions!
lee.krause
Posts: 86 Joined: Wed Jan 20, 2016 8:38 am
Post
by lee.krause » Wed Jul 06, 2016 2:41 pm
The messages are still being sent.
# ps -ef | grep bin/nagios | grep -v grep
nagios 15319 1 17 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 15321 15319 0 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15322 15319 0 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15323 15319 0 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15324 15319 0 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15333 15319 0 14:39 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
# service nagios stop
Stopping nagios:. done.
# ps -ef | grep bin/nagios | grep -v grep
#
# service nagios start
Starting nagios: done.
# ps -ef | grep bin/nagios | grep -v grep
nagios 15741 1 21 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 15743 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15744 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15745 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15746 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15751 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
It seems to start 2 processes.
tgriep
Madmin
Posts: 9190 Joined: Thu Oct 30, 2014 9:02 am
Post
by tgriep » Wed Jul 06, 2016 3:01 pm
2 nagios processes are normal. 1 parent and one child.
The problem is when you get 2 parents and 2 children, when you login to the XI GUI, you don't know which one you are connecting to and also which one is running the checks and or sending notifications.
Be sure to check out our
Knowledgebase for helpful articles and solutions!
lee.krause
Posts: 86 Joined: Wed Jan 20, 2016 8:38 am
Post
by lee.krause » Wed Jul 06, 2016 3:40 pm
I'm still getting emails with current time and date.
***** Nagios XI Alert *****
Nagios has detected a problem with this service.
Notification Type: PROBLEM
Service: Memory Usage
Host: xxxxxxxxxxx
State: CRITICAL
Info:
CHECK_NRPE: Error - Could not complete SSL handshake.
Date/Time: 2016-07-06 20:39:58
Just came in.
tgriep
Madmin
Posts: 9190 Joined: Thu Oct 30, 2014 9:02 am
Post
by tgriep » Wed Jul 06, 2016 3:45 pm
Are there 4 Nagios processes running now instead of 2?
Is that notification valid, does the status reflect it?
Be sure to check out our
Knowledgebase for helpful articles and solutions!
lee.krause
Posts: 86 Joined: Wed Jan 20, 2016 8:38 am
Post
by lee.krause » Wed Jul 06, 2016 3:55 pm
Nope the same:
# ps -ef | grep bin/nagios | grep -v grep
nagios 15741 1 0 14:40 ? 00:00:44 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 15743 15741 0 14:40 ? 00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15744 15741 0 14:40 ? 00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15745 15741 0 14:40 ? 00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15746 15741 0 14:40 ? 00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios 15751 15741 0 14:40 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
No the status is Green and not showing the error.
tgriep
Madmin
Posts: 9190 Joined: Thu Oct 30, 2014 9:02 am
Post
by tgriep » Wed Jul 06, 2016 4:33 pm
Try disabling notifications for that service. Make sure they stop sending, then re-enable notifications and see if it starts up again.
Be sure to check out our
Knowledgebase for helpful articles and solutions!
lee.krause
Posts: 86 Joined: Wed Jan 20, 2016 8:38 am
Post
by lee.krause » Wed Jul 06, 2016 4:41 pm
This is happening for several services, not just one.
Since I restarted the Nagios service, I've gotten 316 emails. Each for a different service.