Slowness troubleshooting --> 5.4.11 to 5.4.13.
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
Profile sent.
This is RHEL 6.9
At this time I see these:
nagios 1926 1 0 10:46 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 2080 1926 0 10:46 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 11928 1 0 Apr10 ? 00:00:10 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 12140 11928 0 Apr10 ? 00:00:03 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29181 1 0 Apr10 ? 00:00:09 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29559 29181 0 Apr10 ? 00:00:03 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
I'll tar up another set of logs in a few.
This is RHEL 6.9
At this time I see these:
nagios 1926 1 0 10:46 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 2080 1926 0 10:46 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 11928 1 0 Apr10 ? 00:00:10 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 12140 11928 0 Apr10 ? 00:00:03 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29181 1 0 Apr10 ? 00:00:09 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29559 29181 0 Apr10 ? 00:00:03 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
I'll tar up another set of logs in a few.
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
New tar of logs sent.
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
I have looked over everything you sent but there is nothing in there that can make me conclusively say why the processes aren't getting killed off like they should be.
What I can say for 100% fact is that there should not be 2 parent processes, and it can adversely affect the system so I am going to recommend making the following change to make sure it doesn't continue
Edit /etc/init.d/nagios on the server and liik for this line
Just below that like I would like you to add the following:
Then restart nagios
What I can say for 100% fact is that there should not be 2 parent processes, and it can adversely affect the system so I am going to recommend making the following change to make sure it doesn't continue
Edit /etc/init.d/nagios on the server and liik for this line
Code: Select all
echo 'Warning - nagios did not exit in a timely manner'Code: Select all
/usr/bin/killall -9 nagiosCode: Select all
service nagios restartRe: Slowness troubleshooting --> 5.4.11 to 5.4.13.
I just implemented that. I made a change, hit appy and noticed that checks were not happening. I logged onto the server only to find that nagios was not running.
if status_nagios > /dev/null; then
echo ''
echo 'Warning - nagios did not exit in a timely manner'
/usr/bin/killall -9 nagios
else
echo ' done.'
fi
if status_nagios > /dev/null; then
echo ''
echo 'Warning - nagios did not exit in a timely manner'
/usr/bin/killall -9 nagios
else
echo ' done.'
fi
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
Shortly after removing that line in the stanza I started nagios manually on the command line. I then went into the web interface, made a test service inactive, applied the configuration and now I see this.
ps -ef | grep nagios.cfg
nagios 29128 1 0 11:34 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29279 29128 0 11:34 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29551 1 0 11:35 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29814 29551 0 11:36 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
ps -ef | grep nagios.cfg
nagios 29128 1 0 11:34 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29279 29128 0 11:34 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29551 1 0 11:35 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29814 29551 0 11:36 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
Something is really strange in the setup that is adds multiple parent processes every time you apply configuration
Can you PM me if you are available between 9am -2pm CDT tomorrow to schedule a time I may take a look at the system.
Thanks.
Can you PM me if you are available between 9am -2pm CDT tomorrow to schedule a time I may take a look at the system.
Thanks.
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
Will do. I'll PM you around 10AM.
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
Sounds good. I await hearing from you
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
Im available.
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
resolved in remote session, added
killproc_nagios KILL
to the init script if the process didn't terminate in a timely manor
killproc_nagios KILL
to the init script if the process didn't terminate in a timely manor