Slowness troubleshooting --> 5.4.11 to 5.4.13.
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
Gearman was removed back in November of last year due to this very problem causing gearman checks to fail.
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
We may need to increase the time nagios waits during a restart, here's a doc outlining the procedure
https://support.nagios.com/kb/article.php?id=172
https://support.nagios.com/kb/article.php?id=172
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
In production I had 1..40 which ahs been around since we originally implemented gearman and had those issues so I know this won't solve our issue. Our test environment still had 1 2 3 4 5 6 7 8 9 10 but test isn't really doing much and there are about 3 users that use it which also includes myself.
I don't believe this will help.
I don't believe this will help.
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
I'm not really sure how to proceed, does the date of the extra parent processed correspond to anything particular like the day you upgraded?
We are going to have to watch these server to see when this occurs and try to grab the logs (system logs) or profile as close as possible to the time it occured
We are going to have to watch these server to see when this occurs and try to grab the logs (system logs) or profile as close as possible to the time it occured
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
Well our test environment has 2 parents right now.
nagios 11928 1 0 16:12 ? 00:00:05 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 12140 11928 0 16:12 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29181 1 0 14:23 ? 00:00:09 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29559 29181 0 14:24 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
I'll tar up the logs and PM them?
nagios 11928 1 0 16:12 ? 00:00:05 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 12140 11928 0 16:12 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29181 1 0 14:23 ? 00:00:09 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 29559 29181 0 14:24 ? 00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
I'll tar up the logs and PM them?
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
That would be great a profile zip as well please preferably before killing the extra process (Admin -> System Profile)
Thanks!
Also, it looks like the new process started at 16:12, do you know if that correlates to anything specific, or just a standard apply configuration?
Thanks!
Also, it looks like the new process started at 16:12, do you know if that correlates to anything specific, or just a standard apply configuration?
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
We are apparantly not allowed to send documents of greated than 1M via PM.
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
Sorry, I modified the limit you should be able to upload nowemartine wrote:We are apparantly not allowed to send documents of greated than 1M via PM.
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Slowness troubleshooting --> 5.4.11 to 5.4.13.
I am awaiting your profile, what OS is this?scottwilkerson wrote:That would be great a profile zip as well please preferably before killing the extra process (Admin -> System Profile)
Thanks!
Also, it looks like the new process started at 16:12, do you know if that correlates to anything specific, or just a standard apply configuration?
Oddly the messages log you send only went until Apr 10 11:36:26 which would not include the start at process 16:12