Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
sreinhardt wrote:Lets get some basic system specs and load settings, ideally while its running sluggishly. This also may be easier to attach as a text message than typing in.
Your memory and process utilization don't seem bad, however your average load is through the roof. 100+ for all 1/5/15 minute intervals is extremely high. Lets get a few more stats and see what else might be going on. Also what total number of checks and intervals are you presently running?
top > /tmp/top (press ctrl+c after a few seconds to break it)
cat /tmp/top
iostat -x
Also note, holy cow man, mcafee and linuxshield appear to be kicking your systems butt. Did this happen to start when they both were installed?
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
It would seem that your largest issue is disk utilization followed closely by fluctuating CPU usage. Have you taken a look at offloading mysql and using a ramdisk for check results and performance data?
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
sreinhardt wrote:It would seem that your largest issue is disk utilization followed closely by fluctuating CPU usage. Have you taken a look at offloading mysql and using a ramdisk for check results and performance data?
I've tried offloading MySQL to another server, but was unable to get data to actually be sent to it. I was able to log into the MySQL server, from the Nagios server, so connectivity was there, but the db never grew. I tried/checked everything myself and coworker could think of, but nothing made a difference. The CPU usage is a new issue. Someone is testing out HBSS and it's hammering the Nagios server CPU. I've never had a RAM disk suggested to me, but I feel like I would need to get the other things resolved for a RAM disk to be fully effective. This Nagios server is a VMware image. Would simply adding a CPU or RAM help resolve this?
Well the fact that you have someone testing something on your production nagios server is a big indicator. Besides what we have suggested, and possibly adding more memory and another cpu or core there is not much else, it seems to be the current environment the server is in as spenser pointed out. It does not look like this is an issue with Nagios specifically.
Well the problem has been resolved. It was not anything Nagios related. I didn't think it was, but wanted to do my due-diligence. One of our VMware guys found that, although there was 4GB or RAM configured, there was a 512MB reserve set. Once the reserve was removed, the CPU usage dropped drastically. I appreciate all the help in confirming it was not in fact a problem with Nagios. I was able to report that this morning and the VMware guys had it resolved before lunch.
Fantastic, glad to hear its working! I'll lock this up then.
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.