High Load Advice

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
btemple
Posts: 183
Joined: Fri Nov 08, 2013 9:30 am

High Load Advice

Post by btemple »

We are reaching the end of our full implementation of NAgios XI. We seem to be having high load issues. I was hoping to get some advice. What stats hardware specs can I post for you to take a look ?
I am pretty sure our load spikes are coming from the netapp checks that we run. We have tons of volumes and the response time is a 1-2 seconds per cehck. The checks are currently run ever ten minutes.
You do not have the required permissions to view the files attached to this post.
btemple
Posts: 183
Joined: Fri Nov 08, 2013 9:30 am

Re: High Load Advice

Post by btemple »

More info. There was an obvious change on Jan 18th. I believe this is when the final Net app filer was added to production.
You do not have the required permissions to view the files attached to this post.
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: High Load Advice

Post by abrist »

First, we need some info about the server hardware:

Code: Select all

lscpu > /tmp/info.txt
cat /proc/cpuinfo >> /tmp/info.txt
free -m >> /tmp/info.txt
top -n 1 -b >> /tmp/info.txt
ps -aef >> /tmp/info.txt
Post the file /tmp/info.txt.
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
btemple
Posts: 183
Joined: Fri Nov 08, 2013 9:30 am

Re: High Load Advice

Post by btemple »

Here you go thanks !
You do not have the required permissions to view the files attached to this post.
User avatar
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: High Load Advice

Post by lmiltchev »

Are you currently using modgearman or livestatus? Can you post the nagios.cfg file? Have you tried following any of the steps, outlined in the "Maximizing Performance In Nagios XI" document?
Be sure to check out our Knowledgebase for helpful articles and solutions!
btemple
Posts: 183
Joined: Fri Nov 08, 2013 9:30 am

Re: High Load Advice

Post by btemple »

Not sure for question #1 where would I find the info ?
Answer # 2 attched
Will continue to work through #3
You do not have the required permissions to view the files attached to this post.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: High Load Advice

Post by scottwilkerson »

Lets also run the following so we can see which process is using the CPU, I noticed tripwire on here which I know doesn't come on a default XI install

Code: Select all

ps aux
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
btemple
Posts: 183
Joined: Fri Nov 08, 2013 9:30 am

Re: High Load Advice

Post by btemple »

See attached. Tripwire was installed due to separation of duties auditing
Last edited by scottwilkerson on Mon Feb 02, 2015 4:42 pm, edited 1 time in total.
Reason: deleted file as it contained some sensative info
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: High Load Advice

Post by scottwilkerson »

Are you still seeing similar high load?

One thing I noticed in your last post was that the check_netapp_ng_64 plugin was using a considerable amount of CPU (53.5%).

We didn't create this plugin, nor do we have a netapp server to test against, but it certainly could be a bug in the plugin.
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
btemple
Posts: 183
Joined: Fri Nov 08, 2013 9:30 am

Re: High Load Advice

Post by btemple »

Load comes and goes. I would say it is still high. I originally posted that check as a possible root cause so I am pretty sure that is the issue.
Locked