I/O wait issue

This support forum board is for support questions relating to Nagios Network Analyzer, our network traffic and bandwidth analysis solution.
snapon_admin
Posts: 952
Joined: Mon Jun 10, 2013 10:39 am
Location: Kenosha, WI

I/O wait issue

Post by snapon_admin »

Noticing a high percentage of I/O wait on our NNA server and wondering what options I have to reduce it. Our server has ~100-120 sources, 8 CPU cores (pretty sure it's 2 quad cores but it might be 4 dual cores), 8GB RAM, using the vSphere OVF template. This is what I'm looking at:
[Attachment: IO wait NNA.png]
What can I do?
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises

Re: I/O wait issue

Post by scottwilkerson »

Does this machine share drives with any other machines (e.g. SAN or VMs)?

Does the server have spinning drives or SSDs?
Former Nagios employee
snapon_admin
Posts: 952
Joined: Mon Jun 10, 2013 10:39 am
Location: Kenosha, WI

Re: I/O wait issue

Post by snapon_admin »

Ahh, good thought. I'll have to ask our VM admins as they own it, but I'd be willing to bet it's sharing drive space with other VMs and that it's likely not running on SSDs. I will check and get back to you. Not sure why I didn't think of the most obvious thing first... thanks!
cmerchant
Posts: 546
Joined: Wed Sep 24, 2014 11:19 am

Re: I/O wait issue

Post by cmerchant »

Let us know what you find out. Thanks.
snapon_admin
Posts: 952
Joined: Mon Jun 10, 2013 10:39 am
Location: Kenosha, WI

Re: I/O wait issue

Post by snapon_admin »

Looks like, as I suspected, our VMs are all on shared SAN storage. I don't see it being possible for that to be changed, so is there anything I can do to bring the I/O wait down at all?
sreinhardt
-fno-stack-protector
Posts: 4366
Joined: Mon Nov 19, 2012 12:10 pm

Re: I/O wait issue

Post by sreinhardt »

Are you able to correlate those spikes to anything in particular? Maybe when reports are being run, heavy user activity, or spikes in network traffic being pumped to NNA? They seem to last for relatively short (5-15 minute) windows, and only when they overlap is it really hard on your system for longer periods of time.
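One way to correlate the spikes is to log a timestamped I/O wait figure and line it up against report runs or user activity. A minimal sketch, assuming a Linux host with `/proc/stat` and `awk`; the function name `iowait_pct` is made up for illustration and is not part of NNA:

```shell
# iowait_pct: percentage of CPU time spent in iowait between two samples.
# $1 and $2 are aggregate "cpu ..." lines (first line of /proc/stat),
# taken some seconds apart. Hypothetical helper, not an NNA tool.
iowait_pct() {
    printf '%s\n%s\n' "$1" "$2" | awk '
        {
            # Fields 2-9: user nice system idle iowait irq softirq steal.
            # Guest fields (10+) are already included in user and nice.
            total = 0
            for (i = 2; i <= NF && i <= 9; i++) total += $i
            t[NR] = total
            w[NR] = $6          # field 6 is iowait
        }
        END {
            d = t[2] - t[1]
            if (d > 0) printf "%.1f\n", 100 * (w[2] - w[1]) / d
            else print "0.0"
        }'
}

# Example use (commented out so the block only defines the function):
#   a=$(head -n 1 /proc/stat); sleep 5; b=$(head -n 1 /proc/stat)
#   echo "$(date "+%F %T") iowait=$(iowait_pct "$a" "$b")%"
```

Running the commented example from a loop or a one-minute cron job, and appending the output to a file, would give a timeline to compare against the spikes in the graph.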
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
snapon_admin
Posts: 952
Joined: Mon Jun 10, 2013 10:39 am
Location: Kenosha, WI

Re: I/O wait issue

Post by snapon_admin »

Well, I disabled the Network Analyzer tab in XI, the one that shows up when you click on a host or service, and ever since the I/O spikes have almost completely disappeared. I'm assuming people were clicking on it, possibly not knowing what it even did, and since that tab runs a query against ALL sources on the NNA server it was causing the issue. Since I disabled the tab, though, there has still been a load spike at almost the same time on both days. At around 8-8:30 am yesterday and the day before (after I disabled the tab) the I/O wait shoots up to about 30ish for an hour. After that all is well.
sreinhardt
-fno-stack-protector
Posts: 4366
Joined: Mon Nov 19, 2012 12:10 pm

Re: I/O wait issue

Post by sreinhardt »

That makes a lot of sense as the cause of the increased load. It seems to spike some around the time people start coming into the office, depending on your hours. If you have reports scheduled in XI or NNA that make use of that data, they could be causing this as well.
snapon_admin
Posts: 952
Joined: Mon Jun 10, 2013 10:39 am
Location: Kenosha, WI

Re: I/O wait issue

Post by snapon_admin »

I forget the command, but isn't there a command I can run that will tell me all the scheduled reports that exist on the server?
cmerchant
Posts: 546
Joined: Wed Sep 24, 2014 11:19 am

Re: I/O wait issue

Post by cmerchant »

Scheduled reports are run as cron jobs under the apache user. To see if you have any scheduled reports, you can view the apache crontab:

crontab -u apache -l
That was from a forum post: http://support.nagios.com/forum/viewtop ... 16&t=28472
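Since the remaining spike shows up around 8 am, it may help to narrow the crontab output to entries that can fire in that hour. A rough sketch, assuming standard five-field crontab syntax; `cron_entries_at_hour` is a hypothetical helper, and it only matches a literal hour or `*` in the hour field, so lists, ranges, and steps (such as `8,9`, `7-9`, `*/2`) and `@daily`-style shortcuts would still need checking by hand:

```shell
# cron_entries_at_hour: print crontab entries whose hour field is "*"
# or equals the given hour. Reads crontab text on stdin.
# $1 = hour (0-23). Hypothetical helper, not part of NNA or XI.
cron_entries_at_hour() {
    awk -v h="$1" '
        /^[ \t]*(#|$)/ { next }          # skip comments and blank lines
        NF < 6         { next }          # skip env lines such as MAILTO=root
        $2 == "*" || $2 == h { print }   # field 2 is the hour
    '
}

# Example use (needs permission to read the apache crontab):
#   crontab -u apache -l | cron_entries_at_hour 8
```

Anything this prints is only a candidate; whether a given entry is actually a scheduled report depends on the command it runs.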