Page 2 of 2
Re: How do I diagnose 'socket timeout' issues?
Posted: Wed Jan 04, 2012 3:03 pm
by scottwilkerson
globalive.nagios wrote:
One other note...I don't see any notifications sent out in the log (maybe just a config issue...?).
Possibly, the default for notification on a potential problem is to recheck every minute for 5 times before sending a notification. Of course your setup may not be using the defaults
Re: How do I diagnose 'socket timeout' issues?
Posted: Wed Jan 04, 2012 3:23 pm
by globalive.nagios
Ah, yeah, that would make sense. Thanks for clearing that part up.
Any ideas on the rest of it?
Re: How do I diagnose 'socket timeout' issues?
Posted: Wed Jan 04, 2012 4:08 pm
by scottwilkerson
globalive.nagios wrote:Okie dokie, I'll look more closely into DNS issues next time something comes up, and talk to the department about any peculiarities of their network. That should go over well.
If this only happens to one department, I would have that talk with them about their network.
It could be a malfunctioning router,switch, hub, etc. The error you describe could be bad packet loss on a switch for the department which would explain why it only happens to that department.
Re: How do I diagnose 'socket timeout' issues?
Posted: Thu Jan 05, 2012 7:59 am
by globalive.nagios
Unfortunately the two hosts that come up with these errors most often are on different subnets, although both are VMs.
What I find most perplexing is that only one service will report a timeout - the others on the host are reporting fine. What would explain that?
Re: How do I diagnose 'socket timeout' issues?
Posted: Thu Jan 05, 2012 8:37 am
by globalive.nagios
Haha, okay, I think we have a solution. Once I clued in that these were both VMs, I checked out the VM performance, and sure enough, memory balloon was 5-10%!! (for those not in the know, memory balloon = paging to disk, and is REALLY bad for performance)
At this point I think we can consider this a non-issue until we fix that issue. At least now you have another option for your troubleshooting list!
"Question 5b: Does your VM performance suck?"