Possibly, the default for notification on a potential problem is to recheck every minute for 5 times before sending a notification. Of course your setup may not be using the defaultsglobalive.nagios wrote: One other note...I don't see any notifications sent out in the log (maybe just a config issue...?).
How do I diagnose 'socket timeout' issues?
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: How do I diagnose 'socket timeout' issues?
-
globalive.nagios
Re: How do I diagnose 'socket timeout' issues?
Ah, yeah, that would make sense. Thanks for clearing that part up.
Any ideas on the rest of it?
Any ideas on the rest of it?
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: How do I diagnose 'socket timeout' issues?
If this only happens to one department, I would have that talk with them about their network.globalive.nagios wrote:Okie dokie, I'll look more closely into DNS issues next time something comes up, and talk to the department about any peculiarities of their network. That should go over well.![]()
It could be a malfunctioning router,switch, hub, etc. The error you describe could be bad packet loss on a switch for the department which would explain why it only happens to that department.
-
globalive.nagios
Re: How do I diagnose 'socket timeout' issues?
Unfortunately the two hosts that come up with these errors most often are on different subnets, although both are VMs.
What I find most perplexing is that only one service will report a timeout - the others on the host are reporting fine. What would explain that?
What I find most perplexing is that only one service will report a timeout - the others on the host are reporting fine. What would explain that?
-
globalive.nagios
Re: How do I diagnose 'socket timeout' issues?
Haha, okay, I think we have a solution. Once I clued in that these were both VMs, I checked out the VM performance, and sure enough, memory balloon was 5-10%!! (for those not in the know, memory balloon = paging to disk, and is REALLY bad for performance)
At this point I think we can consider this a non-issue until we fix that issue. At least now you have another option for your troubleshooting list!
"Question 5b: Does your VM performance suck?"
At this point I think we can consider this a non-issue until we fix that issue. At least now you have another option for your troubleshooting list!
"Question 5b: Does your VM performance suck?"