Service Status CRITICAL

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
abasha
Posts: 21
Joined: Mon Aug 19, 2013 12:12 am

Service Status CRITICAL

Post by abasha »

Nagios Support,

We are using Nagios XI 2012R2.9 , For most of our windows servers that are being monitored by nagios,
we see the services changing the status to critical,I am herewith mentioned below the messages.

CRITICAL - Socket timeout after 10 seconds
CRITICAL - 172.23.10.115: rta nan, lost 100%

I am herewith attached the screenshot of the messages for your review.

Please advice how do we resolve these errors. Please let me know if you need any additional information.

Best Regards,
Afsal Basha.
You do not have the required permissions to view the files attached to this post.
Stuart Watts
Posts: 40
Joined: Wed Sep 25, 2013 7:01 am

Re: Service Status CRITICAL

Post by Stuart Watts »

Those errors smack of network issues to me - your Ping service is losing 100% of packets, which isn't good! Those 4 services for that machine are also "flapping" (switching between OK and not OK a lot), which may indicate a troublesome network connection to that machine.
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Service Status CRITICAL

Post by slansing »

I would agree with Stuart, for whatever reason you are having trouble reaching the server reliably. I would start with networking, and work down to this:

http://assets.nagios.com/downloads/nagi ... utions.pdf
abasha
Posts: 21
Joined: Mon Aug 19, 2013 12:12 am

Re: Service Status CRITICAL

Post by abasha »

Thanks Stuart and slansing,
I have increased the timeout settings for nrpe_check

Could you kindly tell me the recommended settings for the PING checks esp $ARG1$ & $ARG2$

Best Regards,
sreinhardt
-fno-stack-protector
Posts: 4366
Joined: Mon Nov 19, 2012 12:10 pm

Re: Service Status CRITICAL

Post by sreinhardt »

We would need you to show us the command and service definitions so that we may know what the arguments are in reference to, they could be setup in any number of different ways.
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
abasha
Posts: 21
Joined: Mon Aug 19, 2013 12:12 am

Re: Service Status CRITICAL

Post by abasha »

As requested, I am herewith attached commands.cfg and the screenshot of the service definition for ping check.Please let me know if you need any specific details.

Regards,



Mod Note: Removed bump from OP, if you have additional information feel free to add it to your previous message if you are the last poster. Bumping only serves to place you lower on the reply list...
You do not have the required permissions to view the files attached to this post.
sreinhardt
-fno-stack-protector
Posts: 4366
Joined: Mon Nov 19, 2012 12:10 pm

Re: Service Status CRITICAL

Post by sreinhardt »

The command portion of the service is blanked out, I am guessing this is due to a template in use? Thanks for the commands file, but it is hard to place which command you are using with that service still.
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
Locked