How to check whether a Windows or Linux server is unresponsive or frozen

This board serves as an open discussion and support collaboration point for Nagios XI. NOTE: Nagios XI customers should use the Customer Support forum to obtain expedited support.

Postby JosephH » Wed Aug 07, 2019 7:40 am

I have searched the forum for an existing plugin or function that checks whether a server is frozen or unresponsive. For example, the ping test is OK, but the server may not respond to any check command.
JosephH
 
Posts: 14
Joined: Fri Oct 12, 2018 10:04 am

Re: How to check window and linux server unresponsive or fre

Postby scottwilkerson » Wed Aug 07, 2019 7:45 am

This depends on the agent you are using but generally, this manifests itself as getting UNKNOWN or CRITICAL errors on all the service checks for the host you are checking.

If the server isn't responding to the check command, this would be a non-OK state for that service.

To directly answer the question though, there is no check that gets the server to report that it is unresponsive, because it would not respond to that check either.
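
The symptom described above (every service check on the host coming back UNKNOWN or CRITICAL) can be sketched as a small helper. This is only an illustration, not a Nagios XI or NCPA feature: the function name and the list of return codes are hypothetical, and the numeric codes follow the usual plugin convention (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```python
# Standard Nagios plugin return codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def host_looks_unresponsive(service_states):
    """Return True when every service check on a host is CRITICAL or UNKNOWN.

    `service_states` is a hypothetical list of integer return codes, one per
    service check configured for the host. An empty list carries no evidence
    either way, so it is treated as "not unresponsive".
    """
    if not service_states:
        return False
    return all(state in (CRITICAL, UNKNOWN) for state in service_states)


if __name__ == "__main__":
    # All checks failing: the host is probably hung or unreachable.
    print(host_looks_unresponsive([CRITICAL, UNKNOWN, CRITICAL]))
    # One check still OK: the host is answering, so look at the failing service.
    print(host_looks_unresponsive([OK, CRITICAL]))
```

A single failing service usually points at that service, while all of them failing at once points at the host or its agent.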
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
scottwilkerson
DevOps Engineer
 
Posts: 15747
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises

Re: How to check window and linux server unresponsive or fre

Postby JosephH » Thu Aug 15, 2019 8:36 am

Hi scottwilkerson,

Thank you for your reply. I see your point.
I am using the NCPA agent.
According to this article for Windows, https://techcommunity.microsoft.com/t5/ ... a-p/375577, one test is to check whether we can map or access \\serverName\C$. I am not sure whether this also applies to Linux or other operating systems. Is there an existing plugin or check command for mapping drives?
Otherwise, do you have any suggestions for deciding that a server is unresponsive based on existing checks? For example, we could combine a ping test with an NCPA check (getting the agent version) to determine that the server is unresponsive.
I am trying to find a better way to check.

Thank you,
Joseph

Re: How to check window and linux server unresponsive or fre

Postby scottwilkerson » Thu Aug 15, 2019 8:55 am

I would think that if you cannot do a basic check to get the agent version, that is definitely a problem all on its own and should be enough.

Sometimes servers respond to ping but are locked up and won't do anything else.
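
As a rough sketch of the idea discussed here, the ping result and the agent-version result can be combined into one state. Everything in it is an assumption made for illustration: the function name, the two boolean inputs (which would come from your own ping and NCPA checks), the messages, and the choice to report WARNING when the agent answers but ping does not.

```python
# Standard Nagios plugin return codes used by this sketch.
OK, WARNING, CRITICAL = 0, 1, 2


def classify_host(ping_ok, agent_ok):
    """Combine a ping result and an agent-version result into (code, message).

    Both arguments are plain booleans supplied by the caller; this function
    does not run any check itself.
    """
    if ping_ok and agent_ok:
        return OK, "OK - host answers ping and the agent responds"
    if ping_ok:
        # Network stack is alive but the agent is silent: possible freeze.
        return CRITICAL, "CRITICAL - ping OK but agent not responding"
    if agent_ok:
        # Assumption: ICMP may simply be filtered, so this is only a warning.
        return WARNING, "WARNING - agent responds but ping fails"
    return CRITICAL, "CRITICAL - no ping reply and agent not responding"


if __name__ == "__main__":
    code, message = classify_host(ping_ok=True, agent_ok=False)
    print(code, message)
```

The case that matters for a hung server is the second one: ping still succeeds, but the agent cannot return even its version.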

Re: How to check window and linux server unresponsive or fre

Postby JosephH » Thu Aug 22, 2019 8:28 am

Hi scottwilkerson,

The NCPA agent passive check configuration has one entry:
%HOSTNAME%|__HOST__ = system/agent_version
Is this required for all passive checks, or is it optional?
Also, if we set up an active ping test on this server, will it conflict with the above host check in Nagios XI?

Thanks
Joseph

Re: How to check window and linux server unresponsive or fre

Postby scottwilkerson » Thu Aug 22, 2019 9:26 am

JosephH wrote:is this required for all the passive checks or optional?

It is optional; this entry is the host check.

JosephH wrote:Do you know if we setup an active Ping test on this server, does this conflict with the above host check in Nagios XI?

You can only have one host check, but you can set up a Ping service check.

The easiest way is to run the Generic Network Device Wizard and use the same hostname.

