How to check whether a Windows or Linux server is unresponsive or frozen

JosephH
Posts: 34
Joined: Fri Oct 12, 2018 10:04 am


Post by JosephH »

I have searched the forum to see whether there is an existing plugin or function that checks if a server is frozen or unresponsive. For example, the ping test may be OK while the server does not respond to any check command.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises

Re: How to check window and linux server unresponsive or fre

Post by scottwilkerson »

This depends on the agent you are using, but generally it shows up as UNKNOWN or CRITICAL results on all the service checks for the host you are checking.

If the server isn't responding to the check command, that service will be in a non-OK state.

To answer the question directly, though: there is no check that gets the server to report that it is unresponsive, because an unresponsive server would not respond to that check either.
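
To illustrate how a missing response turns into a non-OK state, here is a minimal Python sketch of a wrapper that runs any check command with a timeout. The names `run_check` and `main` are hypothetical, and mapping a timeout to CRITICAL is an assumption; some plugins report UNKNOWN instead.

```python
import subprocess
import sys

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def run_check(command, timeout_seconds=10):
    """Run a check command and return (exit_code, message).

    A command that does not finish in time is reported as CRITICAL
    (an assumption for this sketch; UNKNOWN is also a common choice).
    """
    try:
        result = subprocess.run(
            command, capture_output=True, text=True, timeout=timeout_seconds
        )
    except subprocess.TimeoutExpired:
        return CRITICAL, f"CRITICAL - no response within {timeout_seconds}s"
    except OSError as exc:
        # The command itself could not be started (missing binary, etc.).
        return UNKNOWN, f"UNKNOWN - could not run check: {exc}"

    # Pass through valid plugin codes; treat anything else as UNKNOWN.
    code = result.returncode if result.returncode in (OK, WARNING, CRITICAL) else UNKNOWN
    message = result.stdout.strip() or result.stderr.strip()
    return code, message


def main(argv):
    """Run the command given in argv and return its Nagios exit code."""
    if not argv:
        print("UNKNOWN - no check command given")
        return UNKNOWN
    code, message = run_check(argv)
    print(message)
    return code
```

From a script, this could be invoked as `sys.exit(main(sys.argv[1:]))`, so the monitoring server sees the timeout as an ordinary non-OK result.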
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
JosephH
Posts: 34
Joined: Fri Oct 12, 2018 10:04 am

Re: How to check window and linux server unresponsive or fre

Post by JosephH »

Hi scottwilkerson,

Thank you for your reply. I got your point.
I am using the NCPA agent.
According to one article for Windows (https://techcommunity.microsoft.com/t5/ ... a-p/375577), one test is to check whether we can map or access \\serverName\C$, but I am not sure whether this also applies to Linux or other operating systems. Is there an existing plugin or check command for mapping drives?
Otherwise, are there any suggestions for deciding that the server is unresponsive based on existing checks? For example, we could combine a ping test with an NCPA check (get the agent version) to determine whether the server is unresponsive.
I am trying to find a better way to check.

Thank you,
Joseph
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises

Re: How to check window and linux server unresponsive or fre

Post by scottwilkerson »

I would think that if you cannot do a basic check to get the agent version, that is definitely a problem all on its own and should be enough.

Sometimes servers respond to ping but are locked up and won't do anything else.
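
As a rough sketch of combining the two results, the Python below queries the agent version over the NCPA HTTP API and classifies the host. The function names are hypothetical. The URL layout (`/api/system/agent_version` with a `token` parameter on port 5693) and the disabled certificate verification (for a self-signed agent certificate) are assumptions about a typical NCPA install, so adjust them to your setup. The ping result is passed in from whatever ping check you already run.

```python
import json
import ssl
import urllib.parse
import urllib.request

# Nagios plugin exit codes used in this sketch.
OK, CRITICAL = 0, 2


def agent_responds(host, token, port=5693, timeout_seconds=10):
    """Return True if the NCPA agent answers the agent_version query in time.

    Assumption: the agent listens on HTTPS with a self-signed certificate,
    so certificate verification is turned off here.
    """
    query = urllib.parse.urlencode({"token": token})
    url = f"https://{host}:{port}/api/system/agent_version?{query}"
    context = ssl.create_default_context()
    context.check_hostname = False
    context.verify_mode = ssl.CERT_NONE
    try:
        with urllib.request.urlopen(url, timeout=timeout_seconds, context=context) as response:
            json.load(response)  # any well-formed JSON reply counts as "alive"
            return True
    except (OSError, ValueError):
        # Covers timeouts, refused connections, HTTP errors, and bad JSON.
        return False


def classify(ping_ok, agent_ok):
    """Combine a ping result and an agent result into (exit_code, message)."""
    if ping_ok and agent_ok:
        return OK, "OK - host answers ping and agent responds"
    if ping_ok:
        return CRITICAL, "CRITICAL - host answers ping but agent does not respond (possibly frozen)"
    return CRITICAL, "CRITICAL - host does not answer ping"
```

For example, `classify(True, agent_responds("server01", "mytoken"))` would flag a host that still answers ping but whose agent has stopped responding; `server01` and `mytoken` are placeholders.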
JosephH
Posts: 34
Joined: Fri Oct 12, 2018 10:04 am

Re: How to check window and linux server unresponsive or fre

Post by JosephH »

Hi scottwilkerson,

The NCPA agent passive check configuration has one entry:
%HOSTNAME%|__HOST__ = system/agent_version
Is this required for all the passive checks, or is it optional?
Do you know whether setting up an active ping test on this server would conflict with the above host check in Nagios XI?

Thanks
Joseph
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises

Re: How to check window and linux server unresponsive or fre

Post by scottwilkerson »

JosephH wrote: is this required for all the passive checks or optional?
Optional. This entry is the host check.
JosephH wrote: Do you know if we setup an active Ping test on this server, does this conflict with the above host check in Nagios XI?
You can only have one host check, but you can set up a Ping service check.

The easiest way is to run the Generic Network Device Wizard and use the same hostname.