How to check window and linux server unresponsive or freeze
How to check window and linux server unresponsive or freeze
I have searched the forum and see if there is any existing plugin or function to check if a server is in freeze or unresponsive status. For example, the ping test is OK, the server maybe not response to any check command.
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: How to check window and linux server unresponsive or fre
This depends on the agent you are using but generally, this manifests itself as getting UNKNOWN or CRITICAL errors on all the service checks for the host you are checking.
If the server isn't responding to the check command, this would be a non-OK state for that service.
To directly answer the question though, there is no check to get the server to say it is unresponsive because it would not respond to those as well.
If the server isn't responding to the check command, this would be a non-OK state for that service.
To directly answer the question though, there is no check to get the server to say it is unresponsive because it would not respond to those as well.
Re: How to check window and linux server unresponsive or fre
hi scottwilkerson,
Thank you for your reply. I got your point.
I am using NCPA agent.
Based on one article https://techcommunity.microsoft.com/t5/ ... a-p/375577 for windows, one of them is to check if we can map or access \\serverName\C$. But not sure is this also apply to Linux or other OS. Is there existing plugins or check command for mapping drives?
Otherwise, any suggestions to say the server is unresponsive based on some existing checks? for example, we can combine Ping test and NCPA (get the agent version) checks to determine server unresponsive.
Trying to find a better way to check.
Thank you,
Joseph
Thank you for your reply. I got your point.
I am using NCPA agent.
Based on one article https://techcommunity.microsoft.com/t5/ ... a-p/375577 for windows, one of them is to check if we can map or access \\serverName\C$. But not sure is this also apply to Linux or other OS. Is there existing plugins or check command for mapping drives?
Otherwise, any suggestions to say the server is unresponsive based on some existing checks? for example, we can combine Ping test and NCPA (get the agent version) checks to determine server unresponsive.
Trying to find a better way to check.
Thank you,
Joseph
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: How to check window and linux server unresponsive or fre
I would think if you cannot do a basic check to get the agent version, it is definitely a problem all on it's own and should be enough.
Sometimes servers respond to ping but are locked up and won't do anything else.
Sometimes servers respond to ping but are locked up and won't do anything else.
Re: How to check window and linux server unresponsive or fre
Hi scottwilkerson,
NCPA agent passive check has one entry
%HOSTNAME%|__HOST__ = system/agent_version
is this required for all the passive checks or optional?
Do you know if we setup an active Ping test on this server, does this conflict with the above host check in Nagios XI?
Thanks
Joseph
NCPA agent passive check has one entry
%HOSTNAME%|__HOST__ = system/agent_version
is this required for all the passive checks or optional?
Do you know if we setup an active Ping test on this server, does this conflict with the above host check in Nagios XI?
Thanks
Joseph
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: How to check window and linux server unresponsive or fre
optional, this is the host checkJosephH wrote:is this required for all the passive checks or optional?
You can only have one host check, but you can setup a Ping service check.JosephH wrote:Do you know if we setup an active Ping test on this server, does this conflict with the above host check in Nagios XI?
Easiest way is to run the Generic Network Device Wizard and use the same hostname.