AD/Witness Server Restart : Critical Alarm for URL Checks
-
- Posts: 72
- Joined: Wed Feb 06, 2019 3:22 pm
Re: AD/Witness Server Restart : Critical Alarm for URL Check
I'm going to PM the information I have
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: AD/Witness Server Restart : Critical Alarm for URL Check
Based on what you sent, prod1.yyyy.com always resolves to the same IP address. That doesn't show what happens when the server at that IP address fails to respond.
-
- Posts: 72
- Joined: Wed Feb 06, 2019 3:22 pm
Re: AD/Witness Server Restart : Critical Alarm for URL Check
I'm working with a third party on this, and they stated the following:
That sounds like he is asking for information from the device that is hosting the site (Devices APIServerA and APIServerB). However the problem is that the site isn't going down during this issue. Nagios only thinks it is.
Can you explain what you want from us so we can help you figure this out?
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: AD/Witness Server Restart : Critical Alarm for URL Check
When the site goes down, I'm assuming the DNS changes from xxx.xxx.xxx.xxx to yyy.yyy.yyy.yyy.
Does this happen? If so, I need to see that the Nagios server is making the request to yyy.yyy.yyy.yyy and, as a follow-up, that yyy.yyy.yyy.yyy is returning the expected result.
Nagios works by calling plugins that interact with the destination servers. If a plugin doesn't reach the correct server, that is a problem for Nagios in the same way it would be a problem for your users.
Nagios reports results from the commands you define. In this case it makes a call to the website you specify, using the command definition and the arguments you pass to it.
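One way to gather that evidence is to record which address the Nagios server itself resolves for the name at the moment of the check. The following is only a minimal sketch, assuming Python 3 is available on the Nagios server; `resolve_ipv4` is a hypothetical helper name and is not part of Nagios or its plugins.

```python
import socket


def resolve_ipv4(hostname, port=443):
    """Return the sorted, de-duplicated IPv4 addresses that the local
    resolver on this machine returns for hostname."""
    infos = socket.getaddrinfo(
        hostname, port, family=socket.AF_INET, type=socket.SOCK_STREAM
    )
    # Each entry is (family, type, proto, canonname, sockaddr);
    # for IPv4, sockaddr is (address, port).
    return sorted({info[4][0] for info in infos})


# Example use on the Nagios server (hostname taken from this thread):
# print(resolve_ipv4("prod1.yyyy.com"))
```

Running this repeatedly during the restart, alongside the check itself, would show whether the name moves to a different address while the alarm is firing.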
-
- Posts: 72
- Joined: Wed Feb 06, 2019 3:22 pm
Re: AD/Witness Server Restart : Critical Alarm for URL Check
PM-ing you what the third party responded with.
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: AD/Witness Server Restart : Critical Alarm for URL Check
The errors you are seeing are from the URL Status service.
Can you post the command definition for the check_service_http command?
-
- Posts: 72
- Joined: Wed Feb 06, 2019 3:22 pm
Re: AD/Witness Server Restart : Critical Alarm for URL Check
I have it listed in my commands cfg file:
define command {
    command_name    check_service_http
    command_line    $USER1$/check_http -H $HOSTADDRESS$ $ARG1$
}
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: AD/Witness Server Restart : Critical Alarm for URL Check
The next time you reboot the server, you can run the following from the Nagios server CLI as many times as you want to get verbose output from the command that is failing.
This is the exact command Nagios runs, based on the config fragments you have posted, with -vvv added for verbose output:
/usr/local/nagios/libexec/check_http -H prod1.yyyy.com -f follow -u '/api/api.asmx' -vvv
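For reference, the command above comes from substituting the macros in the posted command_line. The sketch below illustrates that substitution only; it is not how Nagios implements it. The macro values are assumptions read back from the command above (the service definition supplying $ARG1$ was not posted in the thread), and `expand_command_line` is a hypothetical helper.

```python
def expand_command_line(command_line, macros):
    """Replace each $NAME$ macro in a Nagios command_line with its value."""
    expanded = command_line
    for name, value in macros.items():
        expanded = expanded.replace(f"${name}$", value)
    return expanded


# command_line as posted in the thread.
command_line = "$USER1$/check_http -H $HOSTADDRESS$ $ARG1$"

# Assumed macro values, inferred from the command shown above.
macros = {
    "USER1": "/usr/local/nagios/libexec",
    "HOSTADDRESS": "prod1.yyyy.com",
    "ARG1": "-f follow -u '/api/api.asmx'",
}

print(expand_command_line(command_line, macros))
```

Appending -vvv to the expanded line gives the command to run by hand.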
-
- Posts: 72
- Joined: Wed Feb 06, 2019 3:22 pm
Re: AD/Witness Server Restart : Critical Alarm for URL Check
I ran the command provided. While it was down, we were also on the secondary DNS.
Output while AD was down and restarting:
[root@]# /usr/local/nagios/libexec/check_http -H prod1.yyyyy.com -f follow -u '/api/api.asmx' -vvv
GET /api/api.asmx HTTP/1.1
User-Agent: check_http/v2.2.1.git (nagios-plugins 2.2.1)
Connection: close
Host: prod1.yyyy.com
Accept: */*
http://prod1.yyyy.com:80/api/api.asmx is 137 characters
STATUS: HTTP/1.0 301 Moved Permanently
**** HEADER ****
location: https://prod1.yyyyy.com/api/api.asmx
Server: BigIP
Connection: close
Content-Length: 0
**** CONTENT ****
Redirection to https://prod1.yyyy.com:443/api/api.asmx
CRITICAL - Socket timeout
-
- Posts: 72
- Joined: Wed Feb 06, 2019 3:22 pm
Re: AD/Witness Server Restart : Critical Alarm for URL Check
Additional info: output when working properly:
[root@]# /usr/local/nagios/libexec/check_http -H prod1.yyyy.com -f follow -u '/api/api.asmx' -vvv
GET /api/api.asmx HTTP/1.1
User-Agent: check_http/v2.2.1.git (nagios-plugins 2.2.1)
Connection: close
Host: prod1.yyyy.com
Accept: */*
http://prod1.yyyy.com:80/api/api.asmx is 137 characters
STATUS: HTTP/1.0 301 Moved Permanently
**** HEADER ****
location: https://prod1.yyyy.com/api/api.asmx
Server: BigIP
Connection: close
Content-Length: 0
**** CONTENT ****
Redirection to https://prod1.yyyy.com:443/api/api.asmx
SSL initialized
GET /api/api.asmx HTTP/1.1
User-Agent: check_http/v2.2.1.git (nagios-plugins 2.2.1)
Connection: close
Host: prod1.yyyy.com
Accept: */*
https://prod1.yyyy.com:443/api/api.asmx is 28695 characters
STATUS: HTTP/1.1 200 OK
**** HEADER ****
Cache-Control: private, max-age=0
Content-Type: text/html; charset=utf-8
Server: Microsoft-IIS/8.5
X-Powered-By: ASP.NET
X-Robots-Tag: noindex
Access-Control-Allow-Origin: *
Access-Control-Allow-Headers: accept, content-type, soapaction
Access-Control-Request-Method: POST
Date: Fri, 11 Oct 2019 21:12:25 GMT
Connection: close
Content-Length: 28223
Strict-Transport-Security: max-age=16070400; includeSubDomains
X-Frame-Options: SAMEORIGIN
**** CONTENT ****
<html>
-
</body>
</html>
HTTP OK: HTTP/1.1 200 OK - 28695 bytes in 0.012 second response time |time=0.011678s;;;0.000000 size=28695B;;;0
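Comparing the two runs, both receive the 301 from BigIP on port 80. The failing run stops after "Redirection to" with a socket timeout and never prints "SSL initialized", while the working run goes on to a 200 OK over port 443. If the check is run repeatedly during a restart, a small script can summarize how far each run got. This is a rough sketch only: the marker strings are the ones visible in the output posted here (nagios-plugins 2.2.1) and may differ in other versions, and `last_stage` is a hypothetical helper.

```python
def last_stage(verbose_output):
    """Summarize how far a `check_http -vvv` run got, using marker lines
    seen in the output posted in this thread."""
    stage = "no response"
    for raw in verbose_output.splitlines():
        line = raw.strip()
        if line.startswith("STATUS:"):
            stage = "got HTTP status: " + line[len("STATUS:"):].strip()
        elif line.startswith("Redirection to"):
            stage = "following redirect: " + line[len("Redirection to"):].strip()
        elif line == "SSL initialized":
            stage = "TLS session established"
        elif line.startswith("CRITICAL"):
            # Stop at the first CRITICAL line and report what preceded it.
            return stage + ", then " + line
    return stage


# Shortened copies of the two outputs posted above.
failing = """STATUS: HTTP/1.0 301 Moved Permanently
Redirection to https://prod1.yyyy.com:443/api/api.asmx
CRITICAL - Socket timeout"""

working = """STATUS: HTTP/1.0 301 Moved Permanently
Redirection to https://prod1.yyyy.com:443/api/api.asmx
SSL initialized
STATUS: HTTP/1.1 200 OK"""

print(last_stage(failing))
print(last_stage(working))
```

For the failing sample this reports that the run was following the redirect to port 443 when the timeout hit, which is the step to compare against the working run.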