All checks, except PING, not responding on windows servers
Posted: Tue Nov 14, 2017 7:01 am
Please bare with me as i'm still learning to navigate my way around Nagios!
I have 2 windows server that were already set and being monitored for CPU/Memory/Disk space etc checks. They are set up identical to each other except the one is located on-site, the other is at our DR site (if that matters)
I wanted to add a new adhoc powershell script to check something on the servers. So I defined this check in the nrpecheck.cfg file as follows
went to the windows server, Added the line to the custom.ini file on both servers:
put the script in the path c:\programfiles\NSCLient++\scripts
Restarted the NSClients, then reloaded the Nagios cor. I've done this previously and never had a problem.
This time however...
My new script was timing out on both servers no matter the time out interval. All previously set up checks on server "test2" also flagged with timeout warnings.
I removed all changes to both servers, and the new defined service from nagios and restarted the services. "test" returned to normal "test2" still timed out.
In addition, Any attempt to manually run a check off the nagiso cor server
such as returns a connection refused by host. With the exception of a ping and check_nt
At this point I'm lost and I'm starting to think this is probably not a nagios specific problem, but I wanted to rule that out first!
Any help or points in the right direction would be appreciated.
Regards
I have 2 windows server that were already set and being monitored for CPU/Memory/Disk space etc checks. They are set up identical to each other except the one is located on-site, the other is at our DR site (if that matters)
I wanted to add a new adhoc powershell script to check something on the servers. So I defined this check in the nrpecheck.cfg file as follows
Code: Select all
define service{
use generic-service
host_name test,test2
service_description TLs check
check_command check_nrpe!TLSprotocol
check_interval 60
}
Code: Select all
TLSprotocol = cmd /c echo scripts\\TLSprotocol.ps1; exit($lastexitcode) | powershell.exe -command-
Restarted the NSClients, then reloaded the Nagios cor. I've done this previously and never had a problem.
This time however...
My new script was timing out on both servers no matter the time out interval. All previously set up checks on server "test2" also flagged with timeout warnings.
I removed all changes to both servers, and the new defined service from nagios and restarted the services. "test" returned to normal "test2" still timed out.
In addition, Any attempt to manually run a check off the nagiso cor server
such as
Code: Select all
/usr/local/nagios/libexec/check_nrpe -H test2 -c alias_disk
At this point I'm lost and I'm starting to think this is probably not a nagios specific problem, but I wanted to rule that out first!
Any help or points in the right direction would be appreciated.
Regards