Page 1 of 9
Problem with monitoring Windows 2012 R2 servers
Posted: Tue Jul 22, 2014 9:41 am
by Frédéric GRANAT
Hi,
Nagios XI version : Nagios XI 2012R1.6
Sometimes with CPU check launches CRITICAL alert whereas there's no overload with CPU.
This has been noticed especially on Windows 2012 R2 servers.
Rgds,
Frederic
Re: Problem with monitoring Windows 2012 R2 servers
Posted: Tue Jul 22, 2014 10:27 am
by slansing
Can we get a copy of one of the critical check results? Is there anything strange about what is being returned? Is it standard output that you would expect (besides the state that is being carried) or are they timing out? etc?
Re: Problem with monitoring Windows 2012 R2 servers
Posted: Tue Jul 22, 2014 10:34 am
by sreinhardt
What check are you using to monitor these systems? What values are you receiving when these do go critical? How often is this happening?
Re: Problem with monitoring Windows 2012 R2 servers
Posted: Fri Jul 25, 2014 2:10 am
by Frédéric GRANAT
Hi,
I'm using $USER1$/check_wmi_plus.pl -H $HOSTADDRESS$ -u $ARG1$ -p $ARG2$ -m checkcpu -w $ARG3$ -c $ARG4$ $ARG5$
Display at the moment in Nagios XI
server1 CPU Usage 2h 7m 38s CRITICAL (Sample Period 26 sec) - [Triggered by _AvgCPU>90] - Average CPU Utilisation 100.00%
But if I run ./check_wmi_plus.pl -H server1 -u user -p pwd -m checkcpu -w 80 -c 90
OK (Sample Period 12 sec) - Average CPU Utilisation 5.62%|'Avg CPU Utilisation'=5.62%;80;90;
So Correct in command line and wrong in nagiosxi
If I perform a "Schedule an immediate check" in nagiosxi then it corrects the problem.
Seems to be a refresh problem
Rgds,
Frederic
Re: Problem with monitoring Windows 2012 R2 servers
Posted: Fri Jul 25, 2014 11:20 am
by sreinhardt
Was it spiked at 100% throughout the entire 2 hour window prior to the immediate check? Could you show a screenshot of the graph over this period?
Re: Problem with monitoring Windows 2012 R2 servers
Posted: Wed Jul 30, 2014 2:04 am
by Frédéric GRANAT
Hi,
the graph is attached
Re: Problem with monitoring Windows 2012 R2 servers
Posted: Wed Jul 30, 2014 9:23 am
by tmcdonald
Frédéric GRANAT wrote:Hi,
the graph is attached
It looks like you have not attached anything.
Re: Problem with monitoring Windows 2012 R2 servers
Posted: Thu Jul 31, 2014 2:51 am
by Frédéric GRANAT
I understand : The extension bmp is not allowed.
I attached it in jpeg format
Re: Problem with monitoring Windows 2012 R2 servers
Posted: Thu Jul 31, 2014 1:39 pm
by lmiltchev
This graph doesn't look right... How did you set up this check? Did you run the Windows WMI wizard? I can see on the graph that your check command is "check_win_cpu". Mine is "check_xi_service_wmiplus"... Go to the CCM->Services->View Text Config (the diskette icon) and show us the service config for the CPU Usage" (hide sensitive info). Also, show us the command definition of "check_win_cpu".
Re: Problem with monitoring Windows 2012 R2 servers
Posted: Fri Aug 01, 2014 7:45 am
by Frédéric GRANAT
###############################################################################
#
# Service configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date: 2014-08-01 14:41:15
# Version: Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND ---
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################
define service {
host_name server
service_description CPU Usage
use windows-cpu
servicegroups Serveurs applications metiers
register 1
}
define service {
host_name server
service_description Drives : Disk Usage
use windows-disk
register 1
}
define service {
host_name server
service_description Memory Usage
use windows-mem
register 1
}
define service {
host_name server
service_description Services
use windows-service
register 1
}
define service {
host_name server
service_description Uptime
use windows-uptime
register 1
}
###############################################################################
#
# Service configuration file
#
# END OF FILE
#
###############################################################################
Definition of check_win_cpu :
$USER1$/check_wmi_plus.pl -H $HOSTADDRESS$ -u $ARG1$ -p $ARG2$ -m checkcpu -w $ARG3$ -c $ARG4$ $ARG5$