Issue plotting memory usage data in Grafana
Posted: Wed Sep 12, 2018 8:55 am
Good afternoon - as the title of my thread suggests, this is PROBABLY a Grafana issue, rather than a Nagios one, but I'm just hoping that it might ring a bell with someone, so here goes...
In Grafana, I have set up a dashboard with graphs that monitor memory usage, based on the following Nagios command and service definitions:
Code:
# Generic check_nt command; the variable to check and the thresholds are passed as arguments
define command{
    command_name    check_nt
    command_line    $USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$
}
define service{
    use                     generic-service
    #host_name              MyServer
    hostgroup_name          MyServers
    service_description     Win_TotMem_NT
    check_command           check_nt!MEMUSE!-w 90 -c 95
}
However, the memory usage values returned seem higher than the ones I see when I remote into the servers directly, so I tried to replicate the above functionality with check_nrpe using the following:
Code:
# NRPE check that calls CheckMEM with the thresholds fixed in the command line
define command{
    command_name    check_nrpe_totmem
    command_line    $USER1$/check_nrpe -H $HOSTADDRESS$ -p 5666 -c CheckMEM -a MaxWarn=80% MaxCrit=90% ShowAll=long
}
define service{
    use                     generic-service
    #host_name              MyServer
    hostgroup_name          MyServers
    service_description     Win_TotMem_NRPE
    check_command           check_nrpe_totmem
}
The values returned for the check_nt and check_nrpe services match each other on the Nagios web page. However, when I try to set up graphs for them in a Grafana dashboard, the check_nt charts work fine but the check_nrpe ones don't - they pretty much flatline.
I have the Y-axis unit set to megabytes. The check_nt graphs show values in the range 2.0-3.0 GB, whereas the check_nrpe graphs show only very small MB values.
I'm assuming that this has something to do with the way in which the check_nrpe data is being parsed, but I have no idea where to start in resolving it, so any advice would be gratefully received!
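One way to test that assumption would be to look at the raw performance data each check emits and compare the units before the values reach Grafana. Below is a minimal Python sketch, assuming both plugins follow the standard Nagios perfdata format ('label'=value[UOM];warn;crit;min;max). The sample strings and labels are made up for illustration and are NOT real output from check_nt or CheckMEM; the function names are hypothetical too.

```python
import re

# One standard Nagios perfdata item: 'label'=value[UOM];[warn];[crit];[min];[max]
# Only the label, value and unit are captured; thresholds are ignored.
PERF_ITEM = re.compile(
    r"(?:'(?P<quoted>[^']+)'|(?P<bare>[^\s=']+))"
    r"=(?P<value>-?\d+(?:\.\d+)?)(?P<uom>[A-Za-z%]*)"
)

# Multipliers from byte-based units to megabytes (1 MB taken as 1024 * 1024 bytes).
TO_MB = {"B": 1 / 1024**2, "KB": 1 / 1024, "MB": 1.0, "GB": 1024.0, "TB": 1024.0**2}


def parse_perfdata(perfdata):
    """Return {label: (value, uom)} for every item in a perfdata string."""
    items = {}
    for match in PERF_ITEM.finditer(perfdata):
        label = match.group("quoted") or match.group("bare")
        items[label] = (float(match.group("value")), match.group("uom"))
    return items


def to_megabytes(value, uom):
    """Convert a byte-based value to MB; return None for other units such as %."""
    factor = TO_MB.get(uom.upper())
    return None if factor is None else value * factor


# Hypothetical perfdata strings, invented for this example only.
sample_a = "'Memory usage'=2560.00MB;3686.40;3891.20;0.00;4096.00"
sample_b = "'mem %'=63%;80;90 'mem'=2684354560B;3435973836;3865470566;0;4294967296"

for sample in (sample_a, sample_b):
    for label, (value, uom) in parse_perfdata(sample).items():
        print(label, value, uom or "(no unit)", to_megabytes(value, uom))
```

If the two services turn out to report in different units (or one reports a percentage), the same Y-axis unit in Grafana would scale them very differently, which would be worth ruling out first.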
Thanks in advance
Pete