Page 1 of 2

Two threshold types in one

Posted: Mon Jan 19, 2015 4:45 am
by reincarne
Hi, sorry for the title which doesn't say much :)
I have a question related to thresholds. Is there a way to put in one check two types of thresholds?
For example, I want to put thresholds for minimal and maximal: Warning/Critical if the value goes down, Warning/Critical if values goes too high.

Re: Two threshold types in one

Posted: Mon Jan 19, 2015 9:23 am
by jwelch
Short answer: It depends.

There is a format for ranges, but it depends on whether or not the specific check script you want to use
supports that format.

See: https://nagios-plugins.org/doc/guidelin ... HOLDFORMAT

Check the help text for plugin you want to use:

(note the usage of 'range' in the warning and critical threhold specifiers and the first example using the ':'.)

Code: Select all

/usr/local/nagios/libexec/check_procs -h
check_procs v2.0.3 (nagios-plugins 2.0.3)
Copyright (c) 1999 Ethan Galstad <[email protected]>
Copyright (c) 2000-2014 Nagios Plugin Development Team
        <[email protected]>

Checks all processes and generates WARNING or CRITICAL states if the specified
metric is outside the required threshold ranges. The metric defaults to number
of processes.  Search filters can be applied to limit the processes to check.


Usage:
check_procs -w <range> -c <range> [-m metric] [-s state] [-p ppid]
 [-u user] [-r rss] [-z vsz] [-P %cpu] [-a argument-array]
 [-C command] [-k] [-t timeout] [-v]

Options:
 -h, --help
    Print detailed help screen
 -V, --version
    Print version information
 --extra-opts=[section][@file]
    Read options from an ini file. See
    https://www.nagios-plugins.org/doc/extra-opts.html
    for usage and examples.
 -w, --warning=RANGE
   Generate warning state if metric is outside this range
 -c, --critical=RANGE
   Generate critical state if metric is outside this range
 -m, --metric=TYPE
  Check thresholds against metric. Valid types:
  PROCS   - number of processes (default)
  VSZ     - virtual memory size
  RSS     - resident set memory size
  CPU     - percentage CPU
  ELAPSED - time elapsed in seconds
 -t, --timeout=INTEGER
    Seconds before plugin times out (default: 10)
 -v, --verbose
    Extra information. Up to 3 verbosity levels
 -T, --traditional
   Filter own process the traditional way by PID instead of /proc/pid/exe

Filters:
 -s, --state=STATUSFLAGS
   Only scan for processes that have, in the output of `ps`, one or
   more of the status flags you specify (for example R, Z, S, RS,
   RSZDT, plus others based on the output of your 'ps' command).
 -p, --ppid=PPID
   Only scan for children of the parent process ID indicated.
 -z, --vsz=VSZ
   Only scan for processes with VSZ higher than indicated.
 -r, --rss=RSS
   Only scan for processes with RSS higher than indicated.
 -P, --pcpu=PCPU
   Only scan for processes with PCPU higher than indicated.
 -u, --user=USER
   Only scan for processes with user name or ID indicated.
 -a, --argument-array=STRING
   Only scan for processes with args that contain STRING.
 --ereg-argument-array=STRING
   Only scan for processes with args that contain the regex STRING.
 -C, --command=COMMAND
   Only scan for exact matches of COMMAND (without path).
 -k, --no-kthreads
   Only scan for non kernel threads (works on Linux only).

RANGEs are specified 'min:max' or 'min:' or ':max' (or 'max'). If
specified 'max:min', a warning status will be generated if the
count is inside the specified range

This plugin checks the number of currently running processes and
generates WARNING or CRITICAL states if the process count is outside
the specified threshold ranges. The process count can be filtered by
process owner, parent process PID, current state (e.g., 'Z'), or may
be the total number of running processes

Examples:
 check_procs -w 2:2 -c 2:1024 -C portsentry
  Warning if not two processes with command name portsentry.
  Critical if < 2 or > 1024 processes

 check_procs -w 10 -a '/usr/local/bin/perl' -u root
  Warning alert if > 10 processes with command arguments containing
  '/usr/local/bin/perl' and owned by root

 check_procs -w 50000 -c 100000 --metric=VSZ
  Alert if VSZ of any processes over 50K or 100K

 check_procs -w 10 -c 20 --metric=CPU
  Alert if CPU of any processes over 10%% or 20%%

Send email to [email protected] if you have questions regarding use
of this software. To submit patches or suggest improvements, send email to
[email protected]
Then run it from the command line to verify that it works the way you want:

Code: Select all

 ./check_procs -w 1:10 -c 0:100 -C /usr/sbin/httpd
PROCS WARNING: 0 processes with command name '/usr/sbin/httpd' | procs=0;1:10;0:100;0;
 ./check_procs -w 1:10 -c 0:100 -C httpd
PROCS WARNING: 21 processes with command name 'httpd' | procs=21;1:10;0:100;0;
 ./check_procs -w 1:30 -c 0:100 -C httpd
PROCS OK: 21 processes with command name 'httpd' | procs=21;1:30;0:100;0;
 ./check_procs -w 25:30 -c 0:100 -C httpd
PROCS WARNING: 21 processes with command name 'httpd' | procs=21;25:30;0:100;0;

Re: Two threshold types in one

Posted: Mon Jan 19, 2015 12:21 pm
by lmiltchev
reincarne, let us know if jwelch answered your question.

Re: Two threshold types in one

Posted: Wed Jan 21, 2015 7:24 am
by reincarne
Thanks, I will try it first.

Re: Two threshold types in one

Posted: Wed Jan 21, 2015 11:25 am
by lmiltchev
OK, keep us posted.

Re: Two threshold types in one

Posted: Fri Jan 23, 2015 12:02 pm
by jwelch
Before I stumbled onto the 'official' method of designating ranges, I used
-w -c for lower limits and -W -C for upper limits in my own scripts. Your
script (depending on it's source) could do something similar or use another
method...only way to be sure is to look at the code.

Re: Two threshold types in one

Posted: Fri Jan 23, 2015 2:13 pm
by tmcdonald
reincarne, what specific plugin are you using? Can you link to it?

Re: Two threshold types in one

Posted: Mon Apr 06, 2015 2:37 am
by reincarne
tmcdonald wrote:reincarne, what specific plugin are you using? Can you link to it?

The question was general.
I understood that it depends on the plugin - if plugin support Maximal and Minimal thresholds then it is possible, if not then Nagios doesn't know to use the negative logic which is pretty disappointing.

Re: Two threshold types in one

Posted: Mon Apr 06, 2015 9:08 am
by tmcdonald
It's not that Nagios doesn't know how to use the logic, it's that the logic has not been presented to Nagios. There are plenty of situations where a negative reading could be perfectly valid (temperatures being the first to come to mind) and if Nagios treated all negative values the same way then some plugins would fail to work properly. This is why we have the plugins themselves interpret readings as they see fit, then report this info to Nagios for further processing.

Re: Two threshold types in one

Posted: Sun Apr 12, 2015 4:56 am
by reincarne
tmcdonald wrote:It's not that Nagios doesn't know how to use the logic, it's that the logic has not been presented to Nagios. There are plenty of situations where a negative reading could be perfectly valid (temperatures being the first to come to mind) and if Nagios treated all negative values the same way then some plugins would fail to work properly. This is why we have the plugins themselves interpret readings as they see fit, then report this info to Nagios for further processing.

Thanks mate :)