2 Criticals but only 1 Listed in Nagios Xi

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
michal.nastaly
Posts: 59
Joined: Tue Jun 02, 2015 9:24 am

2 Criticals but only 1 Listed in Nagios Xi

Post by michal.nastaly »

I have the following command setup to passively monitor drives on every server.

[/settings/scheduler/schedules/Drives]
check_drivesize "exclude=System Reserved" "filter=type='fixed'" "perf-syntax=${drive}"
interval = 10m

The command works fine but when 2 drives report critical alert, Nagios XI only shows 1 critical on a dashlet (i.e. service called Drives goes red and only reports 1 critical.

Is there a way to change this? So that when x drives report critical alerts the same gets reported on a dashlet.
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: 2 Criticals but only 1 Listed in Nagios Xi

Post by tgriep »

What OS and version is the server running and the version of the NSClient++ do you have installed on the remote system?
Can you provide a screen shot of the Dashlet and post it here?
Be sure to check out our Knowledgebase for helpful articles and solutions!
michal.nastaly
Posts: 59
Joined: Tue Jun 02, 2015 9:24 am

Re: 2 Criticals but only 1 Listed in Nagios Xi

Post by michal.nastaly »

The nsclient is 5.0.62 (newest)
and the Nagios XI server is: 5.4.2

The dashlet that we are using is "Service Group Summary".
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: 2 Criticals but only 1 Listed in Nagios Xi

Post by tgriep »

It look like that was a bug that was fixed last year but your command would have to have the critical count added to get the info you are looking for.
Try adding this to the command in the nsclient.ini file on your system and from what the post below says, it should give you the correct count error.

Code: Select all

"critical=count<5"
Adjust the count number to the number of drives in the system.
https://github.com/mickem/nscp/issues/294
Be sure to check out our Knowledgebase for helpful articles and solutions!
michal.nastaly
Posts: 59
Joined: Tue Jun 02, 2015 9:24 am

Re: 2 Criticals but only 1 Listed in Nagios Xi

Post by michal.nastaly »

Count is not what I'm after.

Config:

Code: Select all

; Check the status of the hosts drives
[/settings/scheduler/schedules/Drives]
command = check_drivesize "filter=type='fixed'" "warning=((free<10g) and (drive='C:\\')) or ((size<500g) and (used>80%) and (drive!='C:\\')) or ((size>500g) and (free<100g) and (drive!='C:\\'))" "critical=((free<5g) and (drive='C:\\')) or ((size<500g) and (used>90%) and (drive!='C:\\')) or ((size>500g) and (free<50g) and (drive!='C:\\'))" "ok-syntax=${list}" "detail-syntax=Drive: %(drive_or_name), Free: %(free), Used: %(used), Total: %(size)"
interval = 15m
Drives:
C, D, E and F for the sake of argument.

When C fills up and cause critical alert the dashboard reports 1 critical for the service group that this check is in.
When C and D fill up and cause critical alerts the dashboard still reports 1, but i would like it to report 2. This happens because both of the drives are part of the same service check. Is there a way of altering it so that the number of critical alerts for a given service group is the number of drives that reported the critical alert. As far as I understand the "count" will only count number of drives not do what i need it to do.
User avatar
mcapra
Posts: 3739
Joined: Thu May 05, 2016 3:54 pm

Re: 2 Criticals but only 1 Listed in Nagios Xi

Post by mcapra »

Since there's only 1 service check being ran, there will only ever be 1 state for that service. All Nagios XI has a concept of (count wise) is how many checks are in specific states. If you have a single check that is checking multiple things, that's all fine and dandy but it will always only ever be recognized as a single state and counted as a single state.

For what you want, I think you'd have do define multiple service checks. That's the only way you could, via the chosen dashlet, get a reliable count of how many drives on this remote machine are in a "CRITICAL" state.
Former Nagios employee
https://www.mcapra.com/
Locked