I have the following command setup to passively monitor drives on every server.
[/settings/scheduler/schedules/Drives]
check_drivesize "exclude=System Reserved" "filter=type='fixed'" "perf-syntax=${drive}"
interval = 10m
The command works fine but when 2 drives report critical alert, Nagios XI only shows 1 critical on a dashlet (i.e. service called Drives goes red and only reports 1 critical.
Is there a way to change this? So that when x drives report critical alerts the same gets reported on a dashlet.
2 Criticals but only 1 Listed in Nagios Xi
-
michal.nastaly
- Posts: 59
- Joined: Tue Jun 02, 2015 9:24 am
Re: 2 Criticals but only 1 Listed in Nagios Xi
What OS and version is the server running and the version of the NSClient++ do you have installed on the remote system?
Can you provide a screen shot of the Dashlet and post it here?
Can you provide a screen shot of the Dashlet and post it here?
Be sure to check out our Knowledgebase for helpful articles and solutions!
-
michal.nastaly
- Posts: 59
- Joined: Tue Jun 02, 2015 9:24 am
Re: 2 Criticals but only 1 Listed in Nagios Xi
The nsclient is 5.0.62 (newest)
and the Nagios XI server is: 5.4.2
The dashlet that we are using is "Service Group Summary".
and the Nagios XI server is: 5.4.2
The dashlet that we are using is "Service Group Summary".
Re: 2 Criticals but only 1 Listed in Nagios Xi
It look like that was a bug that was fixed last year but your command would have to have the critical count added to get the info you are looking for.
Try adding this to the command in the nsclient.ini file on your system and from what the post below says, it should give you the correct count error.
Adjust the count number to the number of drives in the system.
https://github.com/mickem/nscp/issues/294
Try adding this to the command in the nsclient.ini file on your system and from what the post below says, it should give you the correct count error.
Code: Select all
"critical=count<5"https://github.com/mickem/nscp/issues/294
Be sure to check out our Knowledgebase for helpful articles and solutions!
-
michal.nastaly
- Posts: 59
- Joined: Tue Jun 02, 2015 9:24 am
Re: 2 Criticals but only 1 Listed in Nagios Xi
Count is not what I'm after.
Config:
Drives:
C, D, E and F for the sake of argument.
When C fills up and cause critical alert the dashboard reports 1 critical for the service group that this check is in.
When C and D fill up and cause critical alerts the dashboard still reports 1, but i would like it to report 2. This happens because both of the drives are part of the same service check. Is there a way of altering it so that the number of critical alerts for a given service group is the number of drives that reported the critical alert. As far as I understand the "count" will only count number of drives not do what i need it to do.
Config:
Code: Select all
; Check the status of the hosts drives
[/settings/scheduler/schedules/Drives]
command = check_drivesize "filter=type='fixed'" "warning=((free<10g) and (drive='C:\\')) or ((size<500g) and (used>80%) and (drive!='C:\\')) or ((size>500g) and (free<100g) and (drive!='C:\\'))" "critical=((free<5g) and (drive='C:\\')) or ((size<500g) and (used>90%) and (drive!='C:\\')) or ((size>500g) and (free<50g) and (drive!='C:\\'))" "ok-syntax=${list}" "detail-syntax=Drive: %(drive_or_name), Free: %(free), Used: %(used), Total: %(size)"
interval = 15mC, D, E and F for the sake of argument.
When C fills up and cause critical alert the dashboard reports 1 critical for the service group that this check is in.
When C and D fill up and cause critical alerts the dashboard still reports 1, but i would like it to report 2. This happens because both of the drives are part of the same service check. Is there a way of altering it so that the number of critical alerts for a given service group is the number of drives that reported the critical alert. As far as I understand the "count" will only count number of drives not do what i need it to do.
Re: 2 Criticals but only 1 Listed in Nagios Xi
Since there's only 1 service check being ran, there will only ever be 1 state for that service. All Nagios XI has a concept of (count wise) is how many checks are in specific states. If you have a single check that is checking multiple things, that's all fine and dandy but it will always only ever be recognized as a single state and counted as a single state.
For what you want, I think you'd have do define multiple service checks. That's the only way you could, via the chosen dashlet, get a reliable count of how many drives on this remote machine are in a "CRITICAL" state.
For what you want, I think you'd have do define multiple service checks. That's the only way you could, via the chosen dashlet, get a reliable count of how many drives on this remote machine are in a "CRITICAL" state.
Former Nagios employee
https://www.mcapra.com/
https://www.mcapra.com/