Hi there,
I'm search for a way to aggregate notifications together in XI and am experimenting with the BPI feature. I have it working so far with the warning and critical thresholds thanks to the XI BPI Wizard. When a notification is triggered, I get an email with the following message:
CRITICAL - Group health below critical threshold of 90%! Health is 50.00% with 2 problem(s).
In the case above, I had a BPI Group with 4 services, 2 of them I forced into a CRITICAL state. How can I make the alert list the 2 "problem" services in the notification message? This would be ALOT more helpful to the person receiving the notifications.
I agree that would be more helpful. Unfortunately there is not a way to achieve this at the moment in XI, but I did file a feature request for this : TASK ID 9521
Sorry we can''t make this happen right now, but hopefully dev will be able to make it happen in a future release.
Be sure to check out the Knowledgebase for helpful articles and solutions!
bwallace wrote:I agree that would be more helpful. Unfortunately there is not a way to achieve this at the moment in XI, but I did file a feature request for this : TASK ID 9521
Sorry we can''t make this happen right now, but hopefully dev will be able to make it happen in a future release.
That's fantastic, thanks! Is there any way for me to view/track the request?
I've been hacking together some Ruby scripts to basically do this.
Sorry, there is no way for you to track it since I filed it internally. You'll just have to keep a watchful eye on the release notes. You should be commended on getting ruby scripts to do this!
Be sure to check out the Knowledgebase for helpful articles and solutions!
bwallace wrote:Sorry, there is no way for you to track it since I filed it internally. You'll just have to keep a watchful eye on the release notes. You should be commended on getting ruby scripts to do this!
https://github.com/ripienaar/ruby-nagio ... /README.md
Look at check_check. It will look at services of all or a specific host (excluding itself) and email you if X percent of services are down, or X quantity. It'll also lost out the affected services.