There seems to be a bug in the health calculation in check_bpi.
Imagine two services: one (svc1) is in the OK state, the other (svc2) is in the CRITICAL state, and both are members of the same BPI group. If svc2 is NOT an essential member, the health of the group is 50%, which is correct and expected. However, if I mark svc2 as an essential member, the health is 100%. In this case, I'd expect the health to be either 50% or 0% (a very important component (essential member) of the service (BPI group) is CRITICAL).
As it stands, health graphs are usable only for BPI groups with no essential members, and I cannot show my managers a health of 100% when the service was actually down.
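To make the expected behavior concrete, here is a minimal Python sketch of what I have in mind. This is not the actual check_bpi code; the function name `group_health`, the `essential_zeroes_health` flag and the tuple layout are all hypothetical, and I am assuming health is simply the share of members in the OK state.

```python
# Hypothetical model of the behavior I would expect, NOT the real check_bpi logic.
def group_health(members, essential_zeroes_health=False):
    """Return group health as a percentage.

    members: list of (state, is_essential) tuples, e.g. ("OK", False).
    essential_zeroes_health: if True, any non-OK essential member
    forces health to 0% (the stricter of the two behaviors I suggest).
    """
    if not members:
        return 100.0  # assumption: an empty group counts as healthy

    # Assumed base rule: health is the percentage of members in the OK state.
    ok_count = sum(1 for state, _ in members if state == "OK")
    health = 100.0 * ok_count / len(members)

    # Check whether any essential member is in a problem state.
    essential_down = any(
        state != "OK" and is_essential for state, is_essential in members
    )
    if essential_down and essential_zeroes_health:
        return 0.0
    return health


# svc1 is OK, svc2 is CRITICAL
print(group_health([("OK", False), ("CRITICAL", False)]))  # 50.0
print(group_health([("OK", False), ("CRITICAL", True)]))   # 50.0
print(group_health([("OK", False), ("CRITICAL", True)],
                   essential_zeroes_health=True))           # 0.0
```

Either of the last two results would be fine for me. The point is that marking svc2 as essential should never raise the reported health above the 50% of the non-essential case.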
In addition, here are some suggestions to improve the BPI component:
- Add a filter box for the Available Hosts, Services and BPI Groups list. When this list contains thousands of items, it is hard to find the one I need.
- Add checkboxes to the items in the Authorized Users list. Selecting by clicking rows alone leaves room for one wrong click, after which all authorized users are no longer authorized. Alternatively, create two lists, All Users and Authorized Users, with buttons to move users between them. The latter would be even better: you would see all authorized users without having to scroll through the list of all users (especially useful when you have many users).
Thanks.