Page 1 of 1

Report similar ro SLA but only include some services

Posted: Fri Apr 24, 2020 12:58 pm
by gormank
I'd like to get SLA info on some checks which are redundantly running on multiple (4 for example) hosts. If 3 of 4 of the checks succeed, the app is still working and there's no customer SLA impact. Of course there are lots of hosts and many checks.
Using hostgroups would include hosts so if one or more is down for example, it would impact the regular SLA report. Servicegroups are very tedious and don't seem like a good solution.
Any ideas?

Re: Report similar ro SLA but only include some services

Posted: Fri Apr 24, 2020 2:47 pm
by ssax
What I would probably do is create a BPI group and use the BPI Wizard to monitor that BPI group and use that for the SLA (in a servicegroup).

Or you could technically use the check_cluster plugin to do what you want:

https://assets.nagios.com/downloads/nag ... sters.html

Both would create a new service that would show the status of what you want (the real SLA).

I can't really think of any other options.

That doesn't really fix the SLA report though, only using the servicegroups would limit the services.

Re: Report similar ro SLA but only include some services

Posted: Mon Apr 27, 2020 11:57 am
by gormank
I'd never seen the cluster check before. Thanks for the pointer to it. Looks interesting.