Re: Creating an alert for host stuck in boot loop

Posted: Mon Mar 02, 2020 5:21 pm
by rferebee
Yes, that's true, but in our specific use case I don't think upping the max retries would have a detrimental effect, because we're dealing with fast boot loops. The server just keeps rebooting itself over and over, each time within a few minutes of the last reboot.

I'm thinking that if we change the check interval to 15 minutes and allow 10 retry attempts, 1 minute apart, that should weed out any "normal" reboot, such as one for patching. Or maybe a 10-minute check interval with 7 max retries, 1 minute apart.

I just don't want a notification every time a server reboots. In my testing, the 5-1-5 setup sometimes didn't flip back to OK in time to suppress the notification, even though the server had already booted up.
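
To compare the settings mentioned above, here is a rough sketch in Python (not Nagios configuration). It assumes that "5-1-5" means check interval, retry interval, and max check attempts, in that order, that the first non-OK result counts as attempt 1, and that each later attempt runs one retry interval after the previous one. The function name and the labels are made up for illustration, and real scheduling can drift from these figures.

```python
def minutes_to_hard_state(max_attempts: int, retry_interval: int) -> int:
    """Minutes between the first non-OK result and the notifying (hard) state.

    Assumption: the first non-OK result is attempt 1, and every later
    attempt follows one retry interval after the previous one.
    """
    return (max_attempts - 1) * retry_interval


# (check interval, retry interval, max attempts); intervals are in minutes.
# The labels are shorthand for the setups discussed in the post.
candidates = {
    "5-1-5": (5, 1, 5),
    "15-1-10": (15, 1, 10),
    "10-1-7": (10, 1, 7),
}

for name, (check, retry, attempts) in candidates.items():
    window = minutes_to_hard_state(attempts, retry)
    print(f"{name}: notifies after about {window} min of consecutive non-OK results")
```

Under those assumptions, a server has to stay non-OK for the whole window before a notification goes out, so a longer window gives a single "normal" reboot more time to flip back to OK.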

Re: Creating an alert for host stuck in boot loop

Posted: Mon Mar 02, 2020 6:03 pm
by mbellerue
Yeah, actually when you put it like that, I think it will suit your needs. And if not, definitely play with it until it does work for you.

Re: Creating an alert for host stuck in boot loop

Posted: Wed Mar 04, 2020 11:19 am
by rferebee
I think we're good. You can go ahead and lock this. Thanks for your help!