Just finished a 4-day weekend...
Throughout my weekend, one service went into a critical state. It was something that could wait until I returned, so I remoted in, acknowledged it, and went on my merry way...
Nagios had other ideas...
Looking at the notification log, I see where it alerted on being critical and I responded to it. Then it would send a warning 6 min later, and start with the criticals again 5 min after that.
I ended up needing to disable alerts for that service to get it to shut up.
After a short time, it would start alerting again. It hadn't recovered, so it should not have started complaining again. Any ideas why it would do that? Other than it missing me while I was away from work?
Self un-Acknowledgment
Everybody is somebody else’s weirdo
Re: Self un-Acknowledgment
Can you post screenshots of the State History and Notifications reports showing the service in question during this time period?
Be sure to check out our Knowledgebase for helpful articles and solutions!
Re: Self un-Acknowledgment
Warnings are set at 93% and critical at 95%
Re: Self un-Acknowledgment
Did the service recover sometime between 02:00:02 and 03:26:17, and 03:55:01 and 04:01:04? You didn't show us a screenshot from the "State History" report...
Re: Self un-Acknowledgment
Between 2:00 and 3:26 it remained silent, but it did start bleating again at 3:26.
scottwilkerson (DevOps Engineer, Nagios Enterprises)
Re: Self un-Acknowledgment
Do you have is_volatile set for the service, either in the definition or in an underlying template?
http://nagios.sourceforge.net/docs/nagi ... vices.html
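One quick way to answer that without reading every file by hand is to scan the object configuration for service blocks that set is_volatile. The following is only a rough sketch: the sample definitions, the template name generic-disk, the host name fileserver01, and the check_command line are all made up for illustration, and the parser assumes plain `define service { ... }` blocks with no `}` inside directive values.

```python
import re

def volatile_services(config_text):
    """Return (name, value) for every service block that sets is_volatile."""
    hits = []
    # Nagios object blocks do not nest, so a non-greedy match up to the
    # first closing brace is enough for this sketch.
    for block in re.findall(r"define\s+service\s*\{(.*?)\}", config_text, flags=re.S):
        directives = {}
        for line in block.splitlines():
            line = line.split(";", 1)[0].strip()  # ';' starts an inline comment
            if not line or line.startswith("#"):
                continue
            parts = line.split(None, 1)
            if len(parts) == 2:
                directives[parts[0]] = parts[1].strip()
        if "is_volatile" in directives:
            # Templates usually carry "name" rather than "service_description".
            name = directives.get("service_description") or directives.get("name", "?")
            hits.append((name, directives["is_volatile"]))
    return hits

# Hypothetical sample configuration, not taken from the poster's system.
sample = """
define service {
    name                 generic-disk      ; hypothetical template
    is_volatile          0
    register             0
}
define service {
    use                  generic-disk
    host_name            fileserver01
    service_description  Disk Usage
    check_command        check_disk!93!95
}
"""

print(volatile_services(sample))
```

Any block reported with a value other than 0 would be worth a closer look, including templates the service inherits from through `use`.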
Re: Self un-Acknowledgment
No I do not. It is just a hard drive check, nothing fancy
jdalrymple
Re: Self un-Acknowledgment
This *looks* to me like you didn't check the sticky acknowledgment checkbox. Did you/Do you?
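For reference, the same checkbox corresponds to the sticky field of the ACKNOWLEDGE_SVC_PROBLEM external command: a value of 2 keeps the acknowledgment until the service returns to OK, while any other value lets it be removed on the next state change (for example CRITICAL to WARNING). Below is a minimal sketch that only builds the command line; the host, service, author, and comment values are placeholders, and writing the line to the Nagios command file is left out because its path depends on the installation.

```python
import time

def ack_service_command(host, service, author, comment,
                        sticky=True, notify=True, persistent=False, now=None):
    """Build an ACKNOWLEDGE_SVC_PROBLEM external command line.

    sticky=True sends 2 (acknowledgment stays until the service is OK);
    sticky=False sends 1 (acknowledgment is dropped on any state change).
    """
    ts = int(time.time() if now is None else now)
    sticky_flag = 2 if sticky else 1
    return "[{}] ACKNOWLEDGE_SVC_PROBLEM;{};{};{};{};{};{};{}\n".format(
        ts, host, service, sticky_flag, int(notify), int(persistent), author, comment
    )

# Placeholder values for illustration only.
line = ack_service_command("fileserver01", "Disk Usage", "admin",
                           "will fix after the weekend", now=1700000000)
print(line, end="")
```

If the acknowledgment in the log turns out to carry a sticky value of 1, a CRITICAL to WARNING to CRITICAL flap would be enough to clear it and restart notifications.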
Re: Self un-Acknowledgment
Didn't do anything different than what I've done any other time I acknowledged an alert.
It's defaulted to being 'sticky'
jdalrymple
Re: Self un-Acknowledgment
I think that to better understand what happened there, we'll have to sort through your nagios.log from around that time.
It would be preferable if you can recreate the issue, but if not, can we grab the logs around one of the times you experienced the problem over the weekend?
Maybe a full service definition too, including any templates you have customized.
You're right: if the sticky acknowledgment option was checked, then the behavior you have described is improper. Head scratcher.
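To narrow nagios.log down to what matters here, something like the sketch below could pull out only the state changes, notifications, and external commands for one service. It assumes the usual `[epoch] TYPE: field;field;...` line layout; the host and service names and the sample lines are made up for illustration and are not from the poster's log.

```python
import re
from datetime import datetime, timezone

LINE_RE = re.compile(r"^\[(\d+)\] ([A-Z ]+): (.*)$")
INTERESTING = {"SERVICE ALERT", "SERVICE NOTIFICATION", "EXTERNAL COMMAND"}

def service_events(lines, host, service):
    """Return (UTC time, entry type, rest of line) for one host/service pair."""
    events = []
    for raw in lines:
        m = LINE_RE.match(raw.strip())
        if not m:
            continue  # free-form lines such as retention auto-saves
        ts, kind, rest = m.groups()
        if kind not in INTERESTING:
            continue
        fields = rest.split(";")
        # Host and service sit at different positions depending on the entry
        # type, so this sketch simply checks that both appear as fields.
        if host in fields and service in fields:
            when = datetime.fromtimestamp(int(ts), tz=timezone.utc)
            events.append((when.strftime("%Y-%m-%d %H:%M:%S"), kind, rest))
    return events

# Made-up sample lines in the standard format.
sample_log = [
    "[1431914402] SERVICE ALERT: fileserver01;Disk Usage;CRITICAL;HARD;3;DISK CRITICAL",
    "[1431914500] EXTERNAL COMMAND: ACKNOWLEDGE_SVC_PROBLEM;fileserver01;Disk Usage;2;1;0;admin;back Tuesday",
    "[1431914800] SERVICE ALERT: otherhost;Disk Usage;WARNING;HARD;3;DISK WARNING",
    "[1431915000] Auto-save of retention data completed successfully.",
]

events = service_events(sample_log, "fileserver01", "Disk Usage")
for when, kind, rest in events:
    print(when, kind, rest)
```

Reading the result in order shows whether a state change happened between the acknowledgment and the next notification, and which sticky value the acknowledgment was actually sent with.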