Page 1 of 1
Acknowledgement Data missing
Posted: Fri Nov 28, 2014 2:41 am
by rajasegar
Nagios XI 2014R1.2
Yesterday we had an incident where the partition hosting Nagios became full.
Today we noticed all the acknowledgement data is missing
Please advice
1) Where is the acknowledgement, comments etc stored? If DB which table.
2) How to restore the data from backup
Thanks
Re: Acknowledgement Data missing
Posted: Fri Nov 28, 2014 7:52 pm
by Box293
Where abouts are you noticing all the acknowledgement data is missing. Some screenshots will help.
Re: Acknowledgement Data missing
Posted: Sat Nov 29, 2014 7:58 am
by rajasegar
Box293 wrote:Where abouts are you noticing all the acknowledgement data is missing. Some screenshots will help.
From the monitors itself, all previous acknowledgement is gone from host & service history.
Only the ones added on that day are there.
Same thing from Incident Management->Acknowledgement & Comments.
Previous scheduled tasks history are also gone.
In short it is gone from everywhere.
Re: Acknowledgement Data missing
Posted: Mon Dec 01, 2014 11:50 am
by abrist
If retention.dat could not be written out, runtime state data (acks,comments, etc) will be lost. As you had a partition filled, that would be my guess.
Re: Acknowledgement Data missing
Posted: Mon Dec 01, 2014 5:58 pm
by rajasegar
abrist wrote:If retention.dat could not be written out, runtime state data (acks,comments, etc) will be lost. As you had a partition filled, that would be my guess.
So are you saying the acknowledgement, comments are stored in retention.dat?
Re: Acknowledgement Data missing
Posted: Mon Dec 01, 2014 6:02 pm
by abrist
Yes.
Re: Acknowledgement Data missing
Posted: Mon Dec 01, 2014 6:08 pm
by rajasegar
abrist wrote:Yes.
I recall Nagios support asked me to delete the retention.dat as part of the solution to solve the upgrade problem.
Nobody told us about the side effects. Lucky I reverted back else there would have been big time noise from the monitoring team.
Re: Acknowledgement Data missing
Posted: Tue Dec 02, 2014 11:25 am
by slansing
That is correct, there are some issues that can be "jump-start" resolved by removing retention.dat, however, it does get re-created upon restart if it was missing. As abrist noted, it contains things such as state information, comments, and downtime so it can be used to bring your nagios system back to where it was if the main process is killed off and restarted.