So, sometimes my Nagios boxes die, and due to spooling checks/perfdata/snmp traps they run out of disk space
Problem is, when nagios is restarted it dumps all of the current acknowledgements to file - but since it has no space for it, it instead essentially deletes all acks and we have to go though a lengthy process of figuring out which ticket in our system goes with which ack
I don't really want to go down the road of persistent comments as I then have to go and clear them out every so often
There has got to be a better way of doing this - unless there is an obvious answer I'm missing?
The only thing i can think of is randomly stop/starting the service and making a copy of the retention file, which is a really messy solution
Nagios box with disk space issues losing acknowledgements
-
optionstechnology
- Posts: 234
- Joined: Thu Nov 17, 2016 11:26 am
Re: Nagios box with disk space issues losing acknowledgement
Hello @optionstechnology
Thanks for reaching out on acknowledgment options.
Took look at this on my test VM and you are correct the best solution would be to create a cron job that creates a copy (cat /usr/local/nagios/var/retention.dat > /mnt/some/place/different.txt) offline somewhere.
Depending on how busy things are in your environment, you may be able to 'cat' the retention.dat without stopping service.
Thanks,
Perry
Thanks for reaching out on acknowledgment options.
Took look at this on my test VM and you are correct the best solution would be to create a cron job that creates a copy (cat /usr/local/nagios/var/retention.dat > /mnt/some/place/different.txt) offline somewhere.
Depending on how busy things are in your environment, you may be able to 'cat' the retention.dat without stopping service.
Thanks,
Perry