Nagios box with disk space issues losing acknowledgements

Posted: Mon Oct 04, 2021 10:29 am
by optionstechnology
So, sometimes my Nagios boxes die, and because they keep spooling checks/perfdata/SNMP traps they run out of disk space

Problem is, when Nagios is restarted it dumps all of the current acknowledgements to file, but since there is no space for it, it essentially deletes all the acks, and we have to go through a lengthy process of figuring out which ticket in our system goes with which ack

I don't really want to go down the road of persistent comments as I then have to go and clear them out every so often

There has got to be a better way of doing this - unless there is an obvious answer I'm missing?

The only thing I can think of is randomly stopping/starting the service and making a copy of the retention file, which is a really messy solution

Re: Nagios box with disk space issues losing acknowledgement

Posted: Tue Oct 05, 2021 2:45 pm
by pbroste
Hello @optionstechnology

Thanks for reaching out on acknowledgment options.

Took a look at this on my test VM and you are correct: the best solution would be to create a cron job that makes a copy of the retention file somewhere else, off the Nagios disk (cat /usr/local/nagios/var/retention.dat > /mnt/some/place/different.txt).

Depending on how busy things are in your environment, you may be able to 'cat' the retention.dat without stopping the service.
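
As a rough sketch of what that cron job could call (not something tested against a production box): the function name backup_retention and the timestamped file naming below are made up for illustration, and the destination directory is a placeholder. It skips a missing or empty retention file so that an already-wiped file does not get saved as if it were a good copy, and it writes to a temporary name first so a half-written copy never carries the final name.

```shell
#!/bin/sh
# Hypothetical helper: copy the Nagios retention file to a backup directory.
# Usage: backup-retention.sh /usr/local/nagios/var/retention.dat /mnt/some/place

backup_retention() {
    src=$1
    dest_dir=$2
    stamp=$(date +%Y%m%d-%H%M%S)
    tmp="$dest_dir/.retention.dat.tmp.$$"

    # Skip a missing or zero-length source (e.g. after a disk-full restart).
    [ -s "$src" ] || return 1

    # Copy under a temporary name, then rename, so a partial copy
    # is never left behind under the final name.
    cp "$src" "$tmp" || { rm -f "$tmp"; return 1; }
    mv "$tmp" "$dest_dir/retention.dat.$stamp"
}

# Run only when both arguments are supplied on the command line.
if [ "$#" -eq 2 ]; then
    backup_retention "$1" "$2"
fi
```

Each run leaves a new timestamped copy, so older copies would need to be pruned now and then, and the destination should sit on a different filesystem than the one that fills up. Since the copy is taken while Nagios is running, keeping a few generations gives you something to fall back on if one copy happens to be caught mid-write.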

Thanks,
Perry