Nagios XI - Schedulded Backups No Longer Working


Problem Description

When looking at the location that the scheduled backups are stored in, there are no new xxxxxxx.tar.gz files being created in this location.

Additionally,  in the /store/backups/nagiosxi/ location there are lots of folders that are not being removed as per the Backup Limit setting.

 

Explanation

The root cause of the issue is that the backups are failing. When this happens the following occurs:

 

The most common cause of the backup failing is due to a database corruption.

 

Resolving The Problem

To identify why the backups are failing, you will need to execute a manual backup in an SSH session on your Nagios XI server. This will generate verbose output that will help determine why the backup is failing.

The procedure for performing a manual backup is explained in this document:

Documentation - Backing Up And Restoring Your Nagios XI System

As mentioned before, the most common cause of failing backups is a corrupt database. Here is an example of a failed backup due to a corrupt database:

Running configuration check...
Stopping nagios: done.
Starting nagios: done.
Backing up Core Config Manager (NagiosQL)...
tar: Removing leading `/' from member names
tar: Removing leading `/' from member names
Backing up Nagios Core...
tar: Removing leading `/' from member names
tar: /usr/local/nagios/var/ndo.sock: socket ignored
tar: /usr/local/nagios/var/rw/nagios.qh: socket ignored
Backing up Nagios XI...
tar: Removing leading `/' from member names
Backing up MRTG...
tar: Removing leading `/' from member names
Backing up NRDP...
tar: Removing leading `/' from member names
Backing up Nagvis...
tar: Removing leading `/' from member names
Backing up MySQL databases...
mysqldump: Got error: 130: Incorrect file format 'nagios_flappinghistory' when using LOCK TABLES
Error backing up MySQL database 'nagios' - check the password in this script!

 

The error "check the password in this script" is a generic message and is not the cause, the line before it explains the issue in this example.

Please follow this article to repair the database:

Documentation - Crashed Database Tables

 

Once the database has been repaired, run a manual backup to confirm that the issue has been resolved.

 

To remove the directories of failed backup files, execute the following command (this will not touch any .tar.gz files):

find /store/backups/nagiosxi/ -mindepth 1 -maxdepth 1 -type d -exec rm -rf '{}' \;

 

Watching Scheduled Backups

If you need to see what is actually occurring when the scheduled backup is run, you can watch the output live by executing the following command:

tail /usr/local/nagiosxi/var/cmdsubsys.log -f

Press CTRL + C when you have finished.

 

Final Thoughts

For any support related questions please visit the Nagios Support Forums at:

http://support.nagios.com/forum/



Article ID: 514
Created On: Fri, Jun 3, 2016 at 12:25 AM
Last Updated On: Fri, Apr 24, 2020 at 11:16 AM
Authored by: tlea

Online URL: https://support.nagios.com/kb/article/nagios-xi-schedulded-backups-no-longer-working-514.html