Upgrade 5.4.13 to 5.5.3 failed

Posted: Wed Sep 19, 2018 5:25 pm
by lytxnoc
CentOS release 6.7 (Final)
CPU(s): 8 cores
RAM: 8GB
OLD Nagios XI Version: 5.4.13

We tried the upgrade to 5.5.3 and it failed with the following errors:

Website: https://www.nagios.org
Reading configuration data...
Read main config file okay...
Error: Template '' specified in service definition could not be not found (config file '/usr/local/nagios/etc/servicetemplates.cfg', starting on line 1146)
Error: Could not find a service matching host name 'sdi1v-opsmon01.drivecam.net' and description 'NSClient Service Status' (config file '/usr/local/nagios/etc/servicedependencies.cfg', starting on line 457)
Error: Could not find a service matching host name 'bcntap1.drivecam.net' and description 'bcntap1_netapp_volumes' (config file '/usr/local/nagios/etc/servicedependencies.cfg', starting on line 457)
Error: Could not expand master service(s) (config file '/usr/local/nagios/etc/servicedependencies.cfg', starting at line 457)
Error: Could not expand dependent service(s) (at config file '/usr/local/nagios/etc/servicedependencies.cfg', starting on line 457)
Error processing object config files!
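
For anyone triaging a similar failure, the errors above all carry a config file path and a starting line number, so grouping them by location shows how many distinct definitions are involved. The following is only a minimal sketch in Python that works on pasted log text. The names `group_errors`, `LOCATION`, and `SAMPLE` are made up for illustration and are not part of Nagios XI, and the regular expression is an assumption based solely on the message wording seen in this thread.

```python
import re
from collections import defaultdict

# Matches the "(config file '<path>', starting on line <n>)" suffix.
# The wording varies slightly between messages in this thread
# ("at config file", "starting at line"), so both forms are accepted.
LOCATION = re.compile(
    r"\((?:at )?config file '(?P<path>[^']+)', "
    r"starting (?:on|at) line (?P<line>\d+)\)"
)


def group_errors(log_text):
    """Group 'Error:' lines by (config file, starting line number)."""
    grouped = defaultdict(list)
    for raw in log_text.splitlines():
        line = raw.strip()
        if not line.startswith("Error:"):
            continue  # skips the final "Error processing ..." summary line
        match = LOCATION.search(line)
        if match is None:
            continue  # an error line without a location suffix
        key = (match.group("path"), int(match.group("line")))
        message = line[len("Error:"):match.start()].strip()
        grouped[key].append(message)
    return dict(grouped)


# A few lines copied from the output posted above.
SAMPLE = """\
Error: Template '' specified in service definition could not be not found (config file '/usr/local/nagios/etc/servicetemplates.cfg', starting on line 1146)
Error: Could not expand master service(s) (config file '/usr/local/nagios/etc/servicedependencies.cfg', starting at line 457)
Error: Could not expand dependent service(s) (at config file '/usr/local/nagios/etc/servicedependencies.cfg', starting on line 457)
Error processing object config files!
"""

if __name__ == "__main__":
    for (path, lineno), messages in sorted(group_errors(SAMPLE).items()):
        print(f"{path}:{lineno}")
        for message in messages:
            print(f"  - {message}")
```

Applied to the full output, this would put every servicedependencies.cfg error under the single definition starting on line 457, with the empty template name in servicetemplates.cfg (line 1146) as a separate item.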

When we ran the pre-flight check, there were no errors. Below is the log of the check:

Checking objects...
Warning: Service 'Guest Snapshots - Datacenter - KIO' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Warning: Service 'Prod_IBM - Cluster DRS Status' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Warning: Service 'Prod_IBM - Cluster HA Status' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Warning: Service 'Prod_IBM - Cluster Swapfile Status' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Warning: Service 'Prod_Lenovo - Cluster DRS Status' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Warning: Service 'Prod_Lenovo - Cluster HA Status' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Warning: Service 'Prod_Lenovo - Cluster Swapfile Status' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Warning: Service 'Prod_Pod6-7_SQL - Cluster DRS Status' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Warning: Service 'Prod_Pod6-7_SQL - Cluster HA Status' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Warning: Service 'Prod_Pod6-7_SQL - Cluster Swapfile Status' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Checked 10433 services.
Warning: Host '_test_event_handler' has no default contacts or contactgroups defined!
Warning: Host 'dcrepl1.drivecaminc.loc' has no default contacts or contactgroups defined!
Checked 977 hosts.
Checked 580 host groups.
Checked 83 service groups.
Checked 61 contacts.
Checked 24 contact groups.
Checked 298 commands.
Checked 75 time periods.
Checked 0 host escalations.
Checked 0 service escalations.
Checking for circular paths...
Checked 977 hosts
Checked 2204 service dependencies
Checked 838 host dependencies
Checked 75 timeperiods
Checking global event handlers...
Checking obsessive compulsive processor commands...
Checking misc settings...

Total Warnings: 12
Total Errors: 0

Things look okay - No serious problems were detected during the pre-flight check
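
To compare a saved pre-flight log like the one above against the upgrade output, it can help to pull out the totals and tally the warnings by type. This is a rough Python sketch that assumes the log has been saved as text. `summarize_preflight`, `TOTAL`, and `SAMPLE` are hypothetical names for illustration only, and the parsing relies purely on the line formats visible in this post.

```python
import re
from collections import Counter

# Matches the "Total Warnings: N" and "Total Errors: N" summary lines.
TOTAL = re.compile(r"^Total (Warnings|Errors):\s*(\d+)\s*$")


def summarize_preflight(log_text):
    """Return (totals, kinds): the reported totals and a per-type warning tally."""
    totals = {}
    kinds = Counter()
    for raw in log_text.splitlines():
        line = raw.strip()
        match = TOTAL.match(line)
        if match:
            totals[match.group(1).lower()] = int(match.group(2))
        elif line.startswith("Warning: Service "):
            kinds["service"] += 1
        elif line.startswith("Warning: Host "):
            kinds["host"] += 1
        elif line.startswith("Warning:"):
            kinds["other"] += 1
    return totals, kinds


# A shortened excerpt of the log posted above (three of the twelve warnings),
# with the totals adjusted to match the excerpt.
SAMPLE = """\
Warning: Service 'Prod_IBM - Cluster HA Status' on host 'sdi1v-opsvcs01.drivecam.net' has a notification interval less than its check interval! Notifications are only re-sent after checks are made, so the effective notification interval will be that of the check interval.
Warning: Host '_test_event_handler' has no default contacts or contactgroups defined!
Warning: Host 'dcrepl1.drivecaminc.loc' has no default contacts or contactgroups defined!
Total Warnings: 3
Total Errors: 0
"""

if __name__ == "__main__":
    totals, kinds = summarize_preflight(SAMPLE)
    print("reported:", totals)
    print("counted warnings:", sum(kinds.values()), dict(kinds))
    print("pre-flight clean:", totals.get("errors") == 0)
```

On the full log posted above, the tally would be 10 service warnings plus 2 host warnings, which agrees with the reported total of 12 warnings and 0 errors.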

Re: Upgrade 5.4.13 to 5.5.3 failed

Posted: Thu Sep 20, 2018 7:42 am
by scottwilkerson
Can you verify that Apply Configuration works from the web UI before running the upgrade? There may be changes in the CCM that have not been applied yet, and those could contain breaking configs.

Re: Upgrade 5.4.13 to 5.5.3 failed

Posted: Fri Sep 21, 2018 10:38 am
by lytxnoc
Yes, Apply Configuration works from the UI.

Re: Upgrade 5.4.13 to 5.5.3 failed

Posted: Fri Sep 21, 2018 10:40 am
by scottwilkerson
OK, let's run the upgrade again. Please report any errors.

Re: Upgrade 5.4.13 to 5.5.3 failed

Posted: Fri Sep 28, 2018 1:10 pm
by lytxnoc
Ran an update to 5.5.4 this time. Same error. I'm sending you the log attachment directly.

Re: Upgrade 5.4.13 to 5.5.3 failed

Posted: Fri Sep 28, 2018 1:20 pm
by scottwilkerson
Can you open a ticket here and reference this thread? I believe a remote session may be the best way to resolve the issue.

https://support.nagios.com/tickets/