Problems after upgradee to 5.5

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
nik.vu
Posts: 34
Joined: Tue Feb 07, 2017 4:28 pm

Re: Problems after upgradee to 5.5

Post by nik.vu »

I think i found the problem, when host goes down in 5.5 nagios constantly recheck every 1 minutes, tottaly different than in 5.4x. As you can see from atachement, this host is down for more than 4 hours and nagios still checking it on one minute interval. Thats tottaly opposite from config where check interval is set to 15 minutes. Like in 5.5 retry interval constantly check for down hosts and services, and this is real problem for setups that have more than 10.000 checks.

When i restore to previus version everything is ok with load, checks and XI works fine.

Code: Select all

define host {
    host_name                   KI_W009-KikindaGraditelj
    use                         xiwizard_switch_host
    address                     172.21.64.52
    parents                     Kikinda WIFI linkovi1
    hostgroups                  Kikinda
    max_check_attempts          5
    check_interval              15
    retry_interval              1
    check_period                xi_timeperiod_24x7
    contacts                    l1.provera,nagiosadmin
    notification_interval       0
    notification_period         xi_timeperiod_24x7
    first_notification_delay    10
    icon_image                  switch.png
    statusmap_image             switch.png
    _xiwizard                   switch
    register                    1
}
You do not have the required permissions to view the files attached to this post.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Problems after upgradee to 5.5

Post by scottwilkerson »

Thanks for putting this together, and we have confirmed this is a bug.
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
Envera IT
Posts: 159
Joined: Wed Jun 19, 2013 10:21 am

Re: Problems after upgradee to 5.5

Post by Envera IT »

I can confirm we're also seeing a massive increase in the CPU load after the 5.5 upgrade. ~13k checks here with around 500 being down or unreachable at any given time. CPU load has quadrupled.
I like graphs...
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Problems after upgradee to 5.5

Post by tgriep »

Can you run the following commands as root and post the output to the forum so we can see what is taking the most processing at this time?

Code: Select all

top -n 1
ps -ef --cols-300
ipcs -q
Thanks
Be sure to check out our Knowledgebase for helpful articles and solutions!
Envera IT
Posts: 159
Joined: Wed Jun 19, 2013 10:21 am

Re: Problems after upgradee to 5.5

Post by Envera IT »

tgriep wrote:Can you run the following commands as root and post the output to the forum so we can see what is taking the most processing at this time?

Code: Select all

top -n 1
ps -ef --cols-300
ipcs -q
Thanks
So I looked through some other threads and ran

Code: Select all

service nagios stop
killall -9 nagios
service nagios start
My issues resolved soon after and load dropped back down to normal levels.
I like graphs...
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Problems after upgradee to 5.5

Post by tmcdonald »

Good to hear. Did you have further (related) questions or are we good to lock this up?
Former Nagios employee
Envera IT
Posts: 159
Joined: Wed Jun 19, 2013 10:21 am

Re: Problems after upgradee to 5.5

Post by Envera IT »

tmcdonald wrote:Good to hear. Did you have further (related) questions or are we good to lock this up?
I'm not OP so will differ to him/her.
I like graphs...
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Problems after upgradee to 5.5

Post by tmcdonald »

Ehamby wrote:
tmcdonald wrote:Good to hear. Did you have further (related) questions or are we good to lock this up?
I'm not OP so will differ to him/her.
Ahh, thanks. Didn't scroll up far enough :)
Former Nagios employee
nik.vu
Posts: 34
Joined: Tue Feb 07, 2017 4:28 pm

Re: Problems after upgradee to 5.5

Post by nik.vu »

This isn't yet resolved, still is the same problem. Even with ok load, XI is unusable. I explained in the first post, when config is applied XI needs to much time to be back in normal state, also XI isn't doing checks in proper time. Check are late for about 30 minutes.

Also i need to much times to kill and restart nagios process via cli, when i restart process it's a little better, but thats not a proper solution.

On 5.4.3 with all these check everything is fine.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Problems after upgradee to 5.5

Post by scottwilkerson »

nik.vu wrote:I think i found the problem, when host goes down in 5.5 nagios constantly recheck every 1 minutes, tottaly different than in 5.4x. As you can see from atachement, this host is down for more than 4 hours and nagios still checking it on one minute interval. Thats tottaly opposite from config where check interval is set to 15 minutes. Like in 5.5 retry interval constantly check for down hosts and services, and this is real problem for setups that have more than 10.000 checks.
nik.vu,

This is a known issue we are working to get a resolution out soon in a 5.5.1 release, I would suspect late this week or early next.
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
Locked