High load after upgrade to 5.4.2 - large # of cron jobs
Posted: Thu Feb 16, 2017 1:44 pm
Hello,
We've been running Nagios XI for some years on an older server that is starting to fail. I had already planned on migrating to a new machine, and when the old one started locking up on a regular basis I accelerated the process.
The old machine was a 32-bit machine running Centos 5.x. It had a single processor and was maxed out at 4GB of ram, but ran Nagios XI decently. I built the new machine with 4 processor cores and 8GB of ram, on a 64-bit machine running Centos 7.x. I installed Nagios XI version 5.4.2 on the new machine, and per the migration instructions, upgraded the old machine to 5.4.2 (it was running a recent version, but not that one). Immediately after the upgrade the load went way up on the old machine from an average of 3.x to 60+, and remained high.
The machine also locked up shortly thereafter, so I proceeded with the migration, fearing the old machine would die completely before long. I was able to accomplish the migration, including converting the rrd databases, and shortly thereafter the new machine's load shot up as well. It's now over 60, and has been that high since yesterday. This is seriously impacting performance, and some of the checks are timing out because of it.
What I'm seeing is that perfdataproc.php is spawning every five minutes and not completing. There are 190 instances of it running right now.
I have some other migration issues, but until I get the load under control I can't be sure if they're due to the load or external factors.
If there's something simple I should do I'd like to hear about it. If this is likely going to be a difficult issue I'll open a support case.
To recap - this happened on the old machine immediately after upgrading, and also happened on the new machine as soon as I did a restore from the old machine, so I think it is related to the old machine's configuration.
The new machine is running in a VM, so I can increase resources if necessary, but having given it 4x the processor power and 2x the memory, plus substantially faster disks, I don't think offhand it's a resource starvation issue.
Thanks.
We've been running Nagios XI for some years on an older server that is starting to fail. I had already planned on migrating to a new machine, and when the old one started locking up on a regular basis I accelerated the process.
The old machine was a 32-bit machine running Centos 5.x. It had a single processor and was maxed out at 4GB of ram, but ran Nagios XI decently. I built the new machine with 4 processor cores and 8GB of ram, on a 64-bit machine running Centos 7.x. I installed Nagios XI version 5.4.2 on the new machine, and per the migration instructions, upgraded the old machine to 5.4.2 (it was running a recent version, but not that one). Immediately after the upgrade the load went way up on the old machine from an average of 3.x to 60+, and remained high.
The machine also locked up shortly thereafter, so I proceeded with the migration, fearing the old machine would die completely before long. I was able to accomplish the migration, including converting the rrd databases, and shortly thereafter the new machine's load shot up as well. It's now over 60, and has been that high since yesterday. This is seriously impacting performance, and some of the checks are timing out because of it.
What I'm seeing is that perfdataproc.php is spawning every five minutes and not completing. There are 190 instances of it running right now.
I have some other migration issues, but until I get the load under control I can't be sure if they're due to the load or external factors.
If there's something simple I should do I'd like to hear about it. If this is likely going to be a difficult issue I'll open a support case.
To recap - this happened on the old machine immediately after upgrading, and also happened on the new machine as soon as I did a restore from the old machine, so I think it is related to the old machine's configuration.
The new machine is running in a VM, so I can increase resources if necessary, but having given it 4x the processor power and 2x the memory, plus substantially faster disks, I don't think offhand it's a resource starvation issue.
Thanks.