Core Worker timed out and failed to reap child
Re: Core Worker timed out and failed to reap child
If we do have a situation where the server is just overloaded, what types of symptoms would we be seeing?
-
benjaminsmith
- Posts: 5324
- Joined: Wed Aug 22, 2018 4:39 pm
- Location: saint paul
Re: Core Worker timed out and failed to reap child
Hi Henry,
You would see a high CPU load and high check latency since Nagios is unable to schedule the checks on time and this is what I'm seeing in the system profile.
Top Command Shows High CPU Load
Kernal Message Queues ( slow to write results to the databsae)
I noticed that you have quite a few NCPA checks setup for an interval of every 3 minutes, if you can increase it 5 minutes that would help substantially.
The system has 8 Single Core CPU's, if you're able to add more that would help since scheduling active host and service check is CPU intensive.
Other options to reduce load would be to set up passive checks or integrating Mod Gearman.
Using NCPA For Passive Check
Integrating Mod-Gearman With Nagios XI
You would see a high CPU load and high check latency since Nagios is unable to schedule the checks on time and this is what I'm seeing in the system profile.
Top Command Shows High CPU Load
Code: Select all
top - 07:54:43 up 5 days, 23:21, 1 user, load average: 320.75, 313.22, 406.45
Tasks: 592 total, 177 running, 411 sleeping, 0 stopped, 4 zombie
Code: Select all
------ Message Queues --------
key msqid owner perms used-bytes messages
0xd7000002 17 nagios 600 12948480 12645
The system has 8 Single Core CPU's, if you're able to add more that would help since scheduling active host and service check is CPU intensive.
Other options to reduce load would be to set up passive checks or integrating Mod Gearman.
Using NCPA For Passive Check
Integrating Mod-Gearman With Nagios XI
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Be sure to check out our Knowledgebase for helpful articles and solutions!
Re: Core Worker timed out and failed to reap child
I have reached out to our server team. They will be adding CPU's next week Friday and the following Friday.
-
benjaminsmith
- Posts: 5324
- Joined: Wed Aug 22, 2018 4:39 pm
- Location: saint paul
Re: Core Worker timed out and failed to reap child
Hi Henry,
Great. Let us know the results.I have reached out to our server team. They will be adding CPU's next week Friday and the following Friday.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Be sure to check out our Knowledgebase for helpful articles and solutions!