a few questions for you:
- do you have certain checks that fail, or keep threads open? SNMP can cause this as the timeout is quite long.
- what type of disks is the server running on? performance wise, you're getting close to what normal hardware can handle before it needs to be tweaked.
- is this a physical machine or a VM?
Number of Nagios workers causing interruptions
- tacolover101
- Posts: 432
- Joined: Mon Apr 10, 2017 11:55 am
-
kyang
Re: Number of Nagios workers causing interruptions
Thanks for the help @tacolover101!
reincarne, please respond to tacolover's questions and let us know the answers.
Also, is the DB offloaded still or back in the XI server?
reincarne, please respond to tacolover's questions and let us know the answers.
Also, is the DB offloaded still or back in the XI server?
Re: Number of Nagios workers causing interruptions
Well, there are some checks that fails - some of them as a result of a real issue, some of them can be caused by security issues etc.tacolover101 wrote:a few questions for you:
- do you have certain checks that fail, or keep threads open? SNMP can cause this as the timeout is quite long.
- what type of disks is the server running on? performance wise, you're getting close to what normal hardware can handle before it needs to be tweaked.
- is this a physical machine or a VM?
Still, why Nagios has to create new workers? New workers sort of create zombies which then keep some old data mixed with an updated data and causing load on the server.
We fixed it by creating a crontab job that will monitor number of workers and kill zombies
-
dwhitfield
- Former Nagios Staff
- Posts: 4583
- Joined: Wed Sep 21, 2016 10:29 am
- Location: NoLo, Minneapolis, MN
- Contact:
Re: Number of Nagios workers causing interruptions
The workers shouldn't be zombies, but it sounds like you've got a resolution. Are we ready to lock this up?reincarne wrote: We fixed it by creating a crontab job that will monitor number of workers and kill zombies