[Nagios-devel] Nagios and Gearman - huge environment performance

Posted: Fri Aug 19, 2011 1:29 pm
by Guest

Hi everybody,

I'm testing Nagios with Gearman / Mod_Gearman. I'd like to replace NSCA with
this new approach, as it seems easier to configure and has a lot of
advantages. Besides, NSCA and the Nagios freshness mechanism have some
problems.

Gearman and mod_gearman are working well. I have 30000 hosts and 60000
services, and it is increasing!
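
To put that load in perspective, here is a rough sketch of the average check rate Nagios has to sustain. The 5-minute check interval is only an assumption for illustration; the actual intervals are not stated here, and the function name is hypothetical.

```python
# Rough estimate of the active-check rate Nagios must sustain.
# CHECK_INTERVAL_S is an ASSUMPTION (5 minutes); the real check
# intervals of this installation are not given in the post.
HOSTS = 30000
SERVICES = 60000
CHECK_INTERVAL_S = 300  # assumed value, for illustration only


def required_checks_per_second(hosts: int, services: int, interval_s: int) -> float:
    """Average checks per second needed to run every check once per interval."""
    return (hosts + services) / interval_s


rate = required_checks_per_second(HOSTS, SERVICES, CHECK_INTERVAL_S)
print(f"{rate:.0f} checks/second")
```

Under that assumption the scheduler would need to dispatch a few hundred checks every second on average, so any pause in scheduling shows up quickly as latency.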

Now I'm having a problem with Nagios performance: it eats 100% of the CPU, and
the host and service latency is very high, around 300 seconds. I think that
this is a Nagios problem, as gearman_top shows the Jobs Waiting queue empty
almost all the time. It seems that Nagios does not send active checks
continuously; instead, once in a while it sends a burst of them.

I have a physical central server running RHEL, with 4 GB of RAM and an
Intel(R) Xeon(R) CPU E5504 @ 2.00GHz (8 CPUs). For the workers I have 9
virtual servers, also running RHEL.

I've already set the Nagios parameters for large environments, as recommended
in the documentation, but it made no difference.

Nagios parameters for large environments:

- use_large_installation_tweaks=1

- enable_environment_macros=0

- max_concurrent_checks=0

- check_result_reaper_frequency=10

Could someone help me? How can I improve Nagios performance to make active
checks faster?

Thank you very much.

This post was automatically imported from historical nagios-devel mailing list archives
Original poster: [email protected]