
Re:Nagios XI Performance Issue

Posted: Thu Oct 17, 2013 12:19 pm
by ctrlshawkkey
Hi Team,

We are configuring a large environment to monitor approximately 4,000 devices with 40,000 service checks, most of which will be active checks.

My Hardware Specs are as below:

We are running the Nagios XI server and the offloaded DB as separate servers in a virtual environment.

The hardware details below are the same for both the Nagios XI server and the offloaded DB server.

RAM : 32GB
16 Cores CPU
1TB HDD (currently using 300GB; the remaining HDD space will be added later)
SAN Storage with SAS Drives and 1000 IOPS at present

At present, we have configured 1,800 devices with 10,000 services: nearly 4,000 are SNMP checks configured with the Network Switch/Router Wizard, 2,500 are from VMware ESX monitoring, and the remaining 3,500 service checks come from auto-discovery of different servers.

The php.ini limits are set as follows:

1. Maximum Execution time = 1200
2. Maximum Input time = 90
3. memory limit = 4096M

>> We split our mrtg.cfg into multiple files to reduce the load and set up a RAM disk to reduce some of the I/O wait.

>> For network devices, we set the check interval to 6 minutes, and for other devices to 10 minutes. All are active checks.
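As a rough sanity check on these intervals, here is a minimal Python sketch of the average active-check rate they imply. It assumes the roughly 4,000 SNMP services are the ones on the 6-minute interval and the other 6,000 services are on the 10-minute interval; that split is an assumption, not something confirmed by the configuration. The projection to 40,000 services simply scales the same mix and is hypothetical.

```python
# Rough estimate of the average active-check rate the scheduler must sustain.
# Assumption: ~4,000 SNMP services run every 6 minutes and the remaining
# ~6,000 services run every 10 minutes (split inferred, not confirmed).

def checks_per_second(groups):
    """groups: iterable of (service_count, interval_minutes) pairs."""
    return sum(count / (minutes * 60) for count, minutes in groups)

current = [(4000, 6), (6000, 10)]
rate_now = checks_per_second(current)

# Hypothetical projection: scale the same mix up to the planned 40,000 services.
scale = 40000 / sum(count for count, _ in current)
planned = [(count * scale, minutes) for count, minutes in current]
rate_planned = checks_per_second(planned)

print(f"current: {rate_now:.1f} checks/s")
print(f"planned: {rate_planned:.1f} checks/s")
```

This only gives an average; it does not model check latency, timeouts, or bursts after a restart, so it is best read as a lower bound on what the monitoring engine has to keep up with.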

The present problems are:

1. When we open Nagios XI in a browser, it responds very slowly.
2. Sometimes, when I apply configuration or add a device, it takes a very long time to apply, in some cases several hours.
3. Sometimes the status shows "Monitoring Engine is stopping", the graph next to the monitoring engine event queue shows a sudden spike to 10,000 checks, and all service check execution stops. It works fine again once we restart the nagios service or the monitoring engine. While this is happening, graphs are not generated.
4. From the SAR report, I/O wait is 10% and CPU load average is 30%.

Please suggest how to solve this issue, as we are unable to add any new devices because of the poor performance.

Please also provide any tuning parameters we need to apply on the Nagios XI server.

One more query:

Is there any option to modify hostgroups/service groups using the Bulk Modification Tool? If so, please let us know.

Thanks in advance. Awaiting your reply.

Re: Re:Nagios XI Performance Issue

Posted: Thu Oct 17, 2013 12:23 pm
by abrist
Is this a continuation of the thread: http://support.nagios.com/forum/viewtop ... 563#p74257 ?

Re: Re:Nagios XI Performance Issue

Posted: Wed Oct 23, 2013 8:44 am
by ctrlshawkkey
Hi Team,

Yes, it is a continuation of that thread, and this user ID belongs to the customer who purchased the Enterprise License.

We have been facing performance issues for the past 10 days. We received many suggestions from your team that helped, but we are still facing a lot of performance problems.

Today, we are not able to see any graphs under Monitoring Engine Event Queue / Scheduled Events Over Time, and the average host check latency and service check latency are increasing drastically.

We contacted the storage team, and the internal team has now provided 1900 IOPS.

Can you please check our configuration, as we have not been able to add any devices for the past 10 days?

Re: Re:Nagios XI Performance Issue

Posted: Wed Oct 23, 2013 10:41 am
by slansing
Does the user in question have access to the ticket system? If so I would advise we move this to a ticket so we can gather information that may involve "sensitive" data, as well as escalate to a remote session if needed.

Re: Re:Nagios XI Performance Issue

Posted: Wed Oct 23, 2013 11:01 am
by ctrlshawkkey
Hi Team,

Yes. The customer has access to the ticketing system, and we just got confirmation from the Nagios Sales team. So, can I move this issue to [email protected]?

Re: Re:Nagios XI Performance Issue

Posted: Wed Oct 23, 2013 11:21 am
by slansing
Yes, go ahead and do that.