Re: Nagios XI Performance Issue
Posted: Thu Oct 17, 2013 12:19 pm
Hi Team,
We are setting up a large environment to monitor approximately 4,000 devices with 40,000 service checks, most of which will be active checks.
Our hardware specs are as follows:
We run the Nagios XI server and the offloaded DB as separate servers in a virtual environment.
Both servers have the same hardware:
RAM: 32 GB
CPU: 16 cores
HDD: 1 TB (300 GB in use at present; the remaining space will be added later)
SAN storage with SAS drives, 1000 IOPS at present
At present we have configured 1800 devices with 10,000 services: nearly 4000 are SNMP checks configured through the Network Switch/Router Wizard, 2500 are from the VMware ESX monitor, and the remaining 3500 are from auto-discovery for various servers.
The php.ini limits are set as follows:
1. Maximum execution time (max_execution_time) = 1200
2. Maximum input time (max_input_time) = 90
3. Memory limit (memory_limit) = 4096M
>> We split our mrtg.cfg into multiple files to reduce the load and set up a RAM disk to reduce some of the I/O wait.
>> For network devices we set the check interval to 6 minutes, and for other devices to 10 minutes. All are active checks.
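To put the load in perspective, here is a rough estimate of the average check rate implied by the numbers above. It is only a sketch under assumptions: that the roughly 4000 SNMP services are the ones on the 6-minute interval, that the other 6000 services are on the 10-minute interval, that checks are spread evenly over time, and that the planned 40,000 services keep the same mix. The function name is just for illustration.

```python
def checks_per_second(services_by_interval_min):
    """Average active checks per second.

    services_by_interval_min maps a check interval in minutes
    to the number of services using that interval.
    """
    return sum(
        count / (interval_min * 60.0)
        for interval_min, count in services_by_interval_min.items()
    )


# Assumption: ~4000 SNMP services at 6 min, ~6000 other services at 10 min.
current = checks_per_second({6: 4000, 10: 6000})

# Assumption: the planned 40,000 services keep the same 40/60 mix.
planned = checks_per_second({6: 16000, 10: 24000})

print(f"current: {current:.1f} checks/s")
print(f"planned: {planned:.1f} checks/s")
```

Under these assumptions this works out to roughly 21 checks per second today and about 84 per second at 40,000 services, so the check load would be about four times what it is now.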
The present problems are:
1. When we open Nagios XI in a browser, it responds very slowly.
2. Sometimes, when I apply the configuration or add a device, it takes far too long to apply, in some cases hours.
3. Sometimes the status shows "Monitoring Engine is stopping", the graph next to the monitoring engine queue shows a sudden spike to 10,000 checks, and all service check execution stops. It works fine again once we restart the nagios service or the monitoring engine. While this is happening, graphs are not generated.
4. According to the SAR report, I/O wait is 10% and CPU load average is 30%.
Please suggest how to solve this issue, as we cannot add any new devices because of the poor performance.
Please also let us know of any tuning parameters we should apply on the Nagios XI server.
One more query:
Is there any way to modify hostgroup/servicegroup options using the Bulk Modification Tool? If so, please let us know.
Thanks in advance. Awaiting your reply.