CPU utilization for nagios server high
Posted: Sun Oct 21, 2012 2:22 am
Hi,
I have a nagios server installed monitoring aroung 90 hosts and around 500 services.
The CPU utilization always comes around 90%, and most of the CPU load is taken by nagios processes:
See the output below:
[root@msmontest01 ~]# ps -eo pcpu,pid,user,args | sort -k 1 -r | head -25
%CPU PID USER COMMAND
9.0 25194 nagios [nagios]
8.4 30282 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
8.0 25189 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
7.0 25192 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
5.0 25195 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
5.0 25187 nagios /usr/local/nagios/libexec/check_nt -H ILREVAPP3.JDADELIVERS.COM -p 1248 -s NScl1kd8sz87 -v USEDDISKSPACE -l c -w 80 -c 90
3.5 25186 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
2.3 25085 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
1.7 25082 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
1.6 25086 nagios /usr/local/nagios/libexec/check_ping -H ILSRGAPP1.JDADELIVERS.COM -w 200,80% -c 300,90% -p 5
1.5 25061 nagios [nagios]
1.2 25083 nagios /usr/local/nagios/libexec/check_ping -H ILREVWEB6.JDADELIVERS.COM -w 200,80% -c 300,90% -p 5
1.0 25193 nagios [nagios]
1.0 25190 nagios [nagios]
0.1 24389 root /usr/bin/perl /usr/local/JDA/bin/icmp_monitor_mod
0.0 993 ssadm -bash
0.0 989 ssadm sshd: ssadm@pts/0
0.0 985 root sshd: ssadm [priv]
0.0 8248 apache /usr/sbin/httpd
0.0 615 root /sbin/udevd -d
0.0 5 root [khelper]
0.0 582 root [kauditd]
0.0 556 root [kjournald]
0.0 539 root [ksnapd]
Here, I can see multiple nagios daemons running. Only the main nagios daemon runs permenantly and the other daemons only stays for few seconds.
Is this normal? And why the Utilization gets so high? When I restart nagios, the utilization becomes normal, but it builds up gradually, and reaches 90% after around 24 hours.
My Cpu info:
[root@msmontest01 ~]# cat /proc/cpuinfo
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 26
model name : Intel(R) Xeon(R) CPU X7560 @ 2.27GHz
stepping : 4
cpu MHz : 2394.000
cache size : 24576 KB
fpu : yes
fpu_exception : yes
cpuid level : 11
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss syscall nx rdtscp lm constant_tsc up ida nonstop_tsc pni cx16 popcnt lahf_lm
bogomips : 4788.00
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management: [8]
Thanks in advance.
Regards,
Ajmal
I have a nagios server installed monitoring aroung 90 hosts and around 500 services.
The CPU utilization always comes around 90%, and most of the CPU load is taken by nagios processes:
See the output below:
[root@msmontest01 ~]# ps -eo pcpu,pid,user,args | sort -k 1 -r | head -25
%CPU PID USER COMMAND
9.0 25194 nagios [nagios]
8.4 30282 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
8.0 25189 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
7.0 25192 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
5.0 25195 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
5.0 25187 nagios /usr/local/nagios/libexec/check_nt -H ILREVAPP3.JDADELIVERS.COM -p 1248 -s NScl1kd8sz87 -v USEDDISKSPACE -l c -w 80 -c 90
3.5 25186 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
2.3 25085 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
1.7 25082 nagios /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
1.6 25086 nagios /usr/local/nagios/libexec/check_ping -H ILSRGAPP1.JDADELIVERS.COM -w 200,80% -c 300,90% -p 5
1.5 25061 nagios [nagios]
1.2 25083 nagios /usr/local/nagios/libexec/check_ping -H ILREVWEB6.JDADELIVERS.COM -w 200,80% -c 300,90% -p 5
1.0 25193 nagios [nagios]
1.0 25190 nagios [nagios]
0.1 24389 root /usr/bin/perl /usr/local/JDA/bin/icmp_monitor_mod
0.0 993 ssadm -bash
0.0 989 ssadm sshd: ssadm@pts/0
0.0 985 root sshd: ssadm [priv]
0.0 8248 apache /usr/sbin/httpd
0.0 615 root /sbin/udevd -d
0.0 5 root [khelper]
0.0 582 root [kauditd]
0.0 556 root [kjournald]
0.0 539 root [ksnapd]
Here, I can see multiple nagios daemons running. Only the main nagios daemon runs permenantly and the other daemons only stays for few seconds.
Is this normal? And why the Utilization gets so high? When I restart nagios, the utilization becomes normal, but it builds up gradually, and reaches 90% after around 24 hours.
My Cpu info:
[root@msmontest01 ~]# cat /proc/cpuinfo
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 26
model name : Intel(R) Xeon(R) CPU X7560 @ 2.27GHz
stepping : 4
cpu MHz : 2394.000
cache size : 24576 KB
fpu : yes
fpu_exception : yes
cpuid level : 11
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss syscall nx rdtscp lm constant_tsc up ida nonstop_tsc pni cx16 popcnt lahf_lm
bogomips : 4788.00
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management: [8]
Thanks in advance.
Regards,
Ajmal