Hi Team,
After we apply a configuration, the monitoring engine stops, and external commands stop automatically as well. As a result, checks do not run and Nagios appears to be on standby. We are currently monitoring approximately 1300 hosts and 6000 services.
Please find attached the error screenshot and the Nagios system profile.
Monitoring engine not working
-
wiproltdwiv
- Posts: 281
- Joined: Sat Sep 08, 2012 12:52 am
Monitoring engine not working
Re: Monitoring engine not working
wiproltdwiv,
Is there any error message when you apply the configuration?
-Yancy
-
wiproltdwiv
- Posts: 281
- Joined: Sat Sep 08, 2012 12:52 am
Re: Monitoring engine not working
No, it shows that the configuration was applied successfully.
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Monitoring engine not working
Can you post your nagios.cfg?
Also, at what frequency are you monitoring the 1300 hosts and 6000 services?
What are the specs of your server (CPUs, RAM)?
-
wiproltdwiv
- Posts: 281
- Joined: Sat Sep 08, 2012 12:52 am
Re: Monitoring engine not working
We have assigned a check interval of 10 to 15 to all hosts and services. I have attached nagios.cfg, and the hardware details are below.
[root@EMSNagios1 etc]# cat /proc/cpuinfo |more
processor : 23
vendor_id : GenuineIntel
cpu family : 6
model : 44
model name : Intel(R) Xeon(R) CPU E5645 @ 2.40GHz
stepping : 2
cpu MHz : 1600.000
cache size : 12288 KB
physical id : 1
siblings : 12
core id : 10
cpu cores : 6
apicid : 53
initial apicid : 53
fpu : yes
fpu_exception : yes
cpuid level : 11
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good xtopology nonstop_tsc aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 cx16 xtpr pdcm dca sse4_1 sse4_2 popcnt aes lahf_lm ida arat epb dts tpr_shadow vnmi flexpriority ept vpid
bogomips : 4799.89
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management:
[root@EMSNagios1 etc]# uname -a
Linux EMSNagios1.co-opbank.co.in 2.6.32-279.2.1.el6.x86_64 #1 SMP Thu Jul 5 21:08:58 EDT 2012 x86_64 x86_64 x86_64 GNU/Linux
[root@EMSNagios1 etc]# free -g
             total       used       free     shared    buffers     cached
Mem:            31          6         25          0          0          1
-/+ buffers/cache:          4         26
Swap:           15          0         15
[root@EMSNagios1 etc]#
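As a rough sanity check of the load these figures imply, the average check rate can be estimated. This is only a sketch under stated assumptions: the check interval of 10 to 15 is taken to be in minutes (which holds if `interval_length` in nagios.cfg is left at its default of 60 seconds), every host and service is assumed to be actively checked exactly once per interval, and the function name is purely illustrative.

```python
def checks_per_second(hosts, services, interval_minutes):
    """Average active checks per second.

    Assumes each host and each service is checked exactly once per
    interval, and that the interval is expressed in minutes.
    """
    total_checks = hosts + services
    return total_checks / (interval_minutes * 60)


if __name__ == "__main__":
    # Figures from this thread: 1300 hosts, 6000 services,
    # check interval between 10 and 15.
    for interval in (10, 15):
        rate = checks_per_second(1300, 6000, interval)
        print(f"{interval}-minute interval: about {rate:.1f} checks/second")
```

Under those assumptions the engine would need to run roughly 8 to 12 checks per second on average; retries and shorter retry intervals would push the real figure higher.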
Re: Monitoring engine not working
wiproltdwiv,
Are you able to roll back to a working snapshot?
Core Config Manager->Configuration Snapshots
Thanks,
-Yancy
-
wiproltdwiv
- Posts: 281
- Joined: Sat Sep 08, 2012 12:52 am
Re: Monitoring engine not working
I can, but I am not seeing any snapshots marked with an error; all of them show as successful.
Re: Monitoring engine not working
It seems like your system could be overtaxed, either by disk I/O or by CPU load. The following document might be worth a read.
http://assets.nagios.com/downloads/nagi ... rmance.pdf
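One quick way to test the CPU side of that guess is to compare the load average with the number of logical CPUs. The snippet below is a minimal sketch, not something taken from the linked document: the helper names and the 1.0-per-CPU threshold are assumptions (a common rule of thumb), and it says nothing about disk I/O, which would need a separate look.

```python
import os


def load_per_cpu(load_average, cpu_count):
    """Load average divided by the number of logical CPUs."""
    if cpu_count < 1:
        raise ValueError("cpu_count must be at least 1")
    return load_average / cpu_count


def looks_overloaded(load_average, cpu_count, threshold=1.0):
    """True if the per-CPU load exceeds the threshold.

    The default threshold of 1.0 is a rule of thumb, not a Nagios setting.
    """
    return load_per_cpu(load_average, cpu_count) > threshold


if __name__ == "__main__":
    # os.getloadavg() returns the 1-, 5- and 15-minute load averages (Unix only).
    one_min, five_min, fifteen_min = os.getloadavg()
    cpus = os.cpu_count() or 1  # cpu_count() can return None
    print(f"logical CPUs: {cpus}")
    print(f"5-minute load per CPU: {load_per_cpu(five_min, cpus):.2f}")
    print(f"looks overloaded: {looks_overloaded(five_min, cpus)}")
```

Running it shortly after applying the configuration, while the engine is misbehaving, would give a more useful reading than running it when the box is idle.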
-
wiproltdwiv
- Posts: 281
- Joined: Sat Sep 08, 2012 12:52 am
Re: Monitoring engine not working
I went through the above document, but given our hardware configuration and the attached system status, I don't think we have that much load. Please check and suggest what we need to change.
-
slansing
- Posts: 7698
- Joined: Mon Apr 23, 2012 4:28 pm
- Location: Travelling through time and space...
Re: Monitoring engine not working
Have you tried rolling back to a previous snapshot, regardless of whether it is stamped with an error or is clean?