Hi,
This is the state of play today, after I spent all weekend tweaking the kernel settings:
1. Nagios Core is running and working without errors in the event logs.
2. Nagios XI: ndo2db is stopped. When it starts, it runs for about 20 minutes, then the kernel is exhausted and both XI and Core crash. I then have to stop ndo2db and run service nagios restart, after which Nagios Core starts working again.
I have left ndo2db stopped since 04:30 today and Nagios Core has not crashed. The only additional issue is that the following errors are now appearing in /var/log/messages:
Code:
Jun 9 10:25:40 qualngs rrdcached[17046]: queue_thread_main: rrd_update_r (/usr/local/nagios/share/perfdata/invu.centerprise.co.uk/#Process_CPU_Consumption.rrd) failed with status -1. (/usr/local/nagios/share/perfdata/invu.centerprise.co.uk/#Process_CPU_Consumption.rrd: expected 58 data source readings (got 57) from 1402305008)
Jun 9 10:25:41 qualngs rrdcached[17046]: queue_thread_main: rrd_update_r (/usr/local/nagios/share/perfdata/qls-data.local.qualitas-it.net/#Process_CPU_Consumption.rrd) failed with status -1. (/usr/local/nagios/share/perfdata/qls-data.local.qualitas-it.net/#Process_CPU_Consumption.rrd: found extra data on update argument: 100.0)
Jun 9 10:26:21 qualngs rrdcached[17046]: queue_thread_main: rrd_update_r (/usr/local/nagios/share/perfdata/qls-sql1.local.qualitas-it.net/#Process_CPU_Consumption.rrd) failed with status -1. (/usr/local/nagios/share/perfdata/qls-sql1.local.qualitas-it.net/#Process_CPU_Consumption.rrd: expected 82 data source readings (got 80) from 1402305030)
Jun 9 10:26:22 qualngs rrdcached[17046]: queue_thread_main: rrd_update_r (/usr/local/nagios/share/perfdata/bcotdfs06.bcotac.local/#Process_CPU_Consumption.rrd) failed with status -1. (/usr/local/nagios/share/perfdata/bcotdfs06.bcotac.local/#Process_CPU_Consumption.rrd: expected 78 data source readings (got 71) from 1402305064)
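To see at a glance which RRD files are affected and how far off the counts are, something like the following could summarise those lines. This is only a rough sketch: the function name rrd_mismatches is made up for illustration, and it assumes the rrdcached message format stays exactly as in the lines above. It deliberately ignores the "found extra data" variant.

```shell
# Hypothetical helper: summarise rrdcached "expected N data source readings (got M)" errors.
# Reads syslog lines on stdin and prints one line per match: <rrd file> <expected> <got>
rrd_mismatches() {
  sed -n 's|.*(\(/[^:]*\.rrd\): expected \([0-9][0-9]*\) data source readings (got \([0-9][0-9]*\)).*|\1 \2 \3|p'
}

# Possible usage (path taken from the post above):
#   grep rrdcached /var/log/messages | rrd_mismatches | sort -u
```

The "expected" figure could then be compared against what rrdtool info reports for the same file, to check whether the RRD or the incoming perfdata has changed shape.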
This is the current output from sysctl -p:
Code:
net.ipv4.ip_forward = 0
net.ipv4.conf.default.rp_filter = 1
net.ipv4.conf.default.accept_source_route = 0
kernel.sysrq = 0
kernel.core_uses_pid = 1
net.ipv4.tcp_syncookies = 1
error: "net.bridge.bridge-nf-call-ip6tables" is an unknown key
error: "net.bridge.bridge-nf-call-iptables" is an unknown key
error: "net.bridge.bridge-nf-call-arptables" is an unknown key
kernel.msgmnb = 262144000
kernel.msgmax = 262144000
kernel.shmmax = 4294967295
kernel.shmall = 268435456
kernel.msgmni = 512000
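Since ndo2db passes its data through the kernel message queues that the kernel.msg* values above control, it may help to watch how full those queues get in the 20 minutes before the crash. A minimal sketch, assuming the usual util-linux column layout of ipcs -q (key, msqid, owner, perms, used-bytes, messages); the function name msgq_usage is hypothetical:

```shell
# Hypothetical helper: total up the "used-bytes" and "messages" columns of `ipcs -q`.
# Reads ipcs output on stdin; only rows whose first field is a hex key (0x...) are counted,
# so the banner and header lines are skipped.
msgq_usage() {
  awk '$1 ~ /^0x/ { bytes += $5; msgs += $6; queues++ }
       END { printf "queues=%d used_bytes=%d messages=%d\n", queues, bytes, msgs }'
}

# Possible usage while ndo2db is running:
#   while true; do date; ipcs -q | msgq_usage; sleep 60; done
```

If used_bytes climbs steadily towards kernel.msgmnb, that would suggest ndo2db is not draining the queue fast enough rather than the limits being too low.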
Code:
[root@qualngs ~]# service nagios status
nagios (pid 60730) is running...
[root@qualngs ~]# service ndo2db status
ndo2db (pid 25309) is running...
[root@qualngs ~]# service mysqld status
mysqld (pid 1989) is running...
[root@qualngs ~]# service postgresql status
postmaster (pid 2038) is running...
[root@qualngs ~]# service crond status
crond (pid 2165) is running...
[root@qualngs ~]# service httpd status
httpd (pid 2157) is running...
Output of tail /var/lib/pgsql/data/pg_log/postgresql-Mon.log:
Code:
ERROR: relation "xi_notifications" does not exist
STATEMENT: VACUUM ANALYZE xi_notifications;
ERROR: relation "xi_notifications" does not exist
STATEMENT: VACUUM ANALYZE xi_notifications;
ERROR: relation "xi_notifications" does not exist
STATEMENT: VACUUM ANALYZE xi_notifications;
ERROR: relation "xi_notifications" does not exist
STATEMENT: VACUUM ANALYZE xi_notifications;
ERROR: relation "xi_notifications" does not exist
STATEMENT: VACUUM ANALYZE xi_notifications;