replication issues

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
ravish78
Posts: 269
Joined: Wed Mar 14, 2012 9:50 am

replication issues

Post by ravish78 »

Team,
Our data replication is not working fine from slave to master.We are getting below errors in log messages in slave server.
Can you please look into this.

00007fffffffb610 error 6 in send_nsca[400000+5000]
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 041: Jun 17 04:16:48 usa0300lv846 abrt[7548]: Not saving repeating crash in '/usr/local/nagios/libexec/send_nsca'
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 042: Jun 17 04:16:49 usa0300lv846 kernel: send_nsca[7630]: segfault at 7ffffffff000 ip 0000000000401c48 sp 00007fffffffb610 error 6 in send_nsca[400000+5000]
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 043: Jun 17 04:16:49 usa0300lv846 abrt[7930]: Not saving repeating crash in '/usr/local/nagios/libexec/send_nsca'
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 044: Jun 17 04:17:02 usa0300lv846 kernel: send_nsca[8526]: segfault at 7ffffffff000 ip 0000000000401c48 sp 00007fffffffb610 error 6 in send_nsca[400000+5000]
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 045: Jun 17 04:17:02 usa0300lv846 abrt[8527]: Saved core dump of pid 8526 (/usr/local/nagios/libexec/send_nsca) to /var/spool/abrt/ccpp-2014-06-17-04:17:02-8526 (380928 bytes)
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 046: Jun 17 04:17:02 usa0300lv846 abrtd: Executable '/usr/local/nagios/libexec/send_nsca' doesn't belong to any package and ProcessUnpackaged is set to 'no'
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 047: Jun 17 04:17:02 usa0300lv846 abrt[8553]: Not saving repeating crash in '/usr/local/nagios/libexec/send_nsca'
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 048: Jun 17 04:17:02 usa0300lv846 kernel: send_nsca[8552]: segfault at 7ffffffff000 ip 0000000000401c48 sp 00007fffffffb610 error 6 in send_nsca[400000+5000]
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: replication issues

Post by scottwilkerson »

How did you set this up?

It looks like send_nsca is segfaulting, but this could have any number of causes, could you attach the core dump from /var/spool/abrt/ccpp-2014-06-17-04:17:02-8526 ?

Code: Select all

Saved core dump of pid 8526 (/usr/local/nagios/libexec/send_nsca) to /var/spool/abrt/ccpp-2014-06-17-04:17:02-8526 (380928 bytes)
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
ravish78
Posts: 269
Joined: Wed Mar 14, 2012 9:50 am

Re: replication issues

Post by ravish78 »

It is not showing anything with that file name

tmproot@usa0300lv846>ls -lrt /var/spool/abrt/ccpp-2014-06-17-04:17:02-8526
ls: cannot access /var/spool/abrt/ccpp-2014-06-17-04:17:02-8526: No such file or directory

tmproot@usa0300lv846>ls -lart
total 16
-rw-r--r--. 1 root root 0 Mar 31 04:21 abrt-db
drwxr-xr-x. 13 root root 4096 Mar 31 04:22 ..
-rw------- 1 root root 12 Jun 11 10:09 last-via-server
-rw------- 1 root root 35 Jun 17 15:14 last-ccpp
drwxr-xr-x. 2 abrt abrt 4096 Jun 17 15:14 .
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: replication issues

Post by slansing »

It looks like it could possibly be contained in "last-ccpp", though I'd think it would be the file name that you saw in the log. Is that all you have in (/var/spool/abrt/) ?
ravish78
Posts: 269
Joined: Wed Mar 14, 2012 9:50 am

Re: replication issues

Post by ravish78 »

Yes only those 3 files are visible in that path.

tmproot@usa0300lv846>ls -lrt
total 8
-rw-r--r--. 1 root root 0 Mar 31 04:21 abrt-db
-rw------- 1 root root 12 Jun 11 10:09 last-via-server
-rw------- 1 root root 35 Jun 18 13:43 last-ccpp
tmproot@usa0300lv846>
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: replication issues

Post by tmcdonald »

Go ahead and attach the last-ccpp
Former Nagios employee
ravish78
Posts: 269
Joined: Wed Mar 14, 2012 9:50 am

Re: replication issues

Post by ravish78 »

tmproot@usa0300lv846>cat last-ccpp
/usr/local/nagios/libexec/send_nsca
tmproot@usa0300lv846>


Can we open an incident as it is effecting production environment
Last edited by sreinhardt on Thu Jun 19, 2014 2:05 pm, edited 1 time in total.
Reason: Locked due to internal ticket being opened.
Locked