replication issues

Posted: Tue Jun 17, 2014 10:06 am
by ravish78
Team,
Our data replication from slave to master is not working properly. We are getting the errors below in the log messages on the slave server.
Can you please look into this?

00007fffffffb610 error 6 in send_nsca[400000+5000]
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 041: Jun 17 04:16:48 usa0300lv846 abrt[7548]: Not saving repeating crash in '/usr/local/nagios/libexec/send_nsca'
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 042: Jun 17 04:16:49 usa0300lv846 kernel: send_nsca[7630]: segfault at 7ffffffff000 ip 0000000000401c48 sp 00007fffffffb610 error 6 in send_nsca[400000+5000]
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 043: Jun 17 04:16:49 usa0300lv846 abrt[7930]: Not saving repeating crash in '/usr/local/nagios/libexec/send_nsca'
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 044: Jun 17 04:17:02 usa0300lv846 kernel: send_nsca[8526]: segfault at 7ffffffff000 ip 0000000000401c48 sp 00007fffffffb610 error 6 in send_nsca[400000+5000]
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 045: Jun 17 04:17:02 usa0300lv846 abrt[8527]: Saved core dump of pid 8526 (/usr/local/nagios/libexec/send_nsca) to /var/spool/abrt/ccpp-2014-06-17-04:17:02-8526 (380928 bytes)
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 046: Jun 17 04:17:02 usa0300lv846 abrtd: Executable '/usr/local/nagios/libexec/send_nsca' doesn't belong to any package and ProcessUnpackaged is set to 'no'
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 047: Jun 17 04:17:02 usa0300lv846 abrt[8553]: Not saving repeating crash in '/usr/local/nagios/libexec/send_nsca'
Jun 17 11:00:33 usa0300lv846 rootsh[07093]: tmproot: 048: Jun 17 04:17:02 usa0300lv846 kernel: send_nsca[8552]: segfault at 7ffffffff000 ip 0000000000401c48 sp 00007fffffffb610 error 6 in send_nsca[400000+5000]

Re: replication issues

Posted: Tue Jun 17, 2014 1:43 pm
by scottwilkerson
How did you set this up?

It looks like send_nsca is segfaulting, but this could have any number of causes. Could you attach the core dump from /var/spool/abrt/ccpp-2014-06-17-04:17:02-8526?

Saved core dump of pid 8526 (/usr/local/nagios/libexec/send_nsca) to /var/spool/abrt/ccpp-2014-06-17-04:17:02-8526 (380928 bytes)
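
Even without the core dump, the kernel "segfault at" lines carry some detail. A minimal sketch in Python of how one of those lines could be unpacked, assuming the standard x86 page-fault error-code bits (bit 0: page was present, bit 1: write access, bit 2: user mode). The helper name `decode_segfault` and the dictionary keys are hypothetical, chosen only for illustration:

```python
import re

# Matches the kernel "segfault at ..." message format seen in the posted log.
SEGFAULT_RE = re.compile(
    r"(?P<prog>\S+)\[(?P<pid>\d+)\]: segfault at (?P<addr>[0-9a-f]+) "
    r"ip (?P<ip>[0-9a-f]+) sp (?P<sp>[0-9a-f]+) error (?P<err>\d+) "
    r"in (?P<obj>\S+)\[(?P<base>[0-9a-f]+)\+(?P<size>[0-9a-f]+)\]"
)

def decode_segfault(line):
    """Return a dict describing a kernel segfault line, or None if no match."""
    m = SEGFAULT_RE.search(line)
    if m is None:
        return None
    err = int(m.group("err"))
    ip = int(m.group("ip"), 16)
    base = int(m.group("base"), 16)
    return {
        "program": m.group("prog"),
        "pid": int(m.group("pid")),
        "fault_address": int(m.group("addr"), 16),
        # Instruction pointer relative to the load address of the mapped object.
        "offset_in_object": ip - base,
        # Assumed x86 error-code bit meanings (see lead-in).
        "page_present": bool(err & 1),
        "access": "write" if err & 2 else "read",
        "mode": "user" if err & 4 else "kernel",
    }
```

Applied to the lines in the first post, this would describe a user-mode write to an unmapped page at offset 0x1c48 inside send_nsca. That narrows down where the crash happens, but not why, so the core dump is still the more useful artifact.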

Re: replication issues

Posted: Tue Jun 17, 2014 2:16 pm
by ravish78
Nothing shows up under that file name:

tmproot@usa0300lv846>ls -lrt /var/spool/abrt/ccpp-2014-06-17-04:17:02-8526
ls: cannot access /var/spool/abrt/ccpp-2014-06-17-04:17:02-8526: No such file or directory

tmproot@usa0300lv846>ls -lart
total 16
-rw-r--r--. 1 root root 0 Mar 31 04:21 abrt-db
drwxr-xr-x. 13 root root 4096 Mar 31 04:22 ..
-rw------- 1 root root 12 Jun 11 10:09 last-via-server
-rw------- 1 root root 35 Jun 17 15:14 last-ccpp
drwxr-xr-x. 2 abrt abrt 4096 Jun 17 15:14 .

Re: replication issues

Posted: Wed Jun 18, 2014 9:26 am
by slansing
It looks like it could possibly be contained in "last-ccpp", though I'd expect it to be under the file name that you saw in the log. Is that all you have in /var/spool/abrt/?
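
One possible reading of the log, offered as an assumption rather than a confirmed diagnosis: the abrtd line saying the executable "doesn't belong to any package and ProcessUnpackaged is set to 'no'" suggests ABRT declined to process the crash, which would be consistent with the dump directory being absent. A rough Python sketch that tallies the abrt messages in a log excerpt; the function name `summarize_abrt` and the category labels are hypothetical:

```python
from collections import Counter

# Substrings copied from the abrt messages quoted in the first post.
ABRT_MARKERS = {
    "Saved core dump": "dump_saved",
    "Not saving repeating crash": "repeat_skipped",
    "ProcessUnpackaged is set to 'no'": "unpackaged_rejected",
}

def summarize_abrt(lines):
    """Count how often each known abrt message appears in the given lines."""
    counts = Counter()
    for line in lines:
        for marker, label in ABRT_MARKERS.items():
            if marker in line:
                counts[label] += 1
    return counts
```

If "unpackaged_rejected" shows up alongside "dump_saved", it may be worth checking the ABRT configuration on that host before expecting a dump directory to persist.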

Re: replication issues

Posted: Wed Jun 18, 2014 12:45 pm
by ravish78
Yes, only those 3 files are visible in that path.

tmproot@usa0300lv846>ls -lrt
total 8
-rw-r--r--. 1 root root 0 Mar 31 04:21 abrt-db
-rw------- 1 root root 12 Jun 11 10:09 last-via-server
-rw------- 1 root root 35 Jun 18 13:43 last-ccpp
tmproot@usa0300lv846>

Re: replication issues

Posted: Wed Jun 18, 2014 4:50 pm
by tmcdonald
Go ahead and attach the last-ccpp file.

Re: replication issues

Posted: Thu Jun 19, 2014 9:15 am
by ravish78
tmproot@usa0300lv846>cat last-ccpp
/usr/local/nagios/libexec/send_nsca
tmproot@usa0300lv846>


Can we open an incident, as this is affecting the production environment?