Page 1 of 1

NCPA RPM-PPC AIX agent failing to start up

Posted: Tue Dec 05, 2017 2:51 pm
by krobertson71
We are trying to install the latest version of the AIX agent on AIX 7.1. We keep getting the following error when trying to start everything up.

Code: Select all

Installed on **redactedhostname** server.
While starting “ncpa_listerer” subsystem, see that same error we saw initially:
 
LABEL:          SRC_SVKO
IDENTIFIER:     BC3BE5A3
 
Date/Time:       Tue Dec  5 10:17:44 EST 2017
Sequence Number: 160691
Machine Id:      00F7C0E74C00
Node Id:         **RedactedHostName**
Class:           S
Type:            PERM
WPAR:            Global
Resource Name:   SRC
 
Description
SOFTWARE PROGRAM ERROR
 
Probable Causes
APPLICATION PROGRAM
 
Failure Causes
SOFTWARE PROGRAM
 
        Recommended Actions
        MANUALLY RESTART SUBSYSTEM IF NEEDED
 
Detail Data
SYMPTOM CODE
       65280
SOFTWARE ERROR CODE
       -9017
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'376'
FAILING MODULE
ncpa_listener
 
The same is true for “ncpa_passive” subsystem.

Re: NCPA RPM-PPC AIX agent failing to start up

Posted: Tue Dec 05, 2017 5:46 pm
by tgriep
Can you post the steps you are taking to start the Agents?
You have to pass the library location like the following examples to start the agents.

Code: Select all

startsrc -e LIBPATH=/usr/local/ncpa -s ncpa_listener
startsrc -e LIBPATH=/usr/local/ncpa -s ncpa_passive
Try that and post back your results.

Re: NCPA RPM-PPC AIX agent failing to start up

Posted: Tue Dec 05, 2017 7:35 pm
by krobertson71
Here is what the Unix admin used.


startsrc –s ncpa_listener
startsrc –s ncpa_passive”


I will have him try what you supplied and post results here.

Re: NCPA RPM-PPC AIX agent failing to start up

Posted: Tue Dec 05, 2017 10:08 pm
by krobertson71
The Unix admin used the command as you described. Still having the same issue. Since this is an RPM the /usr/local/ncpa/ path is there as that it how it was built.

Here is the information I received from the Unix admin..

Code: Select all

Restarted as requested – same error:
 
root@RedactedHostName:[/] # startsrc -e LIBPATH=/usr/local/ncpa -s ncpa_listener
0513-059 The ncpa_listener Subsystem has been started. Subsystem PID is 29753486.
 
root@RedactedHostName:[/] # lssrc -s ncpa_listener
Subsystem         Group            PID          Status
ncpa_listener                                  inoperative
 
root@redactedhostname:[/] # errpt
IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
BC3BE5A3   1205200317 P S SRC            SOFTWARE PROGRAM ERROR
 
root@redactedhostname:[/] # errpt -a
LABEL:          SRC_SVKO
IDENTIFIER:     BC3BE5A3
 
Date/Time:       Tue Dec  5 20:03:20 EST 2017
Sequence Number: 160693
Machine Id:      00F7C0E74C00
Node Id:         redactedhostname
Class:           S
Type:            PERM
WPAR:            Global
Resource Name:   SRC
 
Description
SOFTWARE PROGRAM ERROR
 
Probable Causes
APPLICATION PROGRAM
 
Failure Causes
SOFTWARE PROGRAM
 
        Recommended Actions
        MANUALLY RESTART SUBSYSTEM IF NEEDED
 
Detail Data
SYMPTOM CODE
         256
SOFTWARE ERROR CODE
       -9017
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'376'
FAILING MODULE
ncpa_listener

Here is the contents of the /usr/local/ncpa/ directory so you can see if something is missing.

Code: Select all

The content of “/usr/local/nrpa” is listed below:
 
-rw-r--r--    1 nagios   nagios        11249 Nov 09 13:16 _bisect.so
-rw-r--r--    1 nagios   nagios       273126 Nov 09 13:16 _cffi_backend.so
-rw-r--r--    1 nagios   nagios       138530 Nov 09 13:16 _codecs_cn.so
-rw-r--r--    1 nagios   nagios       150745 Nov 09 13:16 _codecs_hk.so
-rw-r--r--    1 nagios   nagios        32080 Nov 09 13:16 _codecs_iso2022.so
-rw-r--r--    1 nagios   nagios       239563 Nov 09 13:16 _codecs_jp.so
-rw-r--r--    1 nagios   nagios       129170 Nov 09 13:16 _codecs_kr.so
-rw-r--r--    1 nagios   nagios       102004 Nov 09 13:16 _codecs_tw.so
-rw-r--r--    1 nagios   nagios        53179 Nov 09 13:16 _collections.so
-rw-r--r--    1 nagios   nagios       229599 Nov 09 13:16 _ctypes.so
-rw-r--r--    1 nagios   nagios        19798 Nov 09 13:16 _functools.so
-rw-r--r--    1 nagios   nagios        36081 Nov 09 13:16 _hashlib.so
-rw-r--r--    1 nagios   nagios        24048 Nov 09 13:16 _heapq.so
-rw-r--r--    1 nagios   nagios       264124 Nov 09 13:16 _io.so
-rw-r--r--    1 nagios   nagios        60533 Nov 09 13:16 _json.so
-rw-r--r--    1 nagios   nagios        30814 Nov 09 13:16 _locale.so
-rw-r--r--    1 nagios   nagios        52138 Nov 09 13:16 _multibytecodec.so
-rw-r--r--    1 nagios   nagios        20528 Nov 09 13:16 _random.so
-rw-r--r--    1 nagios   nagios       132384 Nov 09 13:16 _socket.so
-rw-r--r--    1 nagios   nagios       152007 Nov 09 13:16 _sqlite3.so
-rw-r--r--    1 nagios   nagios       169106 Nov 09 13:16 _ssl.so
-rw-r--r--    1 nagios   nagios        65096 Nov 09 13:16 _struct.so
-rw-r--r--    1 nagios   nagios        74975 Nov 09 13:16 array.so
-rw-r--r--    1 nagios   nagios        28665 Nov 09 13:16 binascii.so
drwxr-xr-x    2 nagios   nagios          256 Dec 05 10:16 build_resources
-rw-r--r--    1 nagios   nagios        59724 Nov 09 13:16 bz2.so
-rw-r--r--    1 nagios   nagios       149532 Nov 09 13:16 cPickle.so
-rw-r--r--    1 nagios   nagios        29417 Nov 09 13:16 cStringIO.so
-rw-r--r--    1 nagios   nagios         7717 Nov 09 13:16 cryptography.hazmat.bindings._constant_time.so
-rw-r--r--    1 nagios   nagios      2057754 Nov 09 13:16 cryptography.hazmat.bindings._openssl.so
-rw-r--r--    1 nagios   nagios       149715 Nov 09 13:16 datetime.so
drwxr-xr-x    3 nagios   nagios          256 Dec 05 10:16 etc
-rw-r--r--    1 nagios   nagios        19160 Nov 09 13:16 fcntl.so
-rw-r--r--    1 nagios   nagios       207082 Nov 09 13:16 gevent._semaphore.so
-rw-r--r--    1 nagios   nagios        39892 Nov 09 13:16 gevent._socket3.py
-rw-r--r--    1 nagios   nagios       810266 Nov 09 13:16 gevent.libev.corecext.so
-rw-r--r--    1 nagios   nagios        49776 Nov 09 13:16 greenlet.so
-rw-r--r--    1 nagios   nagios        13101 Nov 09 13:16 grp.so
-rw-r--r--    1 nagios   nagios        93835 Nov 09 13:16 itertools.so
-rw-r--r--    1 nagios   nagios         7765 Nov 09 13:16 jinja2.asyncsupport.py
-rw-r--r--    1 nagios   nagios     30537213 Nov 09 13:16 libcrypto.a
-rw-r--r--    1 nagios   nagios      3341074 Nov 09 13:16 libcrypto.so
-rw-r--r--    1 nagios   nagios       250592 Nov 09 13:16 libffi.a
-rw-r--r--    1 nagios   nagios      1044082 Nov 09 13:16 libgcc_s.a
-rw-r--r--    1 nagios   nagios      3009655 Nov 09 13:16 libpython2.7.so
-rw-r--r--    1 nagios   nagios      3502369 Nov 09 13:16 library.zip
-rw-r--r--    1 nagios   nagios      1946327 Nov 09 13:16 libsqlite3.a
-rw-r--r--    1 nagios   nagios       700634 Nov 09 13:16 libssl.so
drwxr-xr-x    4 nagios   nagios          256 Dec 05 10:16 listener
-rw-r--r--    1 nagios   nagios       243774 Nov 09 13:16 math.so
-rwxr-xr-x    1 nagios   nagios       750066 Nov 09 13:16 ncpa_listener
-rwxr-xr-x    1 nagios   nagios       750066 Nov 09 13:16 ncpa_passive
-rw-r--r--    1 nagios   nagios        70616 Nov 09 13:16 operator.so
drwxr-xr-x    2 nagios   nagios          256 Dec 05 10:16 plugins
-rw-r--r--    1 nagios   nagios        61837 Nov 09 13:16 psutil._psutil_aix.so
-rw-r--r--    1 nagios   nagios        21996 Nov 09 13:16 psutil._psutil_posix.so
-rw-r--r--    1 nagios   nagios        88419 Nov 09 13:16 pyexpat.so
-rw-r--r--    1 nagios   nagios        47246 Nov 09 13:16 readline.so
-rw-r--r--    1 nagios   nagios        29541 Nov 09 13:16 select.so
-rw-r--r--    1 nagios   nagios        41085 Nov 09 13:16 strop.so
-rw-r--r--    1 nagios   nagios        26736 Nov 09 13:16 termios.so
-rw-r--r--    1 nagios   nagios        46649 Nov 09 13:16 time.so
-rw-r--r--    1 nagios   nagios       701901 Nov 09 13:16 unicodedata.so
drwxr-xr-x    4 nagios   nagios          256 Dec 05 10:16 var
-rw-r--r--    1 nagios   nagios        36227 Nov 09 13:16 zlib.so

Re: NCPA RPM-PPC AIX agent failing to start up

Posted: Wed Dec 06, 2017 3:15 pm
by dwhitfield
We don't have AIX on which to test and the agent was built using 7.2. I'm not sure the process in upgrading from 7.1 to 7.2, but is that possible? If not, we'll probably need to set up a time to remote in. If you want to do that, you'll need to submit a ticket at https://support.nagios.com/tickets/

Have you considered NRPE instead?

Re: NCPA RPM-PPC AIX agent failing to start up

Posted: Wed Dec 06, 2017 4:02 pm
by krobertson71
Thank you we may end up doing that.

We think now we may be having connection issues on SSH. NRPE installed fine but we were unable to make a check work and that may also be related to the SSH issue.

Leave this open for another day and I will update this thread.

Re: NCPA RPM-PPC AIX agent failing to start up

Posted: Wed Dec 06, 2017 4:36 pm
by dwhitfield
I'm not being pedantic, because we do have checks that run over ssh, but NCPA and NRPE generally don't (I think never) run over ssh, but ssl. Do you mean ssl?

Did you compile NRPE? If so, what version? Did you compile with ssl enabled? You mention NRPE in your last post, but until then you were talking about NCPA. Did you mean NRPE in the last post? We shouldn't trouble-shoot an NRPE issue in an NCPA thread. I just mentioned it as an alterntive, as NRPE has been around longer, and thus it is often easier to find answers to questions about it.