Page 1 of 1

Cannot Start nrpe daemon on aix after TL update

Posted: Mon Mar 31, 2014 8:15 am
by islesfan1
I cannot GET THE NRPE DAEMON TO START IN aix after a AIX TL upgrade. Did a force install on all RPM's even though they were installed, also reinstalled Nagios XI agent for AIX and it fails during the startsrc -s nrpe command. I have attached the install logs to get assistance in troubleshooting. I cannot start the nrpe daemon.
I get this in the error log. we are running AIX Tl 6100-09 sp1.
It worked on 6100-08.

---------------------------------------------------------------------------

Code: Select all

LABEL:          SRC_TRYX
IDENTIFIER:     1BA7DF4E

Date/Time:       Mon Mar 31 09:02:25 2014
Sequence Number: 14406
Machine Id:      00F6F8F14C00
Node Id:         egateqa01-cdc
Class:           S
Type:            PERM
WPAR:            Global
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        DETERMINE WHY SUBSYSTEM CANNOT RESTART

Detail Data
SYMPTOM CODE
         512
SOFTWARE ERROR CODE
       -9020
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'369'
FAILING MODULE
nrpe
---------------------------------------------------------------------------
LABEL:          SRC_RSTRT
IDENTIFIER:     CB4A951F

Date/Time:       Mon Mar 31 09:02:25 2014
Sequence Number: 14405
Machine Id:      00F6F8F14C00
Node Id:         egateqa01-cdc
Class:           S
Type:            INFO
WPAR:            Global
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        VERIFY SUBSYSTEM RESTARTED AUTOMATICALLY

Detail Data
SYMPTOM CODE
         512
SOFTWARE ERROR CODE
       -9035
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'234'
FAILING MODULE
nrpe
---------------------------------------------------------------------------
LABEL:          SRC_RSTRT
IDENTIFIER:     CB4A951F

Date/Time:       Mon Mar 31 09:02:25 2014
Sequence Number: 14404
Machine Id:      00F6F8F14C00
Node Id:         egateqa01-cdc
Class:           S
Type:            INFO
WPAR:            Global
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        VERIFY SUBSYSTEM RESTARTED AUTOMATICALLY

Detail Data
SYMPTOM CODE
         512
SOFTWARE ERROR CODE
       -9035
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'234'
FAILING MODULE
nrpe
---------------------------------------------------------------------------
LABEL:          SRC_TRYX
IDENTIFIER:     1BA7DF4E

Date/Time:       Mon Mar 31 08:57:01 2014
Sequence Number: 14403
Machine Id:      00F6F8F14C00
Node Id:         egateqa01-cdc
Class:           S
Type:            PERM
WPAR:            Global
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        DETERMINE WHY SUBSYSTEM CANNOT RESTART

Detail Data
SYMPTOM CODE
         512
SOFTWARE ERROR CODE
       -9020
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'369'
FAILING MODULE
nrpe
---------------------------------------------------------------------------
LABEL:          SRC_RSTRT
IDENTIFIER:     CB4A951F

Date/Time:       Mon Mar 31 08:57:01 2014
Sequence Number: 14402
Machine Id:      00F6F8F14C00
Node Id:         egateqa01-cdc
Class:           S
Type:            INFO
WPAR:            Global
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        VERIFY SUBSYSTEM RESTARTED AUTOMATICALLY

Detail Data
SYMPTOM CODE
         512
SOFTWARE ERROR CODE
       -9035
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'234'
FAILING MODULE
nrpe
---------------------------------------------------------------------------

Re: Cannot Start nrpe daemon on aix after TL update

Posted: Mon Mar 31, 2014 4:16 pm
by abrist
Well, the logs indicate that nrpe installed and restarted successfully. When you resintsalled nrpe, did you use the same source directory as before? I ask, because a reinstall of nrpe requires the removal of any installed.* files. Additionally, are you installing from source, or from a binary package/tarball?

Re: Cannot Start nrpe daemon on aix after TL update

Posted: Wed Apr 02, 2014 11:35 am
by islesfan1
HI,

I got the agent to start after ...
Once the following was commented from /etc/rc.tcpip…

# Start up the snmpmibd daemon
# start /usr/sbin/snmpmibd "$src_running"

# Start up the aixmibd daemon
# start /usr/sbin/aixmibd "$src_running"

We were able to start our Nagios XI nrpe daemon.

How do these two processes not running impact the server overall and will it cause an issue if we reboot as we have set them to not start.

Re: Cannot Start nrpe daemon on aix after TL update

Posted: Wed Apr 02, 2014 5:18 pm
by slansing
Well, that is more of an AIX sysadmin question and not a Nagios one, but you should be able to follow what is outlined here for the agent to auto-start those as well:

http://assets.nagios.com/downloads/nagi ... Nagios.pdf

In other words, a standard service/process start as you would configure on any system.