Cannot Start nrpe daemon on aix after TL update

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
islesfan1
Posts: 9
Joined: Thu Jun 20, 2013 7:30 am

Cannot Start nrpe daemon on aix after TL update

Post by islesfan1 »

I cannot GET THE NRPE DAEMON TO START IN aix after a AIX TL upgrade. Did a force install on all RPM's even though they were installed, also reinstalled Nagios XI agent for AIX and it fails during the startsrc -s nrpe command. I have attached the install logs to get assistance in troubleshooting. I cannot start the nrpe daemon.
I get this in the error log. we are running AIX Tl 6100-09 sp1.
It worked on 6100-08.

---------------------------------------------------------------------------

Code: Select all

LABEL:          SRC_TRYX
IDENTIFIER:     1BA7DF4E

Date/Time:       Mon Mar 31 09:02:25 2014
Sequence Number: 14406
Machine Id:      00F6F8F14C00
Node Id:         egateqa01-cdc
Class:           S
Type:            PERM
WPAR:            Global
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        DETERMINE WHY SUBSYSTEM CANNOT RESTART

Detail Data
SYMPTOM CODE
         512
SOFTWARE ERROR CODE
       -9020
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'369'
FAILING MODULE
nrpe
---------------------------------------------------------------------------
LABEL:          SRC_RSTRT
IDENTIFIER:     CB4A951F

Date/Time:       Mon Mar 31 09:02:25 2014
Sequence Number: 14405
Machine Id:      00F6F8F14C00
Node Id:         egateqa01-cdc
Class:           S
Type:            INFO
WPAR:            Global
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        VERIFY SUBSYSTEM RESTARTED AUTOMATICALLY

Detail Data
SYMPTOM CODE
         512
SOFTWARE ERROR CODE
       -9035
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'234'
FAILING MODULE
nrpe
---------------------------------------------------------------------------
LABEL:          SRC_RSTRT
IDENTIFIER:     CB4A951F

Date/Time:       Mon Mar 31 09:02:25 2014
Sequence Number: 14404
Machine Id:      00F6F8F14C00
Node Id:         egateqa01-cdc
Class:           S
Type:            INFO
WPAR:            Global
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        VERIFY SUBSYSTEM RESTARTED AUTOMATICALLY

Detail Data
SYMPTOM CODE
         512
SOFTWARE ERROR CODE
       -9035
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'234'
FAILING MODULE
nrpe
---------------------------------------------------------------------------
LABEL:          SRC_TRYX
IDENTIFIER:     1BA7DF4E

Date/Time:       Mon Mar 31 08:57:01 2014
Sequence Number: 14403
Machine Id:      00F6F8F14C00
Node Id:         egateqa01-cdc
Class:           S
Type:            PERM
WPAR:            Global
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        DETERMINE WHY SUBSYSTEM CANNOT RESTART

Detail Data
SYMPTOM CODE
         512
SOFTWARE ERROR CODE
       -9020
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'369'
FAILING MODULE
nrpe
---------------------------------------------------------------------------
LABEL:          SRC_RSTRT
IDENTIFIER:     CB4A951F

Date/Time:       Mon Mar 31 08:57:01 2014
Sequence Number: 14402
Machine Id:      00F6F8F14C00
Node Id:         egateqa01-cdc
Class:           S
Type:            INFO
WPAR:            Global
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        VERIFY SUBSYSTEM RESTARTED AUTOMATICALLY

Detail Data
SYMPTOM CODE
         512
SOFTWARE ERROR CODE
       -9035
ERROR CODE
           0
DETECTING MODULE
'srchevn.c'@line:'234'
FAILING MODULE
nrpe
---------------------------------------------------------------------------
You do not have the required permissions to view the files attached to this post.
Mark Koutsos
Senior UNIX System Administrator
360 PARK AVE S, 12TH FLOOR, ROOM 12-022
NEW YORK, NY 10010
EMAIL: [email protected]
DESK (646) 524-0543
CELL (347) 852-0916
FAX (646) 524-0101
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Cannot Start nrpe daemon on aix after TL update

Post by abrist »

Well, the logs indicate that nrpe installed and restarted successfully. When you resintsalled nrpe, did you use the same source directory as before? I ask, because a reinstall of nrpe requires the removal of any installed.* files. Additionally, are you installing from source, or from a binary package/tarball?
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
islesfan1
Posts: 9
Joined: Thu Jun 20, 2013 7:30 am

Re: Cannot Start nrpe daemon on aix after TL update

Post by islesfan1 »

HI,

I got the agent to start after ...
Once the following was commented from /etc/rc.tcpip…

# Start up the snmpmibd daemon
# start /usr/sbin/snmpmibd "$src_running"

# Start up the aixmibd daemon
# start /usr/sbin/aixmibd "$src_running"

We were able to start our Nagios XI nrpe daemon.

How do these two processes not running impact the server overall and will it cause an issue if we reboot as we have set them to not start.
Mark Koutsos
Senior UNIX System Administrator
360 PARK AVE S, 12TH FLOOR, ROOM 12-022
NEW YORK, NY 10010
EMAIL: [email protected]
DESK (646) 524-0543
CELL (347) 852-0916
FAX (646) 524-0101
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Cannot Start nrpe daemon on aix after TL update

Post by slansing »

Well, that is more of an AIX sysadmin question and not a Nagios one, but you should be able to follow what is outlined here for the agent to auto-start those as well:

http://assets.nagios.com/downloads/nagi ... Nagios.pdf

In other words, a standard service/process start as you would configure on any system.
Locked