Page 1 of 1

XI System Component Status: Failing

Posted: Tue Oct 09, 2018 12:25 pm
by vmwareguy
For whatever reason all of my services past Database Backend show a bad status. I have no idea where to even begin to trouble shoot this. I can't apply any new configurations. I ran a repair on the database and that didn't help. I'm running version 5.5.1

Any thoughts?

Thanks

Re: XI System Component Status: Failing

Posted: Tue Oct 09, 2018 12:30 pm
by scottwilkerson
All of these depend on crond running.

What is the output of the following:

Code: Select all

service crond status
chage -l nagios

Re: XI System Component Status: Failing

Posted: Tue Oct 09, 2018 12:31 pm
by vmwareguy
I see a bunch of nagios expired passwords
pam_unix(crond:account): account nagios has expired (failed to change password)

Re: XI System Component Status: Failing

Posted: Tue Oct 09, 2018 12:32 pm
by vmwareguy
is it as simple as changing the nagios password ?

Re: XI System Component Status: Failing

Posted: Tue Oct 09, 2018 12:34 pm
by scottwilkerson
This should do it

Code: Select all

chage -I -1 -m 0 -M 99999 -E -1 nagios

Re: XI System Component Status: Failing

Posted: Tue Oct 09, 2018 12:37 pm
by vmwareguy
scottwilkerson wrote:This should do it

Code: Select all

chage -I -1 -m 0 -M 99999 -E -1 nagios
This did it!! Thank you. What exactly did this change?

Re: XI System Component Status: Failing

Posted: Tue Oct 09, 2018 12:38 pm
by scottwilkerson
vmwareguy wrote:This did it!! Thank you. What exactly did this change?
It set the password for the nagios user to not expire for 99999 days.
Minimum Password Age to 0
Maximum Password Age to 99999
Password Inactive to -1
Account Expiration Date to -1

Re: XI System Component Status: Failing

Posted: Wed Oct 10, 2018 11:34 am
by vmwareguy
So its working somewhat better but after about an hour I have two services showing errors now.

Re: XI System Component Status: Failing

Posted: Wed Oct 10, 2018 12:13 pm
by scottwilkerson
you may need to remove 2 lock file

Code: Select all

rm -f /usr/local/nagiosxi/var/dbmaint.lock
rm -f /usr/local/nagiosxi/var/event_handler.lock
Then they should kick off within about 5-10 minutes.