
Re: Cannot access Cluster Status page after 2.1.0 update

Posted: Thu Oct 03, 2019 3:32 pm
by scottwilkerson
Also as for the /nlsrepcc permissions issue, can you show the output of

Code: Select all

grep nagios /etc/passwd /etc/group
We want to make sure they match up with what we need on pages 5 and 6 of this doc:
https://assets.nagios.com/downloads/nag ... ations.pdf

Re: Cannot access Cluster Status page after 2.1.0 update

Posted: Thu Oct 03, 2019 3:37 pm
by rferebee
Right on! That fixed that problem.

Now, on to the snapshot problem. My available storage keeps decreasing. Is that because it can't offload the log data to the repository?

Re: Cannot access Cluster Status page after 2.1.0 update

Posted: Thu Oct 03, 2019 3:42 pm
by rferebee
Here's what I've got.

LSCC2

Code: Select all

root@nagioslscc2:/var/log> grep nagios /etc/passwd /etc/group
/etc/passwd:nagios:x:996:994::/var/spool/nagios:/sbin/nologin
/etc/group:nagios:x:994:nrpe,nagios,apache
/etc/group:apache:x:48:nagios
/etc/group:nagcmd:x:3031:nagios

Code: Select all

//10.128.207.113/NLSREPCC /nlsrepcc cifs rw,username=nagios,password=password,uid=996,gid=994,file_mode=0770,dir_mode=0770 0 0
LSCC1

Code: Select all

/etc/passwd:nagios:x:987:982::/var/spool/nagios:/bin/bash
/etc/group:nagios:x:982:apache,nrpe,nagios
/etc/group:apache:x:48:nagios
/etc/group:nagcmd:x:3031:nagios

Code: Select all

//10.128.207.113/NLSREPCC /nlsrepcc cifs rw,username=nagios,password=password,uid=987,gid=982,file_mode=0770,dir_mode=0770 0 0
LSCC3

Code: Select all

/etc/passwd:nagios:x:3033:100::/home/nagios:/bin/bash
/etc/group:apache:x:48:nagios
/etc/group:nagios:x:3033:nagios,apache
/etc/group:nagcmd:x:3034:nagios

Code: Select all

//10.128.207.113/NLSREPCC /nlsrepcc cifs rw,username=nagios,password=password,uid=3033,gid=100,file_mode=0770,dir_mode=0770 0 0

Re: Cannot access Cluster Status page after 2.1.0 update

Posted: Thu Oct 03, 2019 3:52 pm
by scottwilkerson
On LSCC3 you have this

Code: Select all

//10.128.207.113/NLSREPCC /nlsrepcc cifs rw,username=nagios,password=password,uid=3033,gid=100,file_mode=0770,dir_mode=0770 0 0
but it should be this

Code: Select all

//10.128.207.113/NLSREPCC /nlsrepcc cifs rw,username=nagios,password=password,uid=3033,gid=3033,file_mode=0770,dir_mode=0770 0 0
as 3033 is the gid for the nagios group

Code: Select all

/etc/group:nagios:x:3033:nagios,apache
Make that change and then remount the drive
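
The same comparison can be scripted so it is easy to repeat on every node. This is only a rough sketch: the function name check_mount_ids and its arguments are made up for illustration, and it assumes the share is mounted from a single fstab entry with uid= and gid= options, as in the lines posted above. It reads the nagios UID from the passwd file and the nagios GID from the group file, then compares them with the fstab options.

```shell
#!/bin/sh
# Hypothetical helper (not part of Nagios Log Server).
# Usage: check_mount_ids MOUNTPOINT [PASSWD_FILE] [GROUP_FILE] [FSTAB_FILE]
check_mount_ids() {
    mountpoint=$1
    passwd_file=${2:-/etc/passwd}
    group_file=${3:-/etc/group}
    fstab_file=${4:-/etc/fstab}

    # Field 3 of the passwd entry is the UID; field 3 of the group entry is the GID.
    want_uid=$(awk -F: '$1 == "nagios" { print $3; exit }' "$passwd_file")
    want_gid=$(awk -F: '$1 == "nagios" { print $3; exit }' "$group_file")

    # Field 4 of the matching fstab entry holds the comma-separated mount options.
    opts=$(awk -v mp="$mountpoint" '$1 !~ /^#/ && $2 == mp { print $4; exit }' "$fstab_file")
    have_uid=$(printf '%s\n' "$opts" | tr ',' '\n' | sed -n 's/^uid=//p')
    have_gid=$(printf '%s\n' "$opts" | tr ',' '\n' | sed -n 's/^gid=//p')

    status=0
    if [ "$have_uid" != "$want_uid" ]; then
        echo "uid mismatch: fstab has ${have_uid:-none}, nagios user is ${want_uid:-unknown}"
        status=1
    fi
    if [ "$have_gid" != "$want_gid" ]; then
        echo "gid mismatch: fstab has ${have_gid:-none}, nagios group is ${want_gid:-unknown}"
        status=1
    fi
    if [ "$status" -eq 0 ]; then
        echo "uid and gid match"
    fi
    return "$status"
}
```

Running check_mount_ids /nlsrepcc as root on each node should report a gid mismatch for an entry like the original LSCC3 line and a match once gid=3033 is in place.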

Re: Cannot access Cluster Status page after 2.1.0 update

Posted: Thu Oct 03, 2019 4:00 pm
by rferebee
Ok, I made the change and remounted the share.

Is there any way I can test this without having to run a snapshot? It's been over two weeks since we've taken one and it's going to take forever to finish.

Re: Cannot access Cluster Status page after 2.1.0 update

Posted: Thu Oct 03, 2019 4:17 pm
by scottwilkerson
The only thing I can think of is testing write permissions from each node as the nagios user

from LSCC1

Code: Select all

su nagios -c 'touch /nlsrepcc/test_LSCC1'
from LSCC2

Code: Select all

su nagios -c 'touch /nlsrepcc/test_LSCC2'
from LSCC3

Code: Select all

su nagios -c 'touch /nlsrepcc/test_LSCC3'
Then from any node

Code: Select all

ls -al /nlsrepcc/test_LSCC*
The owner of all 3 files should be nagios.

Then clean it up

Code: Select all

rm -f /nlsrepcc/test_LSCC*
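
If you would rather not eyeball the ls output, the owner check can be scripted and run before the cleanup step. A minimal sketch, assuming GNU stat (the -c '%U' format is not available on every platform); check_owner is a made-up helper name, not an existing command.

```shell
#!/bin/sh
# Hypothetical helper: report whether FILE is owned by EXPECTED_USER.
# Usage: check_owner FILE EXPECTED_USER
check_owner() {
    # GNU stat prints the owner's user name for the %U format.
    owner=$(stat -c '%U' "$1") || return 2
    if [ "$owner" = "$2" ]; then
        echo "$1: owned by $owner"
    else
        echo "$1: owned by $owner, expected $2"
        return 1
    fi
}
```

For the test files above, something like for f in /nlsrepcc/test_LSCC*; do check_owner "$f" nagios; done should print one "owned by nagios" line per node.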

Re: Cannot access Cluster Status page after 2.1.0 update

Posted: Thu Oct 03, 2019 4:29 pm
by rferebee
Ok, all of that worked, minus the fact that somehow the nagios account password wasn't set correctly on LSCC2. Not sure how the heck that changed.

I'll let you folks know tomorrow whether or not the snapshot ran overnight.

Thanks for your continued support!

Re: Cannot access Cluster Status page after 2.1.0 update

Posted: Thu Oct 03, 2019 4:39 pm
by scottwilkerson
Sounds good!

Re: Cannot access Cluster Status page after 2.1.0 update

Posted: Fri Oct 04, 2019 9:18 am
by rferebee
Good morning. Unfortunately, it doesn't look like the snapshot ran last night despite the command subsystem saying it did.

I'm sort of at a loss here. Everything seems to be configured correctly, but the snapshots don't even look like they're trying to run. Is there an error log specifically for snapshots? Or, anything else I can look at?

We're going on 3 weeks now that we haven't taken a snapshot. I'd like to get one ASAP now that I have the storage available to do it.

Thank you!

Re: Cannot access Cluster Status page after 2.1.0 update

Posted: Fri Oct 04, 2019 9:22 am
by scottwilkerson
If you go to Admin -> Command Subsystem, what do you see in the line for snapshots_maintenance?

Also, what do you see in Admin -> Audit Log -> Type=BACKUP ?