Page 1 of 2

Error while addition of instance in a cluster

Posted: Wed Sep 15, 2021 5:50 am
by SuryanuSanyal
Hi Team,
Whenever I am trying to add a new cluster, I am getting the below error

could not generate node_uuid - check file permissions

Kindly help.

Also is it possible to add instances later after configuration is done?

Re: Error while addition of instance in a cluster

Posted: Wed Sep 15, 2021 2:25 pm
by gsmith
HI,

What are the output of these commands (run as root)?

Code: Select all

chage -l nagios
ls -ld /usr/local/nagioslogserver/var
ls -l /usr/local/nagioslogserver/var
grep nag /etc/group

Thank you

Re: Error while addition of instance in a cluster

Posted: Mon Sep 20, 2021 3:15 am
by SuryanuSanyal
Added screenshot of output post executing commands as root. Kindly assist.

Re: Error while addition of instance in a cluster

Posted: Mon Sep 20, 2021 2:23 pm
by ssax
Those permissions look proper. Those were run on the new node, correct?

What version of Log Server did you install on the new instance?

What is the OS/version of the new instance?

Code: Select all

uname -a
cat /etc/*release
Try doing this:
- NOTE: To be clear, this is assuming you are running these commands on a new node that was never connected to a cluster. If that is not the case, don't do this

Code: Select all

mv /usr/local/nagioslogserver/var/node_uuid /usr/local/nagioslogserver/var/node_uuid.old
systemctl restart elasticsearch
Then try connecting the node again.

Re: Error while addition of instance in a cluster

Posted: Fri Sep 24, 2021 8:57 am
by SuryanuSanyal
Hi

I have attached the version details

After that I have executed the following commands in the single primary instance
[root@<MachineName> /]# mv /usr/local/nagioslogserver/var/node_uuid /usr/local/nagioslogserver/var/node_uuid.old
[root@<MachineName> /]# systemctl restart elasticsearch

Still getting the following error while adding a new instance
Could not generate node_uuid - check file permissions

Re: Error while addition of instance in a cluster

Posted: Fri Sep 24, 2021 4:42 pm
by ssax
I do not see anything attached, make sure to click the Add the file button after selecting the file.

What version of Log Server did you install on the new instance?

What is the OS/version of the new instance?

Code: Select all

uname -a
cat /etc/*release
Just to make sure I'm understanding this properly, you are not trying to add an instance to the same node, correct? This is a new node you spun up, freshly installed Nagios Log Server on, and then tried to connect to the existing cluster, right?

Please try connecting the new node again and then right after attach or PM me the XXXXX-XXXXXX-XXXX-XXXX-XXXX.log file from this directory:

Code: Select all

/var/log/elasticsearch
Are you see any error messages in these files? (if so, please send them)

Code: Select all

/var/log/messages
/var/log/httpd/error_log
/var/log/httpd/ssl_error_log
/usr/local/nagioslogserver/var/auditlog.log
/usr/local/nagioslogserver/var/jobs.log
/usr/local/nagioslogserver/var/poller.log

Re: Error while addition of instance in a cluster

Posted: Thu Sep 30, 2021 9:47 am
by SuryanuSanyal
All the below commands or paths have been executed/ are in reference with the new instance which has not been connected to the primary instance

/var/log/messages: Attached
/var/log/httpd/error_log: Attached
/var/log/httpd/ssl_error_log: File not present
/usr/local/nagioslogserver/var/auditlog.log: No data present
/usr/local/nagioslogserver/var/jobs.log: No data present
/usr/local/nagioslogserver/var/poller.log: No data present

For version
[root@<NewInstanceName> /]# uname -a
Linux <NewInstanceName> 4.18.0-305.12.1.el8_4.x86_64 #1 SMP Mon Jul 26 08:06:24 EDT 2021 x86_64 x86_64 x86_64 GNU/Linux

[root@<NewInstanceName> /]# cat /etc/*release
NAME="Red Hat Enterprise Linux"
VERSION="8.4 (Ootpa)"
ID="rhel"
ID_LIKE="fedora"
VERSION_ID="8.4"
PLATFORM_ID="platform:el8"
PRETTY_NAME="Red Hat Enterprise Linux 8.4 (Ootpa)"
ANSI_COLOR="0;31"
CPE_NAME="cpe:/o:redhat:enterprise_linux:8.4:GA"
HOME_URL="https://www.redhat.com/"
DOCUMENTATION_URL="https://access.redhat.com/documentation ... e_linux/8/"
BUG_REPORT_URL="https://bugzilla.redhat.com/"

REDHAT_BUGZILLA_PRODUCT="Red Hat Enterprise Linux 8"
REDHAT_BUGZILLA_PRODUCT_VERSION=8.4
REDHAT_SUPPORT_PRODUCT="Red Hat Enterprise Linux"
REDHAT_SUPPORT_PRODUCT_VERSION="8.4"
Red Hat Enterprise Linux release 8.4 (Ootpa)
Red Hat Enterprise Linux release 8.4 (Ootpa)

[root@<NewInstanceName> /]# mv /usr/local/nagioslogserver/var/node_uuid /usr/local/nagioslogserver/var/node_uuid.old

[root@<NewInstanceName> /]# systemctl restart elasticsearch

[root@<NewInstanceName> /]# cd /var/log/elasticsearch

[root@<NewInstanceName> elasticsearch]# ls -ltrh
total 8.0K
-rw-r--r-- 1 nagios users 0 Sep 10 21:14 df5c2d92-82bb-45d8-866d-2aafef776b69_index_indexing_slowlog.log
-rw-r--r-- 1 nagios users 0 Sep 10 21:14 df5c2d92-82bb-45d8-866d-2aafef776b69_index_search_slowlog.log
-rw-r--r-- 1 nagios users 795 Sep 10 22:00 df5c2d92-82bb-45d8-866d-2aafef776b69.log.1.gz
-rw-r--r-- 1 nagios users 2.4K Sep 29 10:12 df5c2d92-82bb-45d8-866d-2aafef776b69.log

Re: Error while addition of instance in a cluster

Posted: Fri Oct 01, 2021 10:12 am
by gsmith
Hi

Looking back over the posts in this thread I noticed:
Image2.jpg
The step in the yellow asks you to run the commands:

Code: Select all

mv /usr/local/nagioslogserver/var/node_uuid /usr/local/nagioslogserver/var/node_uuid.old
systemctl restart elasticsearch
on the new server/node, but it looks like you ran them on the original (existing) Log Server node
based on what is circled in RED

Please run the commands:

Code: Select all

mv /usr/local/nagioslogserver/var/node_uuid /usr/local/nagioslogserver/var/node_uuid.old
systemctl restart elasticsearch
on the machine that is trying to join as a second node.

Thanks

Re: Error while addition of instance in a cluster

Posted: Tue Oct 05, 2021 11:19 am
by SuryanuSanyal
Hi Team,

In my previous comment I have run the commands in the new instance itself. Please double check once. Also attached screenshot for reference.

Re: Error while addition of instance in a cluster

Posted: Wed Oct 06, 2021 9:36 am
by gsmith
Hi,

On a command line on the new node, which I believe is 10.255.36.65, run the following:

Code: Select all

nmap -p 9300 10.255.36.66
and send me the results

Next run:

Code: Select all

tar czf /tmp/support6oct.tar.gz /usr/local/nagioslogserver/var/*
Save the support6oct.tar.gz file and share the file in a private message,
then reply to this post to bring it up in the queue.

Thanks