Trying to figure out why logstash changed to active (exited)

This support forum board is for support questions relating to Nagios Log Server, our solution for managing and monitoring critical log data.
rferebee
Posts: 733
Joined: Wed Jul 11, 2018 11:37 am

Trying to figure out why logstash changed to active (exited)

Post by rferebee »

Good morning,

The logstash service on one of my Log Server nodes randomly changed status from active (running) to active (exited) this morning. Looking at the logstash log file, I don't see any event that explains why this occurred. We had an issue on 11/9 that caused the snapshot for that evening to stall out and never complete, but I got that resolved this morning and everything seemed fine after. Then at about 10:15 AM the logstash service status flipped.

Can I send someone the log files to take a look at? I would love to know why this happened.

Thank you.
User avatar
mbellerue
Posts: 1403
Joined: Fri Jul 12, 2019 11:10 am

Re: Trying to figure out why logstash changed to active (exi

Post by mbellerue »

Certainly, go ahead and send the system profiles to me, and I will get them to the team for review.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
rferebee
Posts: 733
Joined: Wed Jul 11, 2018 11:37 am

Re: Trying to figure out why logstash changed to active (exi

Post by rferebee »

This same behavior occurred this morning. Any ideas as to what might be causing it?
User avatar
mbellerue
Posts: 1403
Joined: Fri Jul 12, 2019 11:10 am

Re: Trying to figure out why logstash changed to active (exi

Post by mbellerue »

Nothing is jumping out at me immediately. On the server that should be primary, could you run dmesg and send me the output? I'm wondering if something is getting logged there that could give us a clue.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
rferebee
Posts: 733
Joined: Wed Jul 11, 2018 11:37 am

Re: Trying to figure out why logstash changed to active (exi

Post by rferebee »

I had to PM you because the output was too many characters.
User avatar
mbellerue
Posts: 1403
Joined: Fri Jul 12, 2019 11:10 am

Re: Trying to figure out why logstash changed to active (exi

Post by mbellerue »

There are a lot of messages for CIFS in there. Are you mounting a file system via Samba? If so, what's it for, and does the other Log Server instance have the same mount? I'm not sure that's the problem, but it definitely stands out.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
rferebee
Posts: 733
Joined: Wed Jul 11, 2018 11:37 am

Re: Trying to figure out why logstash changed to active (exi

Post by rferebee »

Yes. We mount our Log Repository via that CIFS share. Each node has the same corresponding mount command in it's fstab config file.

Our Log Repository is a dedicated NAS device, so it must be mounted as a network drive would be.

This is the mount command we use:

Code: Select all

# CIFS Mount
//10.128.xxx.xxx/NLSREPCC /nlsrepcc cifs rw,username=********,password=********,uid=996,gid=994,file_mode=0770,dir_mode=0770 0 0
The only difference between each node are the UID and GID.
rferebee
Posts: 733
Joined: Wed Jul 11, 2018 11:37 am

Re: Trying to figure out why logstash changed to active (exi

Post by rferebee »

This happened again this morning. I'm curious to know why it's only happening on 1 node. It can't be someone overloading the system with a search that's too large otherwise the whole environment would be down. It's just the logstash service on 1 node.
User avatar
mbellerue
Posts: 1403
Joined: Fri Jul 12, 2019 11:10 am

Re: Trying to figure out why logstash changed to active (exi

Post by mbellerue »

Have you changed any settings in logstash recently?

Can you setup a check on the Log Server that is failing to monitor the number of TCP connections it has? Ideally once per minute would be best.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
rferebee
Posts: 733
Joined: Wed Jul 11, 2018 11:37 am

Re: Trying to figure out why logstash changed to active (exi

Post by rferebee »

We have not made any changes to logstash.

Would you like me to monitor the TCP connections on Log Server as a whole or on each individual node?
Locked