Problem index log every day

This support forum board is for support questions relating to Nagios Log Server, our solution for managing and monitoring critical log data.
chris_Espoir
Posts: 19
Joined: Tue Dec 02, 2014 12:43 pm

Problem index log every day

Post by chris_Espoir »

Hello everybody,

I am using nagioslog in cluster with two instances (64-bit ova)

Every morning, my Dashboard is empty and I must restart servers for rebuild index. On the web interface, service are UP then when I refresh page they are DOWN…

On ssh I got this :
[root@nagioslog ~]# service logstash status
Log stash Daemon dead but pid file exists

To resolve problem temporarily, I have created crontab which restart my servers every 5 hours. That seems resolve my problem, but I don’t think is the best solution…

Thanks for your help


PS:
For info, actually there are 72 hosts and 6,000,000 Docs per days.
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Problem index log every day

Post by tmcdonald »

What sort of CPU, memory, and disk space are available on the nodes? Are you hitting limits or high usage on any of those metrics?
Former Nagios employee
chris_Espoir
Posts: 19
Joined: Tue Dec 02, 2014 12:43 pm

Re: Problem index log every day

Post by chris_Espoir »

Hello,

VM nagioslog are working on ESX Intel Xeon 2 GHz, using 4vCPU, 4 Go & 1 file ".vmdk" of 100 Go (actually 50G used) for both.

Before, VM had 1 vCPU, since I add 3 vCPU, I don't hit limits.


Thanks for your help
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Problem index log every day

Post by tmcdonald »

Are you saying you increased the limits and the problem went away? Or that you weren't hitting limits before and you still aren't now but the problem persists?
Former Nagios employee
chris_Espoir
Posts: 19
Joined: Tue Dec 02, 2014 12:43 pm

Re: Problem index log every day

Post by chris_Espoir »

Sorry for my english (I’m French :P)

Each morning, when I saw my Dashboard was empty, the web interface was very slow and CPU was 100%.
Then I add 3vCPU, and each morning the same problem (Dashboard empty, web interface slow) but CPU consume 157% of 400%

But the problem persists, except when I restart server.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Problem index log every day

Post by scottwilkerson »

How much memory does this server have?
Does it have fast disks installed (e.g. SSD's)?
How much data per day are you sending this server?
Is this just a single server cluster, or do you have multiple machines sharing the workload?
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
chris_Espoir
Posts: 19
Joined: Tue Dec 02, 2014 12:43 pm

Re: Problem index log every day

Post by chris_Espoir »

How much memory does this server have?
4GB of memory
Does it have fast disks installed (e.g. SSD's)?
No fast disk, only 7200 RPM
How much data per day are you sending this server?
We are sending between 6,100,000 & 6,800,000 Documents per day
In Primary Size it's between 2.8GB & 3,1GB
Is this just a single server cluster, or do you have multiple machines sharing the workload?
We have a cluster with 2 instances, with DNS round robin.
All instances have same capacity (memory, Disk, vCPU).
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Problem index log every day

Post by tmcdonald »

Make sure each server in the cluster is running the "elasticsearch" and "logstash" services:

Code: Select all

service elasticsearch status
service logstash status
If either of those are not running, please start them and wait a few minutes.
Former Nagios employee
chris_Espoir
Posts: 19
Joined: Tue Dec 02, 2014 12:43 pm

Re: Problem index log every day

Post by chris_Espoir »

tmcdonald wrote:Make sure each server in the cluster is running the "elasticsearch" and "logstash" services:

Code: Select all

service elasticsearch status
service logstash status
If either of those are not running, please start them and wait a few minutes.


When I execute command status is OK.
I monitor these service with Nagios XI.

If I don't restart servers nagioslog, total process will increases (500 process in 3 days), 100% of 400% CPU use by Java & and web interface very slow, but service elastic search and logstash are Ok with command service elasticsearch status.


I hope you understand my problem :/
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Problem index log every day

Post by scottwilkerson »

chris_Espoir wrote:total process will increases (500 process in 3 days),
Would it be possible for us to get a list of the processes running when this happens?

Code: Select all

ps -ef
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
Locked