No logs in dashboard - Logstash crashing?
Posted: Tue Nov 17, 2015 5:22 pm
Hi there,
I have NLS running on 2 nodes. It has been working well for quite a while now and has been very much set and forget. One of our users recently told me there was nothing in the dashboard, and it turns out they were right. If I restart elasticsearch and logstash, I get about a minute's worth of data, then nothing. The nodes are identical CentOS installs (installed from source following the guides on nagios.com) running on ESX 5.5. I have given each of them 4 cores and 4 GB of RAM. I suspect where I have gone wrong is in not telling logstash that more RAM is available. Here is what I saw in /var/log/messages after restarting logstash on one of the nodes:
Nov 17 13:50:19 hector logstash: Nov 17, 2015 1:50:19 PM org.elasticsearch.transport.netty.NettyTransport exceptionCaught
Nov 17 13:50:19 hector logstash: WARNING: [d81b097a-4eb8-4223-8ead-08131a78d2fa] exception caught on transport layer [[id: 0x41228af7, /127.0.0.1:33608 :> localhost/127.0.0.1:9300]], closing connection
Nov 17 13:50:19 hector logstash: java.io.StreamCorruptedException: invalid internal transport message format, got (5a,56,1,1)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.transport.netty.SizeHeaderFrameDecoder.decode(SizeHeaderFrameDecoder.java:47)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.handler.codec.frame.FrameDecoder.callDecode(FrameDecoder.java:425)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.handler.codec.frame.FrameDecoder.cleanup(FrameDecoder.java:482)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.handler.codec.frame.FrameDecoder.channelDisconnected(FrameDecoder.java:365)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.channel.SimpleChannelUpstreamHandler.handleUpstream(SimpleChannelUpstreamHandler.java:102)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:564)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:559)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.channel.Channels.fireChannelDisconnected(Channels.java:396)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.channel.Channels$4.run(Channels.java:386)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.channel.socket.ChannelRunnableWrapper.run(ChannelRunnableWrapper.java:40)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.channel.socket.nio.AbstractNioSelector.processTaskQueue(AbstractNioSelector.java:391)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.channel.socket.nio.AbstractNioSelector.run(AbstractNioSelector.java:315)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.channel.socket.nio.AbstractNioWorker.run(AbstractNioWorker.java:89)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.channel.socket.nio.NioWorker.run(NioWorker.java:178)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.util.ThreadRenamingRunnable.run(ThreadRenamingRunnable.java:108)
Nov 17 13:50:19 hector logstash: at org.elasticsearch.common.netty.util.internal.DeadLockProofWorker$1.run(DeadLockProofWorker.java:42)
Nov 17 13:50:19 hector logstash: at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
Nov 17 13:50:19 hector logstash: at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
Nov 17 13:50:19 hector logstash: at java.lang.Thread.run(Thread.java:745)
Nov 17 13:50:22 hector logstash: Error: Your application used more memory than the safety cap of 500M.
Nov 17 13:50:22 hector logstash: Specify -J-Xmx####m to increase it (#### = cap size in MB).
Nov 17 13:50:22 hector logstash: Specify -w for full OutOfMemoryError stack trace
Nov 17 20:10:01 hector systemd: Failed to mark scope session-817.scope as abandoned : Stale file handle
Nov 17 20:50:03 hector systemd: Failed to mark scope session-899.scope as abandoned : Stale file handle
The part that concerns me is the "safety cap of 500M" error near the end of the output. If I have 4 GB of RAM available, how high should I set the safety cap? Also, where it says "Specify -J-Xmx####m to increase it (#### = cap size in MB)", where do I specify this? I've had a look at a few other forum posts but can't find the instructions on giving logstash more RAM that I am looking for.
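For anyone hitting the same message, here is a rough sketch of where the cap is usually raised. It assumes a Logstash 1.x install whose startup script reads an LS_HEAP_SIZE variable from /etc/sysconfig/logstash and turns it into the JVM -Xmx setting. Both the file location and the 1024m value are assumptions for illustration, not something confirmed for this particular install.

```shell
# /etc/sysconfig/logstash
# Assumed location on CentOS; your install may keep this file elsewhere.
# Assumption: the Logstash 1.x startup script reads LS_HEAP_SIZE and
# passes it to the JVM as the maximum heap (-Xmx), replacing the 500M default.
# 1024m is an illustrative value only; Elasticsearch on the same 4 GB node
# needs its own heap, so do not hand all of the RAM to logstash.
LS_HEAP_SIZE="1024m"
```

The new value would only take effect after logstash is restarted, and it is worth checking /var/log/messages again afterwards to see whether the "safety cap" error comes back.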