Alerting for heartbeat messages

This support forum board is for support questions relating to Nagios Log Server, our solution for managing and monitoring critical log data.
Locked
Jklre
Posts: 163
Joined: Wed May 28, 2014 1:56 pm

Alerting for heartbeat messages

Post by Jklre »

I haven't found the ability to create an alert if a heartbeat message has not been received. We have several processes that send out these heartbeats and we want to know if they are not responding. Is this possible? Thanks in advance
jolson
Attack Rabbit
Posts: 2560
Joined: Thu Feb 12, 2015 12:40 pm

Re: Alerting for heartbeat messages

Post by jolson »

Can you please elaborate a little bit more? Does this machine send a heartbeat in the form of a 'log' to the Log Server - or does it heartbeat to a different component and send a log to NLS that explains whether the heartbeat was successful or not? Just need a little bit more info here. Thanks!
Twits Blog
Show me a man who lives alone and has a perpetually clean kitchen, and 8 times out of 9 I'll show you a man with detestable spiritual qualities.
Jklre
Posts: 163
Joined: Wed May 28, 2014 1:56 pm

Re: Alerting for heartbeat messages

Post by Jklre »

jolson wrote:Can you please elaborate a little bit more? Does this machine send a heartbeat in the form of a 'log' to the Log Server - or does it heartbeat to a different component and send a log to NLS that explains whether the heartbeat was successful or not? Just need a little bit more info here. Thanks!

This just send a syslog message for example "<14>10001 gpgd_02.exe: gpgd_02.exe v1.8 Heartbeat" every 5 minutes. If the message does not get sent then the process is dead / hung / not running. We have a few other ones that send off a message every 15 - 30 minutes or so but its all through syslog.

So ideally we would want to run a query for messages received in the past 5 minutes and if its 0 then throw an alert.
Jklre
Posts: 163
Joined: Wed May 28, 2014 1:56 pm

Re: Alerting for heartbeat messages

Post by Jklre »

Jklre wrote:
jolson wrote:Can you please elaborate a little bit more? Does this machine send a heartbeat in the form of a 'log' to the Log Server - or does it heartbeat to a different component and send a log to NLS that explains whether the heartbeat was successful or not? Just need a little bit more info here. Thanks!

This just send a syslog message for example "<14>10001 gpgd_02.exe: gpgd_02.exe v1.8 Heartbeat" every 5 minutes. If the message does not get sent then the process is dead / hung / not running. We have a few other ones that send off a message every 15 - 30 minutes or so but its all through syslog.

So ideally we would want to run a query for messages received in the past 5 minutes and if its 0 then throw an alert.
There's some similar functionality to what i'm talking about in MIeventD

http://mathias-kettner.com/checkmk_mkev ... nting.html

What I'm talking about is in the "Expect regular messages" section.
jdalrymple
Skynet Drone
Posts: 2620
Joined: Wed Feb 11, 2015 1:56 pm

Re: Alerting for heartbeat messages

Post by jdalrymple »

It sounds like MK is doing the same thing there as freshness in Nagios.

http://nagios.sourceforge.net/docs/3_0/freshness.html

The question is whether or not Nagios is already receiving "passive checks" or if you're wanting to monitor some 3rd party message queue. It sounds like the latter. There are numerous plugins available on the exchange to monitor logfiles, although I don't know of any that interact with a syslogd to monitor freshness that way.

http://exchange.nagios.org/directory/Plugins/Log-Files
Jklre
Posts: 163
Joined: Wed May 28, 2014 1:56 pm

Re: Alerting for heartbeat messages

Post by Jklre »

jdalrymple wrote:It sounds like MK is doing the same thing there as freshness in Nagios.

http://nagios.sourceforge.net/docs/3_0/freshness.html

The question is whether or not Nagios is already receiving "passive checks" or if you're wanting to monitor some 3rd party message queue. It sounds like the latter. There are numerous plugins available on the exchange to monitor logfiles, although I don't know of any that interact with a syslogd to monitor freshness that way.

http://exchange.nagios.org/directory/Plugins/Log-Files
Yeah.. I looked into other nagios plugins and there are none for syslogd and having another third party plugin look at the same syslog stream isn't a very practical solution.
jdalrymple
Skynet Drone
Posts: 2620
Joined: Wed Feb 11, 2015 1:56 pm

Re: Alerting for heartbeat messages

Post by jdalrymple »

Sorry I wasn't paying attention to what forum I was in, I thought I was answering a Nagios question, not NLS. My apologies for sending mixed up info.

Upon further research, you're not the first person to ask for this. We have an internal feature request for this that is outstanding. I'll +1 it to our devs, but as of now we don't have a simple feature like what you're looking for.

It's not the best answer, but it's all I've got. Can I lock this thread?
Jklre
Posts: 163
Joined: Wed May 28, 2014 1:56 pm

Re: Alerting for heartbeat messages

Post by Jklre »

jdalrymple wrote:Sorry I wasn't paying attention to what forum I was in, I thought I was answering a Nagios question, not NLS. My apologies for sending mixed up info.

Upon further research, you're not the first person to ask for this. We have an internal feature request for this that is outstanding. I'll +1 it to our devs, but as of now we don't have a simple feature like what you're looking for.

It's not the best answer, but it's all I've got. Can I lock this thread?
Yes please +1 +1 +1 :D I have a few other questions but i'll start a different thread form them. Thank you.
Locked