use Nagios to record arbitrary (non-problem) events?
Posted: Wed Nov 26, 2014 8:04 am
hi,
I use Nagios today to monitor "problem" related things, like servers downtime, apps that stopped responding, etc, and all this monitorings fire alerts on Nagios and trigger Jira tickets to the support team.
but would it be possible to tell Nagios to record things that are not problems, and keep the information so I could see it in the reports and timelines?
for example, I use Jenkins to make my deploys, and so, Jenkins jobs connects to my servers, stop my apps services, uploads the new version, and starts the app service back. I would like to notify Nagios things like: "jenkins stopped app1 service on serverX", "jenkins uploaded app1 version 1.1.x to serverX", "jenkins started app1 service on serverX", or... "Jira ticket NNN closed related to Database team group", or... "sensitive application configuration changed by user via web console".
none of those "events" were problems. they happed as demanded, and they finished with success. but I wanted they shown along with other Nagios alerts, so it could help the alerts analysis. for example, imagine Nagios detect and app crash minutes after a redeploy, of a CPU overload after a configuration change. that could be a helping clue, right?
but as long as I could find, Nagios only records "problems", "alerts", "incidents", and there is no way to tell him things that happened sucessfully. I know I could just store good-events as "problems" with a nice prefix name like "[EVENT-INFO]" or something,... but I think there is a better way to do this.
any suggestions?
thanks in advance.
I use Nagios today to monitor "problem" related things, like servers downtime, apps that stopped responding, etc, and all this monitorings fire alerts on Nagios and trigger Jira tickets to the support team.
but would it be possible to tell Nagios to record things that are not problems, and keep the information so I could see it in the reports and timelines?
for example, I use Jenkins to make my deploys, and so, Jenkins jobs connects to my servers, stop my apps services, uploads the new version, and starts the app service back. I would like to notify Nagios things like: "jenkins stopped app1 service on serverX", "jenkins uploaded app1 version 1.1.x to serverX", "jenkins started app1 service on serverX", or... "Jira ticket NNN closed related to Database team group", or... "sensitive application configuration changed by user via web console".
none of those "events" were problems. they happed as demanded, and they finished with success. but I wanted they shown along with other Nagios alerts, so it could help the alerts analysis. for example, imagine Nagios detect and app crash minutes after a redeploy, of a CPU overload after a configuration change. that could be a helping clue, right?
but as long as I could find, Nagios only records "problems", "alerts", "incidents", and there is no way to tell him things that happened sucessfully. I know I could just store good-events as "problems" with a nice prefix name like "[EVENT-INFO]" or something,... but I think there is a better way to do this.
any suggestions?
thanks in advance.