Page 1 of 1

AWS Services Monitoring

Posted: Wed Aug 10, 2016 8:36 am
by hrks
Hi,
We are looking in to NagiosXI as possible solution for monitoring AWS resources.
I have installed NagiosXI.
Per my searches, it seems like there is not a built-in or Nagios provided plugin for auto-discovery of resources/services available under our AWS accounts.

Is following community plugin still the most popular/recommended option: check_aws_cloudwatch_metrics
https://exchange.nagios.org/directory/P ... cs/details

A few important points for us to consider before deciding if NagiosXI is a good fit for our use case:
-- HA and automatic failover between AZs; resilient to failures, including AZ failures.

-- Multi-tenancy and RBAC; users should be able to monitor/administer per their application environment.

-- Support for using multiple tags to filter and place nodes in to separate application, deployment etc environments.

-- Auto-discovery of inventory, eg. auto responding to ASG scale-up scale-down events etc.

-- Instance names (not ec2 hostnames nor instance-ids ) can be duplicates per application environment, we want monitoring tool to handle this by using IP or hostname for monitoring. In other words it should not lump together two different instances simply because AWS console instance name (first column in EC2 instances view) is the same.

-- Support for multiple AWS services that can be monitored with CloudWatch, including ELBs, EC2, RDS, SQS, etc.

With these main points in mind, what other plugins or external applications may be needed to augment NagiosXI for AWS monitoring?


Thank you!

Re: AWS Services Monitoring

Posted: Wed Aug 10, 2016 3:29 pm
by rkennedy
This is quite a lot, so going to break your questions down -
Is following community plugin still the most popular/recommended option: check_aws_cloudwatch_metrics
https://exchange.nagios.org/directory/P ... cs/details
It's hard to say, we do not have any official reccomendations at all for AWS plugins.
-- HA and automatic failover between AZs; resilient to failures, including AZ failures.
XI does not have HA built in. Linbit handles DRBD / HA for Nagios XI. Here's more information - http://www.linbit.com/en/resources/tech ... centos-6-5
-- Multi-tenancy and RBAC; users should be able to monitor/administer per their application environment.
See this document - https://assets.nagios.com/downloads/nag ... ios-XI.pdf
-- Support for using multiple tags to filter and place nodes in to separate application, deployment etc environments.
Not sure what you're asking here, could you clarify?
-- Auto-discovery of inventory, eg. auto responding to ASG scale-up scale-down events etc.
We have an auto discovery wizard that will scan a subnet and allow you to add hosts in. Another option is to use our API. (look under the Help section in Nagios XI for more info on the API) If you're looking for 'auto responding', take a look at event handlers to see if that's what would work for you. Nagios provides the frame work, but you should be able to communicate between different pieces of software using the event handlers in conjunction with something you write up.
https://assets.nagios.com/downloads/nag ... ios-XI.pdf
-- Instance names (not ec2 hostnames nor instance-ids ) can be duplicates per application environment, we want monitoring tool to handle this by using IP or hostname for monitoring. In other words it should not lump together two different instances simply because AWS console instance name (first column in EC2 instances view) is the same.
I don't think Nagios will care what the ec2 hostnames / instance-ids are, as long as the 'host' field in Nagios is unique.
-- Support for multiple AWS services that can be monitored with CloudWatch, including ELBs, EC2, RDS, SQS, etc.
Nagios can monitor just about anything that you can provide a result with over the CLI. Take a look at our Exchange (https://exchange.nagios.org) to see if any plugins here will work for what you're looking to monitor.