Installing/Configuring Nagios Core 4.1.1.
Posted: Thu Dec 24, 2015 9:37 am
Unfortunately, I'm setting up Nagios to cover our environment as a background task, in between projects and crises, so I need a few questions answered.
I currently have nagios-4.1.1.tar.gz built and installed on Nagios001 (see attached drawing). I also have nagios-plugins-2.1.1.tar.gz built and installed. I've attempted to install NRPE, but it fails.
All the servers, including Nagios001, are running the same OS: Red Hat Enterprise Linux Server release 6.4 (Santiago).
I have configured Nagios to run check_ping against all the servers, and that works, so I can at least tell that they are alive. (No Nagios software is installed on any of them.) But I now have some additional needs.
1) I need to be able to monitor apl-001 for changes in size of /tmp/file1. What is the best way to do this?
2) I need to be able to monitor memory, network, and CPU utilization on all the compute-00x nodes.
3) I need to be able to run a program (qstat) on login-001 and watch the output for errors. In this case, the "qstat" is from Univa Grid Engine. However, the need exists to be able to run a program (my choice) on any node and check its output.
4) I need to be able to monitor httpd on a node and check that it returns appropriate output, so I can be sure it is alive. I've tried this against the "nagios001" httpd instance, but so far it has failed, so I'm eventually going to need to figure that out. It is a longer-term need, though.
5) I need to know what historic data is being maintained, or that I can maintain, so I can start to do workload analysis over the long term.
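To make needs 1 and 3 more concrete, here is a rough sketch of the kind of check I have in mind. It assumes the usual Nagios plugin convention (one line of status text, optional performance data after a "|", and exit codes 0/1/2/3 for OK/WARNING/CRITICAL/UNKNOWN). The script name, the function names, and the state-file argument are my own placeholders, not anything that ships with Nagios, and I don't know whether this is the recommended approach.

```python
import os
import sys

# Exit codes from the Nagios plugin convention.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def check_file_size(path, state_path):
    """Compare the current size of `path` with the size saved on the last run.

    `state_path` is a hypothetical scratch file that remembers the last size.
    Returns an (exit_code, message) pair.
    """
    try:
        size = os.path.getsize(path)
    except OSError as exc:
        return UNKNOWN, "UNKNOWN - cannot stat %s: %s" % (path, exc)

    # Read the size recorded by the previous run, if there was one.
    previous = None
    if os.path.exists(state_path):
        with open(state_path) as fh:
            text = fh.read().strip()
        if text.isdigit():
            previous = int(text)

    # Remember the current size for the next run.
    with open(state_path, "w") as fh:
        fh.write(str(size))

    perfdata = "size=%dB" % size
    if previous is None:
        return OK, "OK - first run, size is %d bytes | %s" % (size, perfdata)
    if size != previous:
        return WARNING, "WARNING - size changed from %d to %d bytes | %s" % (
            previous, size, perfdata)
    return OK, "OK - size unchanged at %d bytes | %s" % (size, perfdata)


def main(argv):
    """Command-line wrapper: print the status line and return the exit code."""
    if len(argv) != 3:
        print("UNKNOWN - usage: check_file_size.py FILE STATE_FILE")
        return UNKNOWN
    code, message = check_file_size(argv[1], argv[2])
    print(message)
    return code


# Only act when arguments are given, so the functions can also be imported
# or exercised directly without the script exiting.
if __name__ == "__main__" and len(sys.argv) > 1:
    sys.exit(main(sys.argv))
```

Presumably something like this would have to run on apl-001 itself and be triggered from Nagios001 somehow (NRPE, if I can get it installed), and a qstat check would follow the same pattern with the file-size logic swapped for parsing the command output. Whether a changed size should count as WARNING or CRITICAL is a guess on my part.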
My questions mostly come down to:
1) What do I need to install to get each of these done?
2) How will I need to configure it?
3) Where is the documentation I can use to be able to configure things for myself? There are too many documents that seem to be just stubs, or are for older versions of the code.
I need open source solutions wherever possible.
Thanks in advance for your help.