No data collected on monitor with service dependency
Posted: Wed Aug 26, 2015 11:40 am
Nagios XI 2.7
RHEL 6
NCPA 1.8.1 agents
I have just noticed something today.
We have backup processes on our Linux servers that run at various times throughout the week. In many cases the CPU maxes out while they run, which triggers alerts.
To correct this, I created a service dependency: HostA/CPU depends on HostA/backupprocess.exe.
The backup process checks use reverse logic: if the process is seen running, the check goes critical, and because the CPU check depends on it, alerting stops. But today I realized that NO CPU data was being collected during this window. Several backups have now been running for the last 8 hours, and the CPU data is missing for exactly that time window.
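One variant that might be worth testing, as an untested sketch only: as I understand the Nagios Core documentation, execution_failure_criteria controls whether the dependent service is actively checked at all, while notification_failure_criteria only controls its notifications. If that is right, setting the execution criteria to n should keep the CPU check running (and graphing) while alerts stay suppressed. The values are copied from my rule for servera.domain; the only change I am assuming is the n on the execution line.

```
define servicedependency {
    dependent_host_name             servera.domain
    dependent_service_description   CPU Usage
    host_name                       servera.domain
    service_description             Instances of save,Instances of savefs
    inherits_parent                 1
    execution_failure_criteria      n      ; assumption: never skip the CPU check, so data keeps being collected
    notification_failure_criteria   w,c    ; still suppress CPU notifications while the backup check is WARNING/CRITICAL
    dependency_period               xi_timeperiod_24x7
}
```

I have not confirmed how Nagios XI writes this out from the dependency wizard, so treat the exact layout as a guess.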
Here is the screenshot of the CPU graph:
Here is the service dependency rule for one of the hosts:
define servicedependency {
    dependent_host_name             servera.domain
    dependent_service_description   CPU Usage
    host_name                       servera.domain
    service_description             Instances of save,Instances of savefs
    inherits_parent                 1
    execution_failure_criteria      w,c,
    notification_failure_criteria   w,c,
    dependency_period               xi_timeperiod_24x7