Page 2 of 2

Re: ClusterGroup Confusion

Posted: Thu Jul 18, 2019 7:11 pm
by nickanderson1982
Maybe I'm not understanding the logic. I create a cfg file with hosts and services. Do I not need to include hosts and addresses? Is it just a cfg file and then check services statement?

I'm just used to creating a host file and adding services to it. This is a concept I haven't setup before. Can you please clarify for me.

Re: ClusterGroup Confusion

Posted: Fri Jul 19, 2019 7:00 am
by scottwilkerson
you wuld create a host/service like this

Code: Select all

define host {
    host_name          CLUSTERHOST
    use                generic-host
    address            CLUSTERHOST
    register           1
}

define service {
    host_name                CLUSTERHOST
    service_description      SERVICENAME
    use                      genericnet-service
    check_command   check_service_cluster!"DNS Cluster"!1!1!$SERVICESTATEID:APIA-A:DNS Service$,$SERVICESTATEID:APIA-B:DNS Service$,$SERVICESTATEID:APIA-B:DNS Service$
    register                 1
}
Obviously replacing APIA-A and APIA-B and "DNS Service" in the check_command to match the actual host and service name of the hosts and services in the cluster

Re: ClusterGroup Confusion

Posted: Fri Jul 19, 2019 1:05 pm
by nickanderson1982
I created the cfg file with my settings but I am receiving a critical message.


define host {
host_name APIA-Servers
use generic-host
address APIA-Servers
register 1
max_check_attempts 5
check_interval 5
retry_interval 1
check_period 24x7
contact_groups admins
notification_interval 60
notification_period 24x7
notifications_enabled 1
icon_image ncpa.png
statusmap_image ncpa.png
}

define service {
host_name APIA-Servers
service_description Name Event Processor [Prod]
use generic-service
check_command check_service_cluster!"Name Event Processor [Prod]"!1!1!$SERVICESTATEID:APIA-1:"Name Event Processor [Prod]"$,$SERVICESTATEID:APIA-2:"Name Event Processor [Prod]"$
register 1
max_check_attempts 5
check_interval 5
retry_interval 1
check_period 24x7
notification_interval 60
notification_period 24x7
contact_groups admins
}

Current Status: CRITICAL (for 0d 0h 2m 22s)
Status Information: CLUSTER CRITICAL: Name Event Processor [Prod]: 0 ok, 0 warning, 2 unknown, 0 critical
Performance Data:
Current Attempt: 5/5 (HARD state)
Last Check Time: 07-19-2019 13:02:18
Check Type: ACTIVE
Check Latency / Duration: 0.000 / 0.001 seconds
Next Scheduled Check: 07-19-2019 13:07:18
Last State Change: 07-19-2019 13:02:18
Last Notification: N/A (notification 0)
Is This Service Flapping? NO (6.25% state change)
In Scheduled Downtime? NO
Last Update: 07-19-2019 13:04:33 ( 0d 0h 0m 7s ago)

Re: ClusterGroup Confusion

Posted: Fri Jul 19, 2019 1:45 pm
by scottwilkerson
You should not have " in here inside of the macros

Code: Select all

define service {


check_command check_service_cluster!"Name Event Processor [Prod]"!1!1!$SERVICESTATEID:APIA-1:"Name Event Processor [Prod]"$,$SERVICESTATEID:APIA-2:"Name Event Processor [Prod]"$


}
Replace with

Code: Select all

check_command check_service_cluster!"Name Event Processor [Prod]"!1!1!$SERVICESTATEID:APIA-1:Name Event Processor [Prod]$,$SERVICESTATEID:APIA-2:Name Event Processor [Prod]$

Re: ClusterGroup Confusion

Posted: Fri Jul 19, 2019 1:47 pm
by scottwilkerson
Also, what is the current state of host / service?
APIA-1 / Name Event Processor [Prod]

is APIA-1 the actual full hostname?

Re: ClusterGroup Confusion

Posted: Fri Jul 19, 2019 2:25 pm
by nickanderson1982
Scott, thank you for your assistance. That was the issue. I appreciate you walking me through this concept from beginning to end. This ticket is completed.

Re: ClusterGroup Confusion

Posted: Fri Jul 19, 2019 3:28 pm
by scottwilkerson
nickanderson1982 wrote:Scott, thank you for your assistance. That was the issue. I appreciate you walking me through this concept from beginning to end. This ticket is completed.
Great!

Locking