We have a relatively new install of NagiosXI on CentOS 5.6 (x64) with one parent, and multiple children fed through dnx.
We have around 40 hosts in there now, each with checks, and each working perfectly fine for notifications from the hosts, and services default contacts.
We have a .com site that has several of the same type of server in each tier (Web/App/DB).
The issue occurs when we set up an escalation.
We can select ALL servers in each tier, and say an HTTPD check as the service, but the only service that gets selected is HTTPD for the first server.
None of the other servers get selected.
So we add an escalation with two hosts (xxxwb1 & xxxwb2) and one service (HTTPD) and we get the following:
Code: Select all
select * from tbl_serviceescalation;
id config_name host_name hostgroup_name service_description contacts contact_groups first_notification last_notification notification_interval escalation_period escalation_options active last_modified access_rights config_id
"45" "testing" "1" "0" "1" "1" "0" "2" "2" "15" "2" "c" "1" "2012-03-29 10:55:48" "<NULL>" "1"
Code: Select all
select * from tbl_lnkServiceescalationToHost where idMaster = 45;
idMaster idSlave
"45" "317"
"45" "318"
Code: Select all
select id, host_name from tbl_host where id in (317, 318);
id host_name
"317" "xxxwb1"
"318" "xxxwb2"
But now when we look for the service we associated.
Code: Select all
select * from tbl_lnkServiceescalationToService where idMaster = 45;
idMaster idSlave
"45" "1784"
Code: Select all
select id, config_name, host_name, service_description from tbl_service where id = 1784;
id config_name host_name service_description
"1784" "xxxwb0" "1" "HTTPD"
It picked up the HTTPD service for a different host.
This is HTTPD for xxxwb0, not 1 or 2...
and as a result, when we tested it, it did not escalate for wb1 or wb2, nor wb0 for that matter.
Oh Great and wise nagios Gurus, please show me the error of my ways.