We seem to be having periods of time where some of our hosts/services are timing out (60/30 secs). Most of these arent causing HARD notifications, but show up on our monitoring board and have drawn the attention of our support team. Don't see anything unusual in our system statistics/monitoring engine stats. Database usage looks normal as well. Included system profile.
Code: Select all
[root@bnalmnag702 var]# less eventman.log
[contact] => xxxxxxxxx
[contactemail] => [email protected]
[type] => PROBLEM
[escalated] => 0
[author] =>
[comments] =>
[host] => Mxxxxxx
[hostaddress] => 10.200.170.100
[hostalias] => Gxxxxxxx
[hostdisplayname] => Mxxxxx
[hoststate] => DOWN
[hoststateid] => 1
[lasthoststate] => DOWN
[lasthoststateid] => 1
[hoststatetype] => HARD
[currentattempt] => 5
[maxattempts] => 5
[hosteventid] => 1167267
[hostproblemid] => 372100
[hostoutput] => (Host check timed out after 31.01 seconds)
[longhostoutput] =>
[datetime] => Tue Aug 24 03:52:58 CDT 2021