delay in service check
-
amit.ahuja
- Posts: 113
- Joined: Fri May 15, 2015 8:20 am
delay in service check
i have a service "File_Check" that check every minute whether a file exist or not on multiple servers. But i notice there's a delay in a check process. it doesn't check every min for some of the servers.
You do not have the required permissions to view the files attached to this post.
Re: delay in service check
Can we see the configuration for this service check? I imagine the check_interval may be wrong.
Former Nagios Employee.
me.
me.
-
amit.ahuja
- Posts: 113
- Joined: Fri May 15, 2015 8:20 am
Re: delay in service check
Code: Select all
define service {
host_name testbox
service_description File_Check
check_command check_nrpe!check_file_exists!-a /www/html/Keepalive.html!!!!!!
max_check_attempts 3
check_interval 1
retry_interval 2
check_period 24x7
notification_interval 15
contact_groups support
notification_period 24x7
notifications_enabled 0
notification_options w,c,r
_xiwizard nrpe
register 1
}
Re: delay in service check
Can you please post the definition that relates to 'vews016'? This one appears to be fore textbox, and checking a different file.
Former Nagios Employee
-
amit.ahuja
- Posts: 113
- Joined: Fri May 15, 2015 8:20 am
Re: delay in service check
It's the same configuration, just like the other hosts. some hosts are checking every minute, some are not.
Code: Select all
define service {
host_name vews016
service_description File_Check
check_command check_nrpe!check_file_exists!-a /macys.war/macyshc.html!!!!!!
max_check_attempts 3
check_interval 1
retry_interval 2
check_period 24x7
notification_interval 15
contact_groups support
notification_period 24x7
notifications_enabled 0
notification_options w,c,r
_xiwizard nrpe
register 1
Re: delay in service check
Can you please post your /usr/local/nagios/etc/nagios.cfg file here for review?
Former Nagios Employee.
me.
me.
-
amit.ahuja
- Posts: 113
- Joined: Fri May 15, 2015 8:20 am
Re: delay in service check
sure.
You do not have the required permissions to view the files attached to this post.
Re: delay in service check
This looks fine as well. I wonder if something is going on in your system.
Can you PM over a profile? (Admin -> System Profile -> Download Profile)
EDIT: profile received
Can you PM over a profile? (Admin -> System Profile -> Download Profile)
EDIT: profile received
Former Nagios Employee
Re: delay in service check
Just to confirm, do you have 32G of ram allocated to this machine?
I am seeing a few errors -
Additionally, at the top of your processes I saw this -
What is the output of df -H?
I am seeing a few errors -
Code: Select all
Mar 4 08:48:04 MA100DLVMON812 nagios: wproc: 'Core Worker 21908' seems to be choked. ret = -1; bufsize = 117: errno = 11 (Resource temporarily unavailable)
Code: Select all
160303 7:30:07 [Warning] Disk is full writing './nagios/nagios_logentries.TMD' (Errcode: 28). Waiting for someone to free space... (Expect up to 60 secs delay for server to continue after freeing disk space)
Code: Select all
USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
root 24976 51.6 2.5 984824 844964 pts/2 S+ 16:49 0:07 vim nagios_logentries.MYD
root 17385 0.1 0.7 2897740 233684 ? Sl 2015 358:58 /opt/IBM/ITM/lx8266/lz/bin/klzagent
Former Nagios Employee
-
amit.ahuja
- Posts: 113
- Joined: Fri May 15, 2015 8:20 am
Re: delay in service check
yes i do have 32G allocated to this vm, i saw that /var was full and cleaned them. i also changed some performance setting and adjust reaper setting in nagios.cfg. It's working now.
Thanks
Thanks