I have recently defined a custom time period for a certain service. It goes like this:
Code: Select all
define timeperiod{
timeperiod_name GridLoad
alias grid is running
sunday -1 march - sunday -1 october 12:59-18:20 ;BST
sunday -1 october - sunday -1 march 11:59-17:20 ;Non-BST
}
Code: Select all
define service{
use generic-service
name SelfCheckServiceGridLoad
servicegroups SelfCheckServiceGrp
hostgroup_name NagiosServerGrp
check_period GridLoad ; The service can only be checked when we expect the grid to start
max_check_attempts 5 ; Re-check the service up to 5 times in order to determine its final (hard) state
normal_check_interval 1 ; Check the service every minute under normal conditions
retry_check_interval 1 ; Re-check the service every minute until a hard state can be determined
contact_groups admins ; Notifications get sent out to everyone in the 'admins' group
notification_options w,u,c,r ; Send notifications about warning, unknown, critical, and recovery events
notification_interval 60 ; Re-notify about service problems every hour
notification_period GridLoad ; Notifications can be sent out only during the period
notifications_enabled 1 ; Service notifications are enabled
register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE!
}
Code: Select all
define service{
use SelfCheckServiceGridLoad
service_description Grid Load check
check_command check_grid_load
}
When the next day comes in and we enter the period again, however, it doesn't restart checking again as it should. It seems to be "over-sleeping", if you excuse my dream metaphors
Is this behaviour expected? Have i overlooked anything? All help will be much appreciated.