Not executing service checks, it's only checking for flapping
Posted: Mon Sep 08, 2014 3:59 pm
Hello,
Service checks do not seem to be happening. When it is time to run the service check, it only
checks to see if it's flapping instead of executing the actual check command.
The 'last check' looks good, about 3 minutes ago
The 'next check' looks good, about 5 minutes in the future
In the scheduling queue, I see the service check is scheduled, same as the 'next check' time in service status.
When it comes time to check, it does not run the check command, and reschedules the next check anyway.
If I manually fail the website (what the service is checking) and it comes time to check the service, it does
not mark it down.
In nagios.debug, when it comes time to run the service check, I see:
Checking service 'FOO' on host 'BAR' for flapping...
[1410202548.652321] [016.1] [pid=26838] Service is not flapping (0.00% state change).
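To line these debug entries up against the 'last check' and 'next check' times in the web interface, the epoch timestamp at the start of each line can be converted to a readable time. A minimal sketch, assuming the bracketed fields are timestamp, debug level and verbosity, and PID (the field names and the `parse_debug_line` helper are mine, not anything Nagios provides):

```python
import re
from datetime import datetime, timezone

# Matches lines like: [1410202548.652321] [016.1] [pid=26838] message
# Assumption: the second bracket is "<debug level>.<verbosity>".
DEBUG_LINE = re.compile(
    r"^\[(\d+)\.(\d+)\] \[(\d+)\.(\d+)\] \[pid=(\d+)\] (.*)$"
)

def parse_debug_line(line):
    """Return the fields of one debug line as a dict, or None if it does not match."""
    m = DEBUG_LINE.match(line.strip())
    if m is None:
        return None
    secs, _usecs, level, verbosity, pid, message = m.groups()
    return {
        # Converted to UTC; adjust to the server's timezone when comparing
        # with the times shown in the web interface.
        "time": datetime.fromtimestamp(int(secs), tz=timezone.utc),
        "level": int(level),
        "verbosity": int(verbosity),
        "pid": int(pid),
        "message": message,
    }

sample = "[1410202548.652321] [016.1] [pid=26838] Service is not flapping (0.00% state change)."
entry = parse_debug_line(sample)
print(entry["time"].isoformat(), "|", entry["message"])
# 2014-09-08T18:55:48+00:00 | Service is not flapping (0.00% state change).
```

Lines without the bracketed prefix, such as the "Checking service ..." line as pasted above, return None and can be skipped.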
This is happening to all my services. I tried stopping Nagios and removing the retention file; now
everything is stuck in 'pending' and the 'next check' time keeps incrementing.
This is Nagios 3.4.1 on Debian Wheezy. My main config file is:
accept_passive_host_checks=1
accept_passive_service_checks=1
additional_freshness_latency=15
admin_email=root@localhost
admin_pager=pageroot@localhost
auto_reschedule_checks=0
auto_rescheduling_interval=30
auto_rescheduling_window=180
bare_update_check=0
cached_host_check_horizon=15
cached_service_check_horizon=15
cfg_dir=/etc/nagios3/conf.d
cfg_dir=/etc/nagios3/rusa/
cfg_dir=/etc/nagios-plugins/config
cfg_file=/etc/nagios3/commands.cfg
check_external_commands=1
check_for_orphaned_hosts=1
check_for_orphaned_services=1
check_for_updates=1
check_host_freshness=0
check_result_path=/var/lib/nagios3/spool/checkresults
check_result_reaper_frequency=10
check_service_freshness=1
command_check_interval=-1
command_file=/var/lib/nagios3/rw/nagios.cmd
daemon_dumps_core=0
date_format=iso8601
debug_file=/var/log/nagios3/nagios.debug
debug_level=-1
debug_verbosity=1
enable_embedded_perl=1
enable_environment_macros=1
enable_event_handlers=1
enable_flap_detection=0
enable_notifications=0
enable_predictive_host_dependency_checks=1
enable_predictive_service_dependency_checks=1
event_broker_options=-1
event_handler_timeout=30
execute_host_checks=1
execute_service_checks=1
external_command_buffer_slots=4096
high_host_flap_threshold=20.0
high_service_flap_threshold=20.0
host_check_timeout=30
host_freshness_check_interval=60
host_inter_check_delay_method=s
illegal_macro_output_chars=`~$&|'"<>
illegal_object_name_chars=`~!$%^&*|'"<>?,()=
interval_length=60
lock_file=/var/run/nagios3/nagios3.pid
log_archive_path=/var/log/nagios3/archives
log_event_handlers=1
log_external_commands=1
log_file=/var/log/nagios3/nagios.log
log_host_retries=1
log_initial_states=0
log_notifications=1
log_passive_checks=1
log_rotation_method=d
log_service_retries=1
low_host_flap_threshold=5.0
low_service_flap_threshold=5.0
max_check_result_file_age=3600
max_check_result_reaper_time=30
max_concurrent_checks=0
max_debug_file_size=100000000
max_host_check_spread=30
max_service_check_spread=30
nagios_group=nagios
nagios_user=nagios
notification_timeout=30
object_cache_file=/var/cache/nagios3/objects.cache
obsess_over_hosts=0
obsess_over_services=0
ocsp_timeout=5
p1_file=/usr/lib/nagios3/p1.pl
passive_host_checks_are_soft=0
perfdata_timeout=5
precached_object_file=/var/lib/nagios3/objects.precache
process_performance_data=0
resource_file=/etc/nagios3/resource.cfg
retained_contact_host_attribute_mask=0
retained_contact_service_attribute_mask=0
retained_host_attribute_mask=0
retained_process_host_attribute_mask=0
retained_process_service_attribute_mask=0
retained_service_attribute_mask=0
retain_state_information=1
retention_update_interval=60
service_check_timeout=60
service_check_timeout_state=c
service_freshness_check_interval=60
service_inter_check_delay_method=s
service_interleave_factor=s
sleep_time=0.25
soft_state_dependencies=0
state_retention_file=/var/lib/nagios3/retention.dat
status_file=/var/cache/nagios3/status.dat
status_update_interval=10
temp_file=/var/cache/nagios3/nagios.tmp
temp_path=/tmp
translate_passive_host_checks=0
use_aggressive_host_checking=0
use_embedded_perl_implicitly=1
use_large_installation_tweaks=0
use_regexp_matching=0
use_retained_program_state=1
use_retained_scheduling_info=1
use_syslog=1
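Since the file above is long, it may help to pull out just the settings that look related to check execution. This is only a rough sketch: the `parse_main_config` helper and the choice of which keys to print are my own assumptions, not a statement about which settings actually cause the problem. Repeated keys such as `cfg_dir` are collected into lists.

```python
from collections import defaultdict

def parse_main_config(text):
    """Parse key=value lines; repeated keys (e.g. cfg_dir) keep every value."""
    settings = defaultdict(list)
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # Split on the first '=' only, since values may themselves contain '='.
        key, sep, value = line.partition("=")
        if not sep:
            continue  # not a key=value line
        settings[key.strip()].append(value.strip())
    return dict(settings)

# A few lines copied from the config above; in practice, read the real file.
sample = """\
execute_service_checks=1
enable_flap_detection=0
max_concurrent_checks=0
check_result_path=/var/lib/nagios3/spool/checkresults
cfg_dir=/etc/nagios3/conf.d
cfg_dir=/etc/nagios3/rusa/
"""

cfg = parse_main_config(sample)
for key in ("execute_service_checks", "enable_flap_detection",
            "max_concurrent_checks", "check_result_path", "cfg_dir"):
    print(key, "=", cfg.get(key))
```

One thing this makes easy to see is that enable_flap_detection is 0 in my config, even though the debug log still shows the flapping check being logged.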