Nagios process not running on server ...

tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am


Post by tmcdonald »

jyoti22 wrote: Please find attached nagios.debug file for reference.
You do not seem to have attached anything. Can you please try again?
Former Nagios employee
maddev
Posts: 54
Joined: Tue Apr 07, 2015 5:42 am


Post by maddev »

Here's the debug log from that machine.
The nagios service stays alive for a few seconds when I start it, and then goes down.


[1434971897.401355] [001.0] [pid=26213] initialize_downtime_data()
[1434971897.401505] [001.0] [pid=26213] xrddefault_read_state_information() start
[1434971897.402023] [001.0] [pid=26213] sort_downtime()
[1434971897.403497] [001.0] [pid=26213] schedule_new_event()
[1434971897.403514] [001.0] [pid=26213] add_event()
[1434971897.403522] [001.0] [pid=26213] schedule_new_event()
[1434971897.403529] [001.0] [pid=26213] add_event()
[1434971897.403535] [001.0] [pid=26213] init_timing_loop() start
[1434971897.403548] [001.0] [pid=26213] check_time_against_period()
[1434971897.403573] [001.0] [pid=26213] check_time_against_period()
[1434971897.403589] [001.0] [pid=26213] _get_matching_timerange()
[1434971897.403621] [001.0] [pid=26213] check_time_against_period()
[1434971897.403740] [001.0] [pid=26213] schedule_new_event()
[1434971897.403759] [001.0] [pid=26213] add_event()
[1434971897.403794] [001.0] [pid=26213] check_time_against_period()
[1434971897.403819] [001.0] [pid=26213] _get_matching_timerange()
[1434971897.403907] [001.0] [pid=26213] schedule_new_event()
[1434971897.403922] [001.0] [pid=26213] add_event()
[1434971897.403943] [001.0] [pid=26213] schedule_new_event()
[1434971897.403952] [001.0] [pid=26213] add_event()
[1434971897.403958] [001.0] [pid=26213] schedule_new_event()
[1434971897.403965] [001.0] [pid=26213] add_event()
[1434971897.403970] [001.0] [pid=26213] schedule_new_event()
[1434971897.403977] [001.0] [pid=26213] add_event()
[1434971897.403982] [001.0] [pid=26213] schedule_new_event()
[1434971897.403989] [001.0] [pid=26213] add_event()
[1434971897.403994] [001.0] [pid=26213] schedule_new_event()
[1434971897.404001] [001.0] [pid=26213] add_event()
[1434971897.404024] [001.0] [pid=26213] schedule_new_event()
[1434971897.404032] [001.0] [pid=26213] add_event()
[1434971897.404038] [001.0] [pid=26213] schedule_new_event()
[1434971897.404045] [001.0] [pid=26213] add_event()
[1434971897.404050] [001.0] [pid=26213] init_timing_loop() end
[1434971897.404061] [001.0] [pid=26213] schedule_new_event()
[1434971897.404068] [001.0] [pid=26213] add_event()
[1434971897.404116] [001.0] [pid=26213] save_status_data()
[1434971897.407368] [001.0] [pid=26225] clear_volatile_macros_r()
[1434971897.407378] [001.0] [pid=26213] event_execution_loop() start
[1434971897.407544] [001.0] [pid=26213] handle_timed_event() start
[1434971897.407573] [001.0] [pid=26213] run_scheduled_host_check()
[1434971897.407592] [001.0] [pid=26213] run_async_host_check(localhost ...)
[1434971897.407601] [001.0] [pid=26213] check_host_check_viability()
[1434971897.407607] [001.0] [pid=26213] check_time_against_period()
[1434971897.407624] [001.0] [pid=26213] _get_matching_timerange()
[1434971897.407640] [001.0] [pid=26213] check_host_dependencies()
[1434971897.407658] [001.0] [pid=26213] adjust_host_check_attempt()
[1434971897.407668] [001.0] [pid=26213] clear_volatile_macros_r()
[1434971897.407679] [001.0] [pid=26213] get_raw_command_line_r()
[1434971897.407749] [001.0] [pid=26213] get_next_valid_time()
[1434971897.407772] [001.0] [pid=26213] _get_matching_timerange()
[1434971897.407851] [001.0] [pid=26213] schedule_host_check()
[1434971897.407878] [001.0] [pid=26213] add_event()
[1434971897.407929] [001.0] [pid=26213] handle_timed_event() end
[1434971906.975027] [001.0] [pid=26213] handle_timed_event() start
[1434971906.975218] [001.0] [pid=26213] save_status_data()
[1434971906.977096] [001.0] [pid=26213] handle_timed_event() end
[1434971906.977113] [001.0] [pid=26213] reschedule_event()
[1434971906.977119] [001.0] [pid=26213] add_event()
[1434971906.977149] [001.0] [pid=26213] handle_timed_event() start
[1434971906.977184] [001.0] [pid=26213] reap_check_results() start
[1434971906.977245] [001.0] [pid=26213] reap_check_results() end
[1434971906.977253] [001.0] [pid=26213] handle_timed_event() end
[1434971906.977260] [001.0] [pid=26213] reschedule_event()
[1434971906.977265] [001.0] [pid=26213] add_event()
[1434971911.975334] [001.0] [pid=26213] handle_timed_event() start
[1434971911.975425] [001.0] [pid=26213] process_host_perfdata_file()
[1434971911.975435] [001.0] [pid=26213] get_raw_command_line_r()
[1434971911.975464] [001.0] [pid=26213] process_macros_r()
[1434971911.975499] [001.0] [pid=26213] my_system_r()
[1434971911.980738] [001.0] [pid=26213] clear_volatile_macros_r()
[1434971911.981218] [001.0] [pid=26213] handle_timed_event() end
[1434971911.981240] [001.0] [pid=26213] reschedule_event()
[1434971911.981247] [001.0] [pid=26213] add_event()
[1434971911.981290] [001.0] [pid=26213] handle_timed_event() start
[1434971911.981310] [001.0] [pid=26213] process_service_perfdata_file()
[1434971911.981320] [001.0] [pid=26213] get_raw_command_line_r()
[1434971911.981328] [001.0] [pid=26213] process_macros_r()
[1434971911.981371] [001.0] [pid=26213] my_system_r()
[1434971911.985429] [001.0] [pid=26213] clear_volatile_macros_r()
[1434971911.985447] [001.0] [pid=26213] handle_timed_event() end
[1434971911.985461] [001.0] [pid=26213] reschedule_event()
[1434971911.985468] [001.0] [pid=26213] add_event()
[1434971916.974822] [001.0] [pid=26213] handle_timed_event() start
[1434971916.974944] [001.0] [pid=26213] save_status_data()
[1434971916.977110] [001.0] [pid=26213] handle_timed_event() end
[1434971916.977126] [001.0] [pid=26213] reschedule_event()
[1434971916.977132] [001.0] [pid=26213] add_event()
[1434971916.977161] [001.0] [pid=26213] handle_timed_event() start
[1434971916.977187] [001.0] [pid=26213] reap_check_results() start
[1434971916.977232] [001.0] [pid=26213] reap_check_results() end
[1434971916.977239] [001.0] [pid=26213] handle_timed_event() end
[1434971916.977246] [001.0] [pid=26213] reschedule_event()
[1434971916.977251] [001.0] [pid=26213] add_event()
[1434971926.975972] [001.0] [pid=26213] handle_timed_event() start
[1434971926.976144] [001.0] [pid=26213] save_status_data()
[1434971926.978314] [001.0] [pid=26213] handle_timed_event() end
[1434971926.978331] [001.0] [pid=26213] reschedule_event()
[1434971926.978337] [001.0] [pid=26213] add_event()
[1434971926.978367] [001.0] [pid=26213] handle_timed_event() start
[1434971926.978390] [001.0] [pid=26213] reap_check_results() start
[1434971926.978431] [001.0] [pid=26213] reap_check_results() end
[1434971926.978438] [001.0] [pid=26213] handle_timed_event() end
[1434971926.978444] [001.0] [pid=26213] reschedule_event()
[1434971926.978449] [001.0] [pid=26213] add_event()
[1434971926.978469] [001.0] [pid=26213] handle_timed_event() start
[1434971926.978487] [001.0] [pid=26213] adjust_check_scheduling() start
[1434971926.978615] [001.0] [pid=26213] adjust_check_scheduling() end
[1434971926.978626] [001.0] [pid=26213] handle_timed_event() end
[1434971926.978632] [001.0] [pid=26213] reschedule_event()
[1434971926.978638] [001.0] [pid=26213] add_event()
[1434971926.978663] [001.0] [pid=26213] handle_timed_event() start
[1434971926.978699] [001.0] [pid=26213] process_service_perfdata_file()
[1434971926.978712] [001.0] [pid=26213] get_raw_command_line_r()
[1434971926.978719] [001.0] [pid=26213] process_macros_r()
[1434971926.978743] [001.0] [pid=26213] my_system_r()
[1434971926.983547] [001.0] [pid=26213] clear_volatile_macros_r()
[1434971926.983569] [001.0] [pid=26213] handle_timed_event() end
[1434971926.983585] [001.0] [pid=26213] reschedule_event()
[1434971926.983591] [001.0] [pid=26213] add_event()
[1434971926.983633] [001.0] [pid=26213] handle_timed_event() start
[1434971926.983653] [001.0] [pid=26213] process_host_perfdata_file()
[1434971926.983663] [001.0] [pid=26213] get_raw_command_line_r()
[1434971926.983670] [001.0] [pid=26213] process_macros_r()
[1434971926.983697] [001.0] [pid=26213] my_system_r()
[1434971926.987734] [001.0] [pid=26213] clear_volatile_macros_r()
[1434971926.987798] [001.0] [pid=26213] handle_timed_event() end
[1434971926.987829] [001.0] [pid=26213] reschedule_event()
[1434971926.987837] [001.0] [pid=26213] add_event()
[1434971936.975490] [001.0] [pid=26213] handle_timed_event() start
[1434971936.975611] [001.0] [pid=26213] save_status_data()
[1434971936.977776] [001.0] [pid=26213] handle_timed_event() end
[1434971936.977795] [001.0] [pid=26213] reschedule_event()
[1434971936.977804] [001.0] [pid=26213] add_event()
[1434971936.977869] [001.0] [pid=26213] handle_timed_event() start
[1434971936.977897] [001.0] [pid=26213] reap_check_results() start
[1434971936.977941] [001.0] [pid=26213] reap_check_results() end
[1434971936.977948] [001.0] [pid=26213] handle_timed_event() end
[1434971936.977955] [001.0] [pid=26213] reschedule_event()
[1434971936.977960] [001.0] [pid=26213] add_event()
[1434971937.225275] [001.0] [pid=26213] handle_timed_event() start
[1434971937.225365] [001.0] [pid=26213] run_scheduled_service_check() start
[1434971937.225379] [001.0] [pid=26213] run_async_service_check()
[1434971937.225385] [001.0] [pid=26213] check_service_check_viability()
[1434971937.225391] [001.0] [pid=26213] check_time_against_period()
[1434971937.225408] [001.0] [pid=26213] check_service_dependencies()
[1434971937.225495] [001.0] [pid=26213] clear_volatile_macros_r()
[1434971937.225521] [001.0] [pid=26213] get_raw_command_line_r()
[1434971937.225535] [001.0] [pid=26213] process_macros_r()
[1434971937.225548] [001.0] [pid=26213] process_macros_r()
[1434971937.225557] [001.0] [pid=26213] process_macros_r()
[1434971937.225565] [001.0] [pid=26213] process_macros_r()
[1434971937.225573] [001.0] [pid=26213] process_macros_r()
[1434971937.225579] [001.0] [pid=26213] process_macros_r()
[1434971937.225587] [001.0] [pid=26213] process_macros_r()
[1434971937.225595] [001.0] [pid=26213] process_macros_r()
[1434971937.225607] [001.0] [pid=26213] process_macros_r()
[1434972039.072937] [001.0] [pid=27415] initialize_downtime_data()
[1434972039.073105] [001.0] [pid=27415] xrddefault_read_state_information() start
[1434972039.073575] [001.0] [pid=27415] sort_downtime()
[1434972039.074934] [001.0] [pid=27415] schedule_new_event()
[1434972039.074953] [001.0] [pid=27415] add_event()
[1434972039.074960] [001.0] [pid=27415] schedule_new_event()
[1434972039.074968] [001.0] [pid=27415] add_event()
[1434972039.074975] [001.0] [pid=27415] init_timing_loop() start
[1434972039.074989] [001.0] [pid=27415] check_time_against_period()
[1434972039.075014] [001.0] [pid=27415] check_time_against_period()
[1434972039.075032] [001.0] [pid=27415] _get_matching_timerange()
[1434972039.075109] [001.0] [pid=27415] check_time_against_period()
[1434972039.075189] [001.0] [pid=27415] schedule_new_event()
[1434972039.075203] [001.0] [pid=27415] add_event()
[1434972039.075226] [001.0] [pid=27415] check_time_against_period()
[1434972039.075243] [001.0] [pid=27415] _get_matching_timerange()
[1434972039.075309] [001.0] [pid=27415] schedule_new_event()
[1434972039.075322] [001.0] [pid=27415] add_event()
[1434972039.075332] [001.0] [pid=27415] schedule_new_event()
[1434972039.075340] [001.0] [pid=27415] add_event()
[1434972039.075346] [001.0] [pid=27415] schedule_new_event()
[1434972039.075354] [001.0] [pid=27415] add_event()
[1434972039.075360] [001.0] [pid=27415] schedule_new_event()
[1434972039.075367] [001.0] [pid=27415] add_event()
[1434972039.075373] [001.0] [pid=27415] schedule_new_event()
[1434972039.075381] [001.0] [pid=27415] add_event()
[1434972039.075389] [001.0] [pid=27415] schedule_new_event()
[1434972039.075396] [001.0] [pid=27415] add_event()
[1434972039.075417] [001.0] [pid=27415] schedule_new_event()
[1434972039.075427] [001.0] [pid=27415] add_event()
[1434972039.075433] [001.0] [pid=27415] schedule_new_event()
[1434972039.075446] [001.0] [pid=27415] add_event()
[1434972039.075452] [001.0] [pid=27415] init_timing_loop() end
[1434972039.075464] [001.0] [pid=27415] schedule_new_event()
[1434972039.075472] [001.0] [pid=27415] add_event()
[1434972039.075488] [001.0] [pid=27415] save_status_data()
[1434972039.077720] [001.0] [pid=27427] clear_volatile_macros_r()
[1434972039.077744] [001.0] [pid=27415] event_execution_loop() start
[1434972039.077970] [001.0] [pid=27415] handle_timed_event() start
[1434972039.078009] [001.0] [pid=27415] run_scheduled_host_check()
[1434972039.078125] [001.0] [pid=27415] run_async_host_check(localhost ...)
[1434972039.078152] [001.0] [pid=27415] check_host_check_viability()
[1434972039.078160] [001.0] [pid=27415] check_time_against_period()
[1434972039.078195] [001.0] [pid=27415] _get_matching_timerange()
[1434972039.078213] [001.0] [pid=27415] check_host_dependencies()
[1434972039.078233] [001.0] [pid=27415] adjust_host_check_attempt()
[1434972039.078243] [001.0] [pid=27415] clear_volatile_macros_r()
[1434972039.078253] [001.0] [pid=27415] get_raw_command_line_r()
[1434972039.078350] [001.0] [pid=27415] get_next_valid_time()
[1434972039.078378] [001.0] [pid=27415] _get_matching_timerange()
[1434972039.078459] [001.0] [pid=27415] schedule_host_check()
[1434972039.078486] [001.0] [pid=27415] add_event()
[1434972039.078538] [001.0] [pid=27415] handle_timed_event() end
[1434972048.975304] [001.0] [pid=27415] handle_timed_event() start
[1434972048.975460] [001.0] [pid=27415] save_status_data()
[1434972048.977500] [001.0] [pid=27415] handle_timed_event() end
[1434972048.977517] [001.0] [pid=27415] reschedule_event()
[1434972048.977523] [001.0] [pid=27415] add_event()
[1434972048.977553] [001.0] [pid=27415] handle_timed_event() start
[1434972048.977590] [001.0] [pid=27415] reap_check_results() start
[1434972048.977641] [001.0] [pid=27415] reap_check_results() end
[1434972048.977648] [001.0] [pid=27415] handle_timed_event() end
[1434972048.977654] [001.0] [pid=27415] reschedule_event()
[1434972048.977660] [001.0] [pid=27415] add_event()
[1434972053.975226] [001.0] [pid=27415] handle_timed_event() start
[1434972053.975317] [001.0] [pid=27415] process_host_perfdata_file()
[1434972053.975326] [001.0] [pid=27415] get_raw_command_line_r()
[1434972053.975340] [001.0] [pid=27415] process_macros_r()
[1434972053.975455] [001.0] [pid=27415] my_system_r()
[1434972053.980637] [001.0] [pid=27415] clear_volatile_macros_r()
[1434972053.980984] [001.0] [pid=27415] handle_timed_event() end
[1434972053.981008] [001.0] [pid=27415] reschedule_event()
[1434972053.981015] [001.0] [pid=27415] add_event()
[1434972053.981056] [001.0] [pid=27415] handle_timed_event() start
[1434972053.981076] [001.0] [pid=27415] process_service_perfdata_file()
[1434972053.981086] [001.0] [pid=27415] get_raw_command_line_r()
[1434972053.981093] [001.0] [pid=27415] process_macros_r()
[1434972053.981119] [001.0] [pid=27415] my_system_r()
[1434972053.985377] [001.0] [pid=27415] clear_volatile_macros_r()
[1434972053.985398] [001.0] [pid=27415] handle_timed_event() end
[1434972053.985413] [001.0] [pid=27415] reschedule_event()
[1434972053.985419] [001.0] [pid=27415] add_event()
[1434972058.975172] [001.0] [pid=27415] handle_timed_event() start
[1434972058.975287] [001.0] [pid=27415] save_status_data()
[1434972058.977268] [001.0] [pid=27415] handle_timed_event() end
[1434972058.977286] [001.0] [pid=27415] reschedule_event()
[1434972058.977293] [001.0] [pid=27415] add_event()
[1434972058.977325] [001.0] [pid=27415] handle_timed_event() start
[1434972058.977351] [001.0] [pid=27415] reap_check_results() start
[1434972058.977397] [001.0] [pid=27415] reap_check_results() end
[1434972058.977405] [001.0] [pid=27415] handle_timed_event() end
[1434972058.977411] [001.0] [pid=27415] reschedule_event()
[1434972058.977416] [001.0] [pid=27415] add_event()
[1434972068.975818] [001.0] [pid=27415] handle_timed_event() start
[1434972068.975952] [001.0] [pid=27415] save_status_data()
[1434972068.977616] [001.0] [pid=27415] handle_timed_event() end
[1434972068.977633] [001.0] [pid=27415] reschedule_event()
[1434972068.977639] [001.0] [pid=27415] add_event()
[1434972068.977670] [001.0] [pid=27415] handle_timed_event() start
[1434972068.977723] [001.0] [pid=27415] reap_check_results() start
[1434972068.977769] [001.0] [pid=27415] reap_check_results() end
[1434972068.977777] [001.0] [pid=27415] handle_timed_event() end
[1434972068.977783] [001.0] [pid=27415] reschedule_event()
[1434972068.977789] [001.0] [pid=27415] add_event()
[1434972068.977810] [001.0] [pid=27415] handle_timed_event() start
[1434972068.977828] [001.0] [pid=27415] adjust_check_scheduling() start
[1434972068.977948] [001.0] [pid=27415] adjust_check_scheduling() end
[1434972068.977958] [001.0] [pid=27415] handle_timed_event() end
[1434972068.977964] [001.0] [pid=27415] reschedule_event()
[1434972068.977977] [001.0] [pid=27415] add_event()
[1434972068.978003] [001.0] [pid=27415] handle_timed_event() start
[1434972068.978021] [001.0] [pid=27415] process_service_perfdata_file()
[1434972068.978030] [001.0] [pid=27415] get_raw_command_line_r()
[1434972068.978038] [001.0] [pid=27415] process_macros_r()
[1434972068.978061] [001.0] [pid=27415] my_system_r()
[1434972068.982922] [001.0] [pid=27415] clear_volatile_macros_r()
[1434972068.982944] [001.0] [pid=27415] handle_timed_event() end
[1434972068.982961] [001.0] [pid=27415] reschedule_event()
[1434972068.982967] [001.0] [pid=27415] add_event()
[1434972068.983010] [001.0] [pid=27415] handle_timed_event() start
[1434972068.983030] [001.0] [pid=27415] process_host_perfdata_file()
[1434972068.983040] [001.0] [pid=27415] get_raw_command_line_r()
[1434972068.983047] [001.0] [pid=27415] process_macros_r()
[1434972068.983075] [001.0] [pid=27415] my_system_r()
[1434972068.986982] [001.0] [pid=27415] clear_volatile_macros_r()
[1434972068.987048] [001.0] [pid=27415] handle_timed_event() end
[1434972068.987080] [001.0] [pid=27415] reschedule_event()
[1434972068.987088] [001.0] [pid=27415] add_event()
[1434972078.975587] [001.0] [pid=27415] handle_timed_event() start
[1434972078.975804] [001.0] [pid=27415] save_status_data()
[1434972078.977710] [001.0] [pid=27415] handle_timed_event() end
[1434972078.977729] [001.0] [pid=27415] reschedule_event()
[1434972078.977736] [001.0] [pid=27415] add_event()
[1434972078.977767] [001.0] [pid=27415] handle_timed_event() start
[1434972078.977794] [001.0] [pid=27415] reap_check_results() start
[1434972078.977843] [001.0] [pid=27415] reap_check_results() end
[1434972078.977851] [001.0] [pid=27415] handle_timed_event() end
[1434972078.977857] [001.0] [pid=27415] reschedule_event()
[1434972078.977863] [001.0] [pid=27415] add_event()
[1434972079.225278] [001.0] [pid=27415] handle_timed_event() start
[1434972079.225327] [001.0] [pid=27415] run_scheduled_service_check() start
[1434972079.225341] [001.0] [pid=27415] run_async_service_check()
[1434972079.225348] [001.0] [pid=27415] check_service_check_viability()
[1434972079.225354] [001.0] [pid=27415] check_time_against_period()
[1434972079.225371] [001.0] [pid=27415] check_service_dependencies()
[1434972079.225466] [001.0] [pid=27415] clear_volatile_macros_r()
[1434972079.225481] [001.0] [pid=27415] get_raw_command_line_r()
[1434972079.225491] [001.0] [pid=27415] process_macros_r()
[1434972079.225503] [001.0] [pid=27415] process_macros_r()
[1434972079.225512] [001.0] [pid=27415] process_macros_r()
[1434972079.225521] [001.0] [pid=27415] process_macros_r()
[1434972079.225530] [001.0] [pid=27415] process_macros_r()
[1434972079.225537] [001.0] [pid=27415] process_macros_r()
[1434972079.225545] [001.0] [pid=27415] process_macros_r()
[1434972079.225554] [001.0] [pid=27415] process_macros_r()
[1434972079.225566] [001.0] [pid=27415] process_macros_r()
[1434972514.497921] [001.0] [pid=31323] initialize_downtime_data()
[1434972514.498072] [001.0] [pid=31323] xrddefault_read_state_information() start
[1434972514.498499] [001.0] [pid=31323] sort_downtime()
[1434972514.506438] [001.0] [pid=31323] schedule_new_event()
[1434972514.506463] [001.0] [pid=31323] add_event()
[1434972514.506471] [001.0] [pid=31323] schedule_new_event()
[1434972514.506479] [001.0] [pid=31323] add_event()
[1434972514.506486] [001.0] [pid=31323] init_timing_loop() start
[1434972514.506501] [001.0] [pid=31323] check_time_against_period()
[1434972514.506529] [001.0] [pid=31323] check_time_against_period()
[1434972514.506547] [001.0] [pid=31323] _get_matching_timerange()
[1434972514.506588] [001.0] [pid=31323] check_time_against_period()
[1434972514.506660] [001.0] [pid=31323] schedule_new_event()
[1434972514.506672] [001.0] [pid=31323] add_event()
[1434972514.506695] [001.0] [pid=31323] check_time_against_period()
[1434972514.506712] [001.0] [pid=31323] _get_matching_timerange()
[1434972514.506776] [001.0] [pid=31323] schedule_new_event()
[1434972514.506788] [001.0] [pid=31323] add_event()
[1434972514.506798] [001.0] [pid=31323] schedule_new_event()
[1434972514.506834] [001.0] [pid=31323] add_event()
[1434972514.506841] [001.0] [pid=31323] schedule_new_event()
[1434972514.506848] [001.0] [pid=31323] add_event()
[1434972514.506854] [001.0] [pid=31323] schedule_new_event()
[1434972514.506861] [001.0] [pid=31323] add_event()
[1434972514.506866] [001.0] [pid=31323] schedule_new_event()
[1434972514.506873] [001.0] [pid=31323] add_event()
[1434972514.506878] [001.0] [pid=31323] schedule_new_event()
[1434972514.506885] [001.0] [pid=31323] add_event()
[1434972514.506907] [001.0] [pid=31323] schedule_new_event()
[1434972514.506916] [001.0] [pid=31323] add_event()
[1434972514.506922] [001.0] [pid=31323] schedule_new_event()
[1434972514.506929] [001.0] [pid=31323] add_event()
[1434972514.506934] [001.0] [pid=31323] init_timing_loop() end
[1434972514.506947] [001.0] [pid=31323] schedule_new_event()
[1434972514.506955] [001.0] [pid=31323] add_event()
[1434972514.506977] [001.0] [pid=31323] save_status_data()
[1434972514.510660] [001.0] [pid=31323] event_execution_loop() start
[1434972514.510755] [001.0] [pid=31323] handle_timed_event() start
[1434972514.510791] [001.0] [pid=31323] run_scheduled_host_check()
[1434972514.510830] [001.0] [pid=31323] run_async_host_check(localhost ...)
[1434972514.510847] [001.0] [pid=31323] check_host_check_viability()
[1434972514.510854] [001.0] [pid=31323] check_time_against_period()
[1434972514.510876] [001.0] [pid=31323] _get_matching_timerange()
[1434972514.510893] [001.0] [pid=31323] check_host_dependencies()
[1434972514.510912] [001.0] [pid=31323] adjust_host_check_attempt()
[1434972514.510922] [001.0] [pid=31323] clear_volatile_macros_r()
[1434972514.510935] [001.0] [pid=31323] get_raw_command_line_r()
[1434972514.510989] [001.0] [pid=31323] get_next_valid_time()
[1434972514.511011] [001.0] [pid=31323] _get_matching_timerange()
[1434972514.511092] [001.0] [pid=31323] schedule_host_check()
[1434972514.511120] [001.0] [pid=31323] add_event()
[1434972514.511170] [001.0] [pid=31323] handle_timed_event() end
[1434972514.519368] [001.0] [pid=31334] clear_volatile_macros_r()
[1434972523.975494] [001.0] [pid=31323] handle_timed_event() start
[1434972523.975675] [001.0] [pid=31323] save_status_data()
[1434972523.978015] [001.0] [pid=31323] handle_timed_event() end
[1434972523.978032] [001.0] [pid=31323] reschedule_event()
[1434972523.978039] [001.0] [pid=31323] add_event()
[1434972523.978069] [001.0] [pid=31323] handle_timed_event() start
[1434972523.978119] [001.0] [pid=31323] reap_check_results() start
[1434972523.978177] [001.0] [pid=31323] reap_check_results() end
[1434972523.978184] [001.0] [pid=31323] handle_timed_event() end
[1434972523.978190] [001.0] [pid=31323] reschedule_event()
[1434972523.978196] [001.0] [pid=31323] add_event()
[1434972528.975315] [001.0] [pid=31323] handle_timed_event() start
[1434972528.975408] [001.0] [pid=31323] process_host_perfdata_file()
[1434972528.975420] [001.0] [pid=31323] get_raw_command_line_r()
[1434972528.975435] [001.0] [pid=31323] process_macros_r()
[1434972528.975585] [001.0] [pid=31323] my_system_r()
[1434972528.980555] [001.0] [pid=31323] clear_volatile_macros_r()
[1434972528.980886] [001.0] [pid=31323] handle_timed_event() end
[1434972528.980916] [001.0] [pid=31323] reschedule_event()
[1434972528.980925] [001.0] [pid=31323] add_event()
[1434972528.980963] [001.0] [pid=31323] handle_timed_event() start
[1434972528.980984] [001.0] [pid=31323] process_service_perfdata_file()
[1434972528.980994] [001.0] [pid=31323] get_raw_command_line_r()
[1434972528.981003] [001.0] [pid=31323] process_macros_r()
[1434972528.981027] [001.0] [pid=31323] my_system_r()
[1434972528.985064] [001.0] [pid=31323] clear_volatile_macros_r()
[1434972528.985088] [001.0] [pid=31323] handle_timed_event() end
[1434972528.985105] [001.0] [pid=31323] reschedule_event()
[1434972528.985112] [001.0] [pid=31323] add_event()
[1434972533.675718] [001.0] [pid=31535] clear_volatile_macros_r()
[1434972533.974850] [001.0] [pid=31323] handle_timed_event() start
[1434972533.974933] [001.0] [pid=31323] save_status_data()
[1434972533.977009] [001.0] [pid=31323] handle_timed_event() end
[1434972533.977026] [001.0] [pid=31323] reschedule_event()
[1434972533.977033] [001.0] [pid=31323] add_event()
[1434972533.977063] [001.0] [pid=31323] handle_timed_event() start
[1434972533.977092] [001.0] [pid=31323] reap_check_results() start
[1434972533.977135] [001.0] [pid=31323] reap_check_results() end
[1434972533.977142] [001.0] [pid=31323] handle_timed_event() end
[1434972533.977149] [001.0] [pid=31323] reschedule_event()
[1434972533.977154] [001.0] [pid=31323] add_event()
[1434972543.975545] [001.0] [pid=31323] handle_timed_event() start
[1434972543.975681] [001.0] [pid=31323] save_status_data()
[1434972543.977404] [001.0] [pid=31323] handle_timed_event() end
[1434972543.977420] [001.0] [pid=31323] reschedule_event()
[1434972543.977426] [001.0] [pid=31323] add_event()
[1434972543.977455] [001.0] [pid=31323] handle_timed_event() start
[1434972543.977478] [001.0] [pid=31323] reap_check_results() start
[1434972543.977519] [001.0] [pid=31323] reap_check_results() end
[1434972543.977526] [001.0] [pid=31323] handle_timed_event() end
[1434972543.977532] [001.0] [pid=31323] reschedule_event()
[1434972543.977537] [001.0] [pid=31323] add_event()
[1434972543.977557] [001.0] [pid=31323] handle_timed_event() start
[1434972543.977575] [001.0] [pid=31323] adjust_check_scheduling() start
[1434972543.977713] [001.0] [pid=31323] adjust_check_scheduling() end
[1434972543.977724] [001.0] [pid=31323] handle_timed_event() end
[1434972543.977731] [001.0] [pid=31323] reschedule_event()
[1434972543.977736] [001.0] [pid=31323] add_event()
[1434972543.977762] [001.0] [pid=31323] handle_timed_event() start
[1434972543.977780] [001.0] [pid=31323] process_service_perfdata_file()
[1434972543.977789] [001.0] [pid=31323] get_raw_command_line_r()
[1434972543.977797] [001.0] [pid=31323] process_macros_r()
[1434972543.977819] [001.0] [pid=31323] my_system_r()
[1434972543.982445] [001.0] [pid=31323] clear_volatile_macros_r()
[1434972543.982466] [001.0] [pid=31323] handle_timed_event() end
[1434972543.982481] [001.0] [pid=31323] reschedule_event()
[1434972543.982489] [001.0] [pid=31323] add_event()
[1434972543.982524] [001.0] [pid=31323] handle_timed_event() start
[1434972543.982545] [001.0] [pid=31323] process_host_perfdata_file()
[1434972543.982554] [001.0] [pid=31323] get_raw_command_line_r()
[1434972543.982563] [001.0] [pid=31323] process_macros_r()
[1434972543.982591] [001.0] [pid=31323] my_system_r()
[1434972543.986511] [001.0] [pid=31323] clear_volatile_macros_r()
[1434972543.986577] [001.0] [pid=31323] handle_timed_event() end
[1434972543.986596] [001.0] [pid=31323] reschedule_event()
[1434972543.986603] [001.0] [pid=31323] add_event()
[1434972553.975181] [001.0] [pid=31323] handle_timed_event() start
[1434972553.975288] [001.0] [pid=31323] save_status_data()
[1434972553.977181] [001.0] [pid=31323] handle_timed_event() end
[1434972553.977198] [001.0] [pid=31323] reschedule_event()
[1434972553.977206] [001.0] [pid=31323] add_event()
[1434972553.977235] [001.0] [pid=31323] handle_timed_event() start
[1434972553.977264] [001.0] [pid=31323] reap_check_results() start
[1434972553.977324] [001.0] [pid=31323] reap_check_results() end
[1434972553.977333] [001.0] [pid=31323] handle_timed_event() end
[1434972553.977341] [001.0] [pid=31323] reschedule_event()
[1434972553.977347] [001.0] [pid=31323] add_event()
[1434972554.224745] [001.0] [pid=31323] handle_timed_event() start
[1434972554.224829] [001.0] [pid=31323] run_scheduled_service_check() start
[1434972554.224860] [001.0] [pid=31323] run_async_service_check()
[1434972554.224867] [001.0] [pid=31323] check_service_check_viability()
[1434972554.224874] [001.0] [pid=31323] check_time_against_period()
[1434972554.224894] [001.0] [pid=31323] check_service_dependencies()
[1434972554.225044] [001.0] [pid=31323] clear_volatile_macros_r()
[1434972554.225060] [001.0] [pid=31323] get_raw_command_line_r()
[1434972554.225071] [001.0] [pid=31323] process_macros_r()
[1434972554.225085] [001.0] [pid=31323] process_macros_r()
[1434972554.225110] [001.0] [pid=31323] process_macros_r()
[1434972554.225121] [001.0] [pid=31323] process_macros_r()
[1434972554.225130] [001.0] [pid=31323] process_macros_r()
[1434972554.225138] [001.0] [pid=31323] process_macros_r()
[1434972554.225145] [001.0] [pid=31323] process_macros_r()
[1434972554.225155] [001.0] [pid=31323] process_macros_r()
[1434972554.225168] [001.0] [pid=31323] process_macros_r()


I have attached a screenshot of the system status screen from XI.

I tried verifying the configs; the check returns 0 errors and 0 warnings. On top of this, I removed all the services, hosts, templates, and any other such config data, and created only one host and one service under the CCM to apply the configuration.

Let me know if you require anything else on this.
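
In the debug output above, each of the three runs (pids 26213, 27415 and 31323) stops logging after a series of process_macros_r() calls inside run_async_service_check(). A minimal sketch for pulling that out of a longer debug file, assuming every entry follows the layout shown above (bracketed timestamp, a second bracketed field, [pid=N], then the function name). The function name last_call_per_pid and the sample lines are illustrative only and are not part of Nagios:

```python
import re

# Matches entries such as:
# [1434971937.225607] [001.0] [pid=26213] process_macros_r()
LINE_RE = re.compile(r"^\[(\d+\.\d+)\] \[[\d.]+\] \[pid=(\d+)\] (.+)$")


def last_call_per_pid(lines):
    """Return {pid: (timestamp, message)} with the final entry logged by each pid."""
    last = {}
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # skip blank lines and anything that is not a debug entry
        timestamp, pid, message = match.groups()
        # Later entries overwrite earlier ones, so only the last survives.
        last[pid] = (timestamp, message)
    return last


# A few lines copied from the log above, used here as sample input.
sample = """\
[1434971937.225595] [001.0] [pid=26213] process_macros_r()
[1434971937.225607] [001.0] [pid=26213] process_macros_r()
[1434972039.072937] [001.0] [pid=27415] initialize_downtime_data()
""".splitlines()

for pid, (timestamp, message) in last_call_per_pid(sample).items():
    print(f"pid={pid} last entry at {timestamp}: {message}")
```

To run it against the real file, pass an open file handle instead of the sample list, for example open("/usr/local/nagios/var/nagios.debug"), which is the debug_file path from the nagios.cfg in this thread. The last entry per pid only shows where logging stopped; it does not by itself say why the process exited.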
maddev
Posts: 54
Joined: Tue Apr 07, 2015 5:42 am


Post by maddev »

Here is a copy of my nagios.cfg in case you need it.


# MODIFIED
admin_email=root@localhost
admin_pager=root@localhost
translate_passive_host_checks=1
log_event_handlers=0
use_large_installation_tweaks=1
enable_environment_macros=0


# NDOUtils module
broker_module=/usr/local/nagios/bin/ndomod.o config_file=/usr/local/nagios/etc/ndomod.cfg


# PNP settings - bulk mode with NCPD
process_performance_data=1
# service performance data
service_perfdata_file=/usr/local/nagios/var/service-perfdata
service_perfdata_file_template=DATATYPE::SERVICEPERFDATA\tTIMET::$TIMET$\tHOSTNAME::$HOSTNAME$\tSERVICEDESC::$SERVICEDESC$\tSERVICEPERFDATA::$SERVICEPERFDATA$\tSERVICECHECKCOMMAND::$SERVICECHECKCOMMAND$\tHOSTSTATE::$HOSTSTATE$\tHOSTSTATETYPE::$HOSTSTATETYPE$\tSERVICESTATE::$SERVICESTATE$\tSERVICESTATETYPE::$SERVICESTATETYPE$\tSERVICEOUTPUT::$SERVICEOUTPUT$
service_perfdata_file_mode=a
service_perfdata_file_processing_interval=15
service_perfdata_file_processing_command=process-service-perfdata-file-bulk
# host performance data
host_perfdata_file=/usr/local/nagios/var/host-perfdata
host_perfdata_file_template=DATATYPE::HOSTPERFDATA\tTIMET::$TIMET$\tHOSTNAME::$HOSTNAME$\tHOSTPERFDATA::$HOSTPERFDATA$\tHOSTCHECKCOMMAND::$HOSTCHECKCOMMAND$\tHOSTSTATE::$HOSTSTATE$\tHOSTSTATETYPE::$HOSTSTATETYPE$\tHOSTOUTPUT::$HOSTOUTPUT$
host_perfdata_file_mode=a
host_perfdata_file_processing_interval=15
host_perfdata_file_processing_command=process-host-perfdata-file-bulk


# OBJECTS - UNMODIFIED
#cfg_file=/usr/local/nagios/etc/objects/commands.cfg
#cfg_file=/usr/local/nagios/etc/objects/contacts.cfg
#cfg_file=/usr/local/nagios/etc/objects/localhost.cfg
#cfg_file=/usr/local/nagios/etc/objects/templates.cfg
#cfg_file=/usr/local/nagios/etc/objects/timeperiods.cfg


# STATIC OBJECT DEFINITIONS (THESE DON'T GET EXPORTED/IMPORTED BY NAGIOSQL)
cfg_dir=/usr/local/nagios/etc/static

# OBJECTS EXPORTED FROM NAGIOSQL
cfg_file=/usr/local/nagios/etc/contacttemplates.cfg
cfg_file=/usr/local/nagios/etc/contactgroups.cfg
cfg_file=/usr/local/nagios/etc/contacts.cfg
cfg_file=/usr/local/nagios/etc/timeperiods.cfg
cfg_file=/usr/local/nagios/etc/commands.cfg
cfg_file=/usr/local/nagios/etc/hostgroups.cfg
cfg_file=/usr/local/nagios/etc/servicegroups.cfg
cfg_file=/usr/local/nagios/etc/hosttemplates.cfg
cfg_file=/usr/local/nagios/etc/servicetemplates.cfg
cfg_file=/usr/local/nagios/etc/servicedependencies.cfg
cfg_file=/usr/local/nagios/etc/serviceescalations.cfg
cfg_file=/usr/local/nagios/etc/hostdependencies.cfg
cfg_file=/usr/local/nagios/etc/hostescalations.cfg
cfg_file=/usr/local/nagios/etc/hostextinfo.cfg
cfg_file=/usr/local/nagios/etc/serviceextinfo.cfg
cfg_dir=/usr/local/nagios/etc/hosts
cfg_dir=/usr/local/nagios/etc/services

# GLOBAL EVENT HANDLERS
global_host_event_handler=xi_host_event_handler
global_service_event_handler=xi_service_event_handler



# UNMODIFIED
accept_passive_host_checks=1
accept_passive_service_checks=1
additional_freshness_latency=15
auto_reschedule_checks=1
auto_rescheduling_interval=30
auto_rescheduling_window=45
bare_update_check=0
cached_host_check_horizon=15
cached_service_check_horizon=15
check_external_commands=1
check_for_orphaned_hosts=1
check_for_orphaned_services=1
check_for_updates=1
check_host_freshness=0
check_result_path=/usr/local/nagios/var/spool/checkresults
check_result_reaper_frequency=10
check_service_freshness=1
command_file=/usr/local/nagios/var/rw/nagios.cmd
daemon_dumps_core=0
date_format=us
debug_file=/usr/local/nagios/var/nagios.debug
debug_level=0
debug_verbosity=1
enable_event_handlers=1
enable_flap_detection=1
enable_notifications=1
enable_predictive_host_dependency_checks=1
enable_predictive_service_dependency_checks=1
event_broker_options=-1
event_handler_timeout=30
execute_host_checks=1
execute_service_checks=1
high_host_flap_threshold=20.0
high_service_flap_threshold=20.0
host_check_timeout=30
host_freshness_check_interval=60
host_inter_check_delay_method=s
illegal_macro_output_chars=`~$&|'"<>
illegal_object_name_chars=`~!$%^&*|'"<>?,()=
interval_length=60
lock_file=/usr/local/nagios/var/nagios.lock
log_archive_path=/usr/local/nagios/var/archives
log_external_commands=0
log_file=/usr/local/nagios/var/nagios.log
log_host_retries=1
log_initial_states=0
log_notifications=1
log_passive_checks=0
log_rotation_method=d
log_service_retries=1
low_host_flap_threshold=5.0
low_service_flap_threshold=5.0
max_check_result_file_age=3600
max_check_result_reaper_time=30
max_concurrent_checks=0
max_debug_file_size=1000000
max_host_check_spread=30
max_service_check_spread=30
nagios_group=nagios
nagios_user=nagios
notification_timeout=30
object_cache_file=/usr/local/nagios/var/objects.cache
obsess_over_hosts=0
obsess_over_services=0
ocsp_timeout=5
passive_host_checks_are_soft=0
perfdata_timeout=5
precached_object_file=/usr/local/nagios/var/objects.precache
resource_file=/usr/local/nagios/etc/resource.cfg
retained_contact_host_attribute_mask=0
retained_contact_service_attribute_mask=0
retained_host_attribute_mask=0
retained_process_host_attribute_mask=0
retained_process_service_attribute_mask=0
retained_service_attribute_mask=0
retain_state_information=1
retention_update_interval=60
service_check_timeout=60
service_freshness_check_interval=60
service_inter_check_delay_method=s
service_interleave_factor=s
soft_state_dependencies=0
state_retention_file=/usr/local/nagios/var/retention.dat
status_file=/usr/local/nagios/var/status.dat
status_update_interval=10
temp_file=/usr/local/nagios/var/nagios.tmp
temp_path=/tmp
use_aggressive_host_checking=0
use_regexp_matching=0
use_retained_program_state=1
use_retained_scheduling_info=1
use_syslog=1
use_true_regexp_matching=0

broker_module=/usr/lib64/mod_gearman/mod_gearman.o config=/etc/mod_gearman/mod_gearman_neb.conf eventhandler=no


tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Nagios process not running on server ...

Post by tgriep »

Can you check the /var/log/messages log file for any errors?

Can you comment out the following line in the nagios.cfg file to rule out any Mod Gearman issues?

Code: Select all

broker_module=/usr/lib64/mod_gearman/mod_gearman.o config=/etc/mod_gearman/mod_gearman_neb.conf eventhandler=no
Then try starting the nagios process and see whether it keeps running.
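
A minimal sketch of that step, assuming GNU or BSD sed and the nagios.cfg location used elsewhere in this thread. The function name is illustrative, and the .bak copy is kept so the line can be restored later:

```shell
# Sketch: comment out the mod_gearman broker_module line, keeping a backup.
# The default config path is an assumption based on the paths in this thread.
disable_gearman_broker() {
    local cfg="${1:-/usr/local/nagios/etc/nagios.cfg}"

    # Only an active broker_module line that loads mod_gearman.o is changed;
    # other broker modules (for example ndomod.o) are left alone.
    # The original file is saved next to it with a .bak suffix.
    sed -i.bak '/^broker_module=.*mod_gearman\.o/ s/^/#/' "$cfg"
}
```

After running it, restart nagios and confirm in /var/log/messages that mod_gearman is no longer listed among the initialized event broker modules.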
Be sure to check out our Knowledgebase for helpful articles and solutions!
jyoti22
Posts: 254
Joined: Mon Mar 23, 2015 4:50 am

Re: Nagios process not running on server ...

Post by jyoti22 »

Please find the /var/log/messages file attached. I don't see any errors in it.
Regarding nagios.cfg, I have commented out the mod_gearman line and restarted nagios. The monitoring engine started after the nagios process started, but within a few minutes it went down again.

Code: Select all

Jun 22 13:37:14 nagiosxi nagios: LOG VERSION: 2.0
Jun 22 13:37:14 nagiosxi nagios: qh: Socket '/usr/local/nagios/var/rw/nagios.qh' successfully initialized
Jun 22 13:37:14 nagiosxi nagios: qh: core query handler registered
Jun 22 13:37:14 nagiosxi nagios: nerd: Channel hostchecks registered successfully
Jun 22 13:37:14 nagiosxi nagios: nerd: Channel servicechecks registered successfully
Jun 22 13:37:14 nagiosxi nagios: nerd: Channel opathchecks registered successfully
Jun 22 13:37:14 nagiosxi nagios: nerd: Fully initialized and ready to rock!
Jun 22 13:37:14 nagiosxi nagios: wproc: Successfully registered manager as @wproc with query handler
Jun 22 13:37:14 nagiosxi nagios: wproc: Registry request: name=Core Worker 1416;pid=1416
Jun 22 13:37:14 nagiosxi nagios: wproc: Registry request: name=Core Worker 1417;pid=1417
Jun 22 13:37:14 nagiosxi nagios: wproc: Registry request: name=Core Worker 1420;pid=1420
Jun 22 13:37:14 nagiosxi nagios: wproc: Registry request: name=Core Worker 1418;pid=1418
Jun 22 13:37:14 nagiosxi nagios: wproc: Registry request: name=Core Worker 1419;pid=1419
Jun 22 13:37:14 nagiosxi nagios: wproc: Registry request: name=Core Worker 1421;pid=1421
Jun 22 13:37:14 nagiosxi nagios: mod_gearman: initialized version 1.5.0b1 (libgearman 1.1.8)
Jun 22 13:37:14 nagiosxi nagios: Event broker module '/usr/lib64/mod_gearman/mod_gearman.o' initialized successfully.
Jun 22 13:37:14 nagiosxi nagios: ndomod: NDOMOD 2.0.0 (02-28-2014) Copyright (c) 2009 Nagios Core Development Team and Community Contributors
Jun 22 13:37:14 nagiosxi nagios: ndomod: Could not open data sink!  I'll keep trying, but some output may get lost...
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for process data
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for log data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for system command data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for event handler data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for notification data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for comment data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for downtime data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for flapping data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for program status data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for host status data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for service status data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for adaptive program data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for adaptive host data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for adaptive service data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for external command data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for aggregated status data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for retention data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for contact data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for contact notification data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for acknowledgement data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for state change data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for contact status data'
Jun 22 13:37:14 nagiosxi nagios: ndomod registered for adaptive contact data'
Jun 22 13:37:14 nagiosxi nagios: Event broker module '/usr/local/nagios/bin/ndomod.o' initialized successfully.
Jun 22 13:37:14 nagiosxi nagios: Warning: Service 'local' on host 'localhost' has no default contacts or contactgroups defined!
Jun 22 13:37:14 nagiosxi nagios: Warning: Service 'local' on host 'localhost' has no check time period defined!
Jun 22 13:37:14 nagiosxi nagios: Warning: Service 'local' on host 'localhost' has no notification time period defined!
Jun 22 13:37:14 nagiosxi nagios: Warning: Host 'localhost' has no default contacts or contactgroups defined!
Jun 22 13:37:14 nagiosxi nagios: Successfully launched command file worker with pid 1422
Jun 22 13:43:34 nagiosxi nagios: Nagios 4.0.8 starting... (PID=2469)
Jun 22 13:43:34 nagiosxi nagios: Local time is Mon Jun 22 13:43:34 CDT 2015
Jun 22 13:43:34 nagiosxi nagios: LOG VERSION: 2.0
Jun 22 13:43:34 nagiosxi nagios: qh: Socket '/usr/local/nagios/var/rw/nagios.qh' successfully initialized
Jun 22 13:43:34 nagiosxi nagios: qh: core query handler registered
Jun 22 13:43:34 nagiosxi nagios: nerd: Channel hostchecks registered successfully
Jun 22 13:43:34 nagiosxi nagios: nerd: Channel servicechecks registered successfully
Jun 22 13:43:34 nagiosxi nagios: nerd: Channel opathchecks registered successfully
Jun 22 13:43:34 nagiosxi nagios: nerd: Fully initialized and ready to rock!
Jun 22 13:43:34 nagiosxi nagios: wproc: Successfully registered manager as @wproc with query handler
Jun 22 13:43:34 nagiosxi nagios: wproc: Registry request: name=Core Worker 2471;pid=2471
Jun 22 13:43:34 nagiosxi nagios: wproc: Registry request: name=Core Worker 2472;pid=2472
Jun 22 13:43:34 nagiosxi nagios: wproc: Registry request: name=Core Worker 2474;pid=2474
Jun 22 13:43:34 nagiosxi nagios: wproc: Registry request: name=Core Worker 2473;pid=2473
Jun 22 13:43:34 nagiosxi nagios: wproc: Registry request: name=Core Worker 2475;pid=2475
Jun 22 13:43:34 nagiosxi nagios: wproc: Registry request: name=Core Worker 2476;pid=2476
Jun 22 13:43:34 nagiosxi nagios: mod_gearman: initialized version 1.5.0b1 (libgearman 1.1.8)
Jun 22 13:43:34 nagiosxi nagios: Event broker module '/usr/lib64/mod_gearman/mod_gearman.o' initialized successfully.
Jun 22 13:43:34 nagiosxi nagios: ndomod: NDOMOD 2.0.0 (02-28-2014) Copyright (c) 2009 Nagios Core Development Team and Community Contributors
Jun 22 13:43:34 nagiosxi nagios: ndomod: Successfully connected to data sink.  0 queued items to flush.
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for process data
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for log data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for system command data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for event handler data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for notification data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for comment data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for downtime data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for flapping data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for program status data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for host status data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for service status data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for adaptive program data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for adaptive host data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for adaptive service data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for external command data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for aggregated status data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for retention data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for contact data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for contact notification data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for acknowledgement data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for state change data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for contact status data'
Jun 22 13:43:34 nagiosxi nagios: ndomod registered for adaptive contact data'
Jun 22 13:43:34 nagiosxi nagios: Event broker module '/usr/local/nagios/bin/ndomod.o' initialized successfully.
Jun 22 13:43:34 nagiosxi nagios: Warning: Service 'local' on host 'localhost' has no default contacts or contactgroups defined!
Jun 22 13:43:34 nagiosxi nagios: Warning: Service 'local' on host 'localhost' has no check time period defined!
Jun 22 13:43:34 nagiosxi nagios: Warning: Service 'local' on host 'localhost' has no notification time period defined!
Jun 22 13:43:34 nagiosxi nagios: Warning: Host 'localhost' has no default contacts or contactgroups defined!
Jun 22 13:43:34 nagiosxi nagios: Successfully launched command file worker with pid 2481
Jun 22 22:08:02 nagiosxi auditd[1329]: Audit daemon rotating log files
avijit_bhardwaj
Posts: 17
Joined: Wed Feb 25, 2015 11:56 pm

Re: Nagios process not running on server ...

Post by avijit_bhardwaj »

Hi tgriep,

After commenting out the gearman line in the cfg file, it worked absolutely fine. Thanks for your help. However, we are still wondering what caused gearman to stop the nagios daemon. We have been running it for a few months and did not face any issues until now. Could you suggest where we can look for the root cause?
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Nagios process not running on server ...

Post by tgriep »

One step at a time 8)

The gearman logs are in this folder:
/var/log/mod_gearman

The gearman server log file is called mod_gearman_neb.log and the worker's log file is named mod_gearman_worker.log.

Take a look at them and see whether they contain any errors that could help debug the issue.
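
One way to skim those files is to group the error lines so that repeats stand out. A minimal sketch, assuming each log line has the form [timestamp][pid][LEVEL] message (that layout is an assumption about the log format, and the function name is illustrative):

```shell
# Sketch: count distinct [ERROR] messages in a mod_gearman log.
# The timestamp and pid prefixes are stripped so repeats group together.
summarize_gearman_errors() {
    local log="${1:-/var/log/mod_gearman/mod_gearman_neb.log}"

    grep -F '[ERROR]' "$log" \
        | sed -E 's/^\[[^]]*\]\[[0-9]+\]//' \
        | sort | uniq -c | sort -rn
}
```

Run it once per file, for example with /var/log/mod_gearman/mod_gearman_worker.log as the argument. The most frequent message is printed first.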
jyoti22
Posts: 254
Joined: Mon Mar 23, 2015 4:50 am

Re: Nagios process not running on server ...

Post by jyoti22 »

Hi tgriep,
I can see the errors below in mod_gearman_neb.log. mod_gearman_worker.log does not contain any entries and has not been updated since 16 June.

Code: Select all

[2015-06-23 03:13:18][24665][ERROR] Raw check command for host 'localhost' was NULL - aborting.
[2015-06-23 03:14:12][24768][ERROR] sending job to gearmand failed: flush(GEARMAN_COULD_NOT_CONNECT) localhost:4730 -> libgearman/connection.cc:745
[2015-06-23 04:44:59][15542][ERROR] sending job to gearmand failed: flush(GEARMAN_COULD_NOT_CONNECT) localhost:4730 -> libgearman/connection.cc:745

Code: Select all

[root@nagiosxi mod_gearman]# ls -l *.log
-rw-r--r--. 1 nagios nagios 15683 Jun 23 04:44 mod_gearman_neb.log
-rw-r--r--  1 nagios users      0 Jun 14 03:18 mod_gearman_worker.log
[root@nagiosxi mod_gearman]#
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Nagios process not running on server ...

Post by abrist »

Is gearmand running?

Code: Select all

service gearmand status
If not, start it:

Code: Select all

service gearmand start
Also, was XI updated after you resolved your license issue?
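
Since the mod_gearman_neb.log errors above report GEARMAN_COULD_NOT_CONNECT for localhost:4730, it may also be worth confirming that something actually accepts connections on that port, not only that the init script reports a pid. A minimal sketch, assuming bash (it relies on bash's /dev/tcp redirection); the function name is illustrative:

```shell
# Sketch: report whether anything accepts TCP connections on host:port.
# Relies on bash's /dev/tcp redirection, so run it with bash, not plain sh.
gearmand_port_open() {
    local host="${1:-localhost}" port="${2:-4730}"

    # The subshell opens and immediately closes the socket; a refused
    # connection makes the redirection fail with a non-zero status.
    if (exec 3<>"/dev/tcp/${host}/${port}") 2>/dev/null; then
        echo "open: ${host}:${port}"
    else
        echo "closed: ${host}:${port}"
        return 1
    fi
}
```

With no arguments it probes localhost:4730, the address named in the error. If the port reports closed while the service status says gearmand is running, the daemon may be bound to a different address or port than the one the NEB module is configured to use.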
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
jyoti22
Posts: 254
Joined: Mon Mar 23, 2015 4:50 am

Re: Nagios process not running on server ...

Post by jyoti22 »

Regarding your first question, the gearmand process is running:

Code: Select all

[root@nagiosxi ~]# service gearmand status
gearmand (pid  9118) is running...
[root@nagiosxi ~]# 
After extending the license, when nagios was still not running, we tried upgrading XI to resolve the issue.