Passive checks work once then timeout for the next 2 minutes

Postby octom » Tue Jan 09, 2018 3:59 pm

This is happening with calls to both send_nsca and send_nrdp.php.

I can send one passive alert from a remote host and it works fine.

Any other sends in the next two minutes will time out.

After two minutes a send will succeed again, but then another two minutes must pass before another send will work.

This seems to affect messages from any remote hosts. Once a passive alert is processed from one host, no other remote hosts can send alerts for two minutes.

I know I have a lot more investigating to do on my own, including upgrading core from a 4.0 version, but I was wondering if anyone had seen this before and could provide some direction for diagnosis.

One potential source of the problem is that the monitoring server runs on a VM, and the hypervisor uses iptables to forward traffic to it on port 5667 for NSCA and port 80 for NRDP.

Does this symptom sound familiar to anyone?
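
A minimal sketch of how the forwarding path described above could be inspected on the hypervisor. The `run` and `inspect_forward` helpers are hypothetical names used only for illustration, the sketch assumes the forwarding is done with ordinary iptables NAT rules, and the last command assumes the conntrack-tools package is installed. By default it only prints the commands rather than running them.

```shell
#!/usr/bin/env bash
# Sketch only: prints the inspection commands by default (DRY_RUN=1).
# Set DRY_RUN=0 and run as root on the hypervisor to execute them.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# List the NAT and FORWARD rules, then the tracked connections for one
# forwarded port (5667 = NSCA, 80 = NRDP in the setup described above).
inspect_forward() {
    local port=${1:-5667}
    run iptables -t nat -L PREROUTING -n -v --line-numbers
    run iptables -L FORWARD -n -v --line-numbers
    # Assumption: conntrack-tools is installed on the hypervisor.
    run conntrack -L -p tcp --dport "$port"
}
```

Calling `inspect_forward 5667` prints the three commands. Running `DRY_RUN=0 inspect_forward 80` as root would execute them for the NRDP port instead.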
octom
 
Posts: 4
Joined: Mon May 19, 2014 11:43 am

Re: Passive checks work once then timeout for the next 2 min

Postby dwhitfield » Tue Jan 09, 2018 5:54 pm

Do you see errors anywhere? It's pretty much the end of our work day, so we'll take a look at any errors tomorrow. In the meantime, take a look at https://support.nagios.com/kb/article.php?id=480 for an idea of some places where you might see errors (and a potential fix).
Be sure to check out our Knowledgebase for helpful articles and solutions!
dwhitfield
The Doctor
 
Posts: 4307
Joined: Wed Sep 21, 2016 10:29 am
Location: Nagios Enterprises, LLC

Re: Passive checks work once then timeout for the next 2 min

Postby octom » Wed Jan 10, 2018 9:08 am

No errors. The first alert shows up in the correct logs, but subsequent alerts don't show up anywhere for two minutes. It is possible that the Nagios VM never sees alerts sent in the two-minute window following a successful one; I will run some packet captures to confirm or refute that theory. If that is the case, I have no idea what would cause it, so my only question at this point is whether anyone has encountered this anomaly and, if so, what was done to address it.
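
One way to put numbers on the gap between logged alerts is to measure the time between consecutive passive check entries in the Nagios log. The following is a rough sketch, not a tested tool: `passive_gaps` is a hypothetical helper name, and it assumes log lines of the form `[epoch] PASSIVE SERVICE CHECK: ...` (or `PASSIVE HOST CHECK`), which Nagios Core writes when log_passive_checks=1.

```shell
#!/usr/bin/env bash
# Print the gap in seconds between consecutive passive check results
# found in a Nagios log (read from the named files, or stdin if none).
# Assumed line format: [1515600000] PASSIVE SERVICE CHECK: host;service;0;output
passive_gaps() {
    awk '
        /PASSIVE (SERVICE|HOST) CHECK:/ {
            # $1 looks like "[1515600000]"; strip the surrounding brackets.
            ts = substr($1, 2, length($1) - 2)
            if (prev != "") print ts - prev
            prev = ts
        }
    ' "$@"
}
```

For example, `passive_gaps /var/log/nagios/nagios.log` (the path is the log_file value from the nagios.cfg posted later in this thread) prints one number per pair of consecutive passive results. Gaps that cluster around 120 while senders are retrying more often would match the symptom described here.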
octom
 
Posts: 4
Joined: Mon May 19, 2014 11:43 am

Re: Passive checks work once then timeout for the next 2 min

Postby tgriep » Wed Jan 10, 2018 4:48 pm

Do you see the data show up in the check results folder on the Nagios server from the remote servers during the 2 minutes when nothing is displayed in the GUI?
Can you post your nagios.cfg file so we can view its settings?
Also, run the following command as root on the Nagios server and post the output.
Code:
ps -ef --cols=300


Thanks
tgriep
Madmin
 
Posts: 6349
Joined: Thu Oct 30, 2014 9:02 am

Re: Passive checks work once then timeout for the next 2 min

Postby octom » Thu Jan 11, 2018 5:47 pm

I ran a tcpdump, and for NSCA nothing gets through to port 5667 during the two-minute window. So I am assuming this is a networking and/or VM issue.

But to answer your questions, nothing shows up in the check logs during the dead two minutes.

Here is my nagios.cfg:
Code:
[nagios@xxxxxxxxx oc_objects]$ cat ~/etc/nagios.cfg | grep -v '^#\|^$'
log_file=/var/log/nagios/nagios.log
cfg_file=/usr/local/nagios/etc/objects/oc_objects/commands.cfg
cfg_file=/usr/local/nagios/etc/objects/oc_objects/notify_commands.cfg
cfg_file=/usr/local/nagios/etc/objects/oc_objects/contacts.cfg
cfg_file=/usr/local/nagios/etc/objects/oc_objects/timeperiods.cfg
cfg_file=/usr/local/nagios/etc/objects/oc_objects/host-templates.cfg
cfg_file=/usr/local/nagios/etc/objects/oc_objects/oc_servicegroups.cfg
cfg_file=/usr/local/nagios/etc/objects/oc_objects/oc_hostgroups.cfg
cfg_file=/usr/local/nagios/etc/objects/oc_objects/oc_commands.cfg
cfg_file=/usr/local/nagios/etc/objects/oc_objects/localhost.cfg
cfg_file=/usr/local/nagios/etc/objects/oc_objects/production-exceptions.cfg
cfg_file=/usr/local/nagios/etc/objects/BAS-OC-hosts_ss.cfg
cfg_file=/usr/local/nagios/etc/objects/CH2-nOC-hosts_ss.cfg
cfg_file=/usr/local/nagios/etc/objects/CH2-OC-hosts_ss.cfg
cfg_file=/usr/local/nagios/etc/objects/DC3-nOC-hosts_ss.cfg
cfg_file=/usr/local/nagios/etc/objects/DC3-OC-hosts_ss.cfg
cfg_file=/usr/local/nagios/etc/objects/FR2-OC-hosts_ss.cfg
cfg_file=/usr/local/nagios/etc/objects/INTX-OC-hosts_ss.cfg
cfg_file=/usr/local/nagios/etc/objects/NULL-host-hosts_ss.cfg
cfg_file=/usr/local/nagios/etc/objects/NY4-OC-hosts_ss.cfg
cfg_file=/usr/local/nagios/etc/objects/oc_objects/oc_services.cfg
cfg_file=/usr/local/nagios/etc/objects/STATIC-HOSTS/hypervisors.cfg
cfg_file=/usr/local/nagios/etc/objects/STATIC-HOSTS/internal-servs.cfg
cfg_file=/usr/local/nagios/etc/objects/STATIC-HOSTS/windoze-servs.cfg
cfg_file=/usr/local/nagios/etc/objects/STATIC-HOSTS/lanes-VMs.cfg
cfg_file=/usr/local/nagios/etc/objects/STATIC-HOSTS/other-VMs.cfg
object_cache_file=/usr/local/nagios/var/objects.cache
precached_object_file=/usr/local/nagios/var/objects.precache
resource_file=/usr/local/nagios/etc/resource.cfg
status_file=/usr/local/nagios/var/status.dat
status_update_interval=10
nagios_user=nagios
nagios_group=nagios
check_external_commands=1
command_file=/usr/local/nagios/var/rw/nagios.cmd
lock_file=/usr/local/nagios/var/nagios.lock
temp_file=/usr/local/nagios/var/nagios.tmp
temp_path=/tmp
event_broker_options=0
log_rotation_method=d
log_archive_path=/usr/local/nagios/var/archives
use_syslog=1
log_notifications=1
log_service_retries=1
log_host_retries=1
log_event_handlers=1
log_initial_states=0
log_current_states=1
log_external_commands=1
log_passive_checks=1
service_inter_check_delay_method=s
max_service_check_spread=30
service_interleave_factor=s
host_inter_check_delay_method=s
max_host_check_spread=30
max_concurrent_checks=0
check_result_reaper_frequency=10
max_check_result_reaper_time=30
check_result_path=/usr/local/nagios/var/spool/checkresults
max_check_result_file_age=3600
cached_host_check_horizon=15
cached_service_check_horizon=15
enable_predictive_host_dependency_checks=1
enable_predictive_service_dependency_checks=1
soft_state_dependencies=0
auto_reschedule_checks=0
auto_rescheduling_interval=30
auto_rescheduling_window=180
service_check_timeout=60
host_check_timeout=30
event_handler_timeout=30
notification_timeout=30
ocsp_timeout=5
perfdata_timeout=5
retain_state_information=1
state_retention_file=/usr/local/nagios/var/retention.dat
retention_update_interval=60
use_retained_program_state=1
use_retained_scheduling_info=1
retained_host_attribute_mask=0
retained_service_attribute_mask=0
retained_process_host_attribute_mask=0
retained_process_service_attribute_mask=0
retained_contact_host_attribute_mask=0
retained_contact_service_attribute_mask=0
interval_length=60
check_for_updates=1
bare_update_check=0
use_aggressive_host_checking=0
execute_service_checks=1
accept_passive_service_checks=1
execute_host_checks=1
accept_passive_host_checks=1
enable_notifications=1
enable_event_handlers=1
process_performance_data=0
obsess_over_services=0
obsess_over_hosts=0
translate_passive_host_checks=0
passive_host_checks_are_soft=0
check_for_orphaned_services=1
check_for_orphaned_hosts=1
check_service_freshness=1
service_freshness_check_interval=60
service_check_timeout_state=c
check_host_freshness=0
host_freshness_check_interval=60
additional_freshness_latency=15
enable_flap_detection=1
low_service_flap_threshold=5.0
high_service_flap_threshold=20.0
low_host_flap_threshold=5.0
high_host_flap_threshold=20.0
date_format=us
illegal_object_name_chars=`~!$%^&*|'"<>?,()=
illegal_macro_output_chars=`~$&|'"<>
use_regexp_matching=0
use_true_regexp_matching=0
admin_email=nagios@localhost
admin_pager=pagenagios@localhost
daemon_dumps_core=0
use_large_installation_tweaks=0
enable_environment_macros=0
debug_level=280
debug_verbosity=2
debug_file=/usr/local/nagios/var/nagios.debug
max_debug_file_size=1000000
allow_empty_hostgroup_assignment=0


And the ps output:

Code:
UID        PID  PPID  C STIME TTY          TIME CMD
root         1     0  0  2016 ?        00:02:03 /sbin/init
root         2     0  0  2016 ?        00:00:01 [kthreadd]
root         3     2  0  2016 ?        02:51:19 [migration/0]
root         4     2  0  2016 ?        00:02:49 [ksoftirqd/0]
root         5     2  0  2016 ?        00:00:00 [migration/0]
root         6     2  0  2016 ?        00:01:15 [watchdog/0]
root         7     2  0  2016 ?        02:41:18 [migration/1]
root         8     2  0  2016 ?        00:00:00 [migration/1]
root         9     2  0  2016 ?        00:03:23 [ksoftirqd/1]
root        10     2  0  2016 ?        00:01:10 [watchdog/1]
root        11     2  0  2016 ?        03:02:07 [migration/2]
root        12     2  0  2016 ?        00:00:00 [migration/2]
root        13     2  0  2016 ?        00:05:12 [ksoftirqd/2]
root        14     2  0  2016 ?        00:01:15 [watchdog/2]
root        15     2  0  2016 ?        01:39:51 [migration/3]
root        16     2  0  2016 ?        00:00:00 [migration/3]
root        17     2  0  2016 ?        00:02:41 [ksoftirqd/3]
root        18     2  0  2016 ?        00:01:07 [watchdog/3]
root        19     2  0  2016 ?        01:39:00 [migration/4]
root        20     2  0  2016 ?        00:00:00 [migration/4]
root        21     2  0  2016 ?        00:02:23 [ksoftirqd/4]
root        22     2  0  2016 ?        00:01:04 [watchdog/4]
root        23     2  0  2016 ?        01:30:46 [migration/5]
root        24     2  0  2016 ?        00:00:00 [migration/5]
root        25     2  0  2016 ?        00:02:15 [ksoftirqd/5]
root        26     2  0  2016 ?        00:01:03 [watchdog/5]
root        27     2  0  2016 ?        00:57:43 [events/0]
root        28     2  0  2016 ?        00:47:11 [events/1]
root        29     2  0  2016 ?        00:57:47 [events/2]
root        30     2  0  2016 ?        00:43:16 [events/3]
root        31     2  0  2016 ?        01:00:25 [events/4]
root        32     2  0  2016 ?        01:01:14 [events/5]
root        33     2  0  2016 ?        00:00:00 [cgroup]
root        34     2  0  2016 ?        00:00:00 [khelper]
root        35     2  0  2016 ?        00:00:00 [netns]
root        36     2  0  2016 ?        00:00:00 [async/mgr]
root        37     2  0  2016 ?        00:00:00 [pm]
root        38     2  0  2016 ?        00:03:21 [sync_supers]
root        39     2  0  2016 ?        00:03:36 [bdi-default]
root        40     2  0  2016 ?        00:00:00 [kintegrityd/0]
root        41     2  0  2016 ?        00:00:00 [kintegrityd/1]
root        42     2  0  2016 ?        00:00:00 [kintegrityd/2]
root        43     2  0  2016 ?        00:00:00 [kintegrityd/3]
root        44     2  0  2016 ?        00:00:00 [kintegrityd/4]
root        45     2  0  2016 ?        00:00:00 [kintegrityd/5]
root        46     2  0  2016 ?        00:45:36 [kblockd/0]
root        47     2  0  2016 ?        00:00:13 [kblockd/1]
root        48     2  0  2016 ?        00:00:09 [kblockd/2]
root        49     2  0  2016 ?        00:00:06 [kblockd/3]
root        50     2  0  2016 ?        00:00:04 [kblockd/4]
root        51     2  0  2016 ?        00:00:04 [kblockd/5]
root        52     2  0  2016 ?        00:00:00 [kacpid]
root        53     2  0  2016 ?        00:00:00 [kacpi_notify]
root        54     2  0  2016 ?        00:00:00 [kacpi_hotplug]
root        55     2  0  2016 ?        00:00:00 [ata_aux]
root        56     2  0  2016 ?        00:00:00 [ata_sff/0]
root        57     2  0  2016 ?        00:00:00 [ata_sff/1]
root        58     2  0  2016 ?        00:00:00 [ata_sff/2]
root        59     2  0  2016 ?        00:00:00 [ata_sff/3]
root        60     2  0  2016 ?        00:00:00 [ata_sff/4]
root        61     2  0  2016 ?        00:00:00 [ata_sff/5]
root        62     2  0  2016 ?        00:00:00 [ksuspend_usbd]
root        63     2  0  2016 ?        00:00:00 [khubd]
root        64     2  0  2016 ?        00:00:00 [kseriod]
root        65     2  0  2016 ?        00:00:00 [md/0]
root        66     2  0  2016 ?        00:00:00 [md/1]
root        67     2  0  2016 ?        00:00:00 [md/2]
root        68     2  0  2016 ?        00:00:00 [md/3]
root        69     2  0  2016 ?        00:00:00 [md/4]
root        70     2  0  2016 ?        00:00:00 [md/5]
root        71     2  0  2016 ?        00:00:00 [md_misc/0]
root        72     2  0  2016 ?        00:00:00 [md_misc/1]
root        73     2  0  2016 ?        00:00:00 [md_misc/2]
root        74     2  0  2016 ?        00:00:00 [md_misc/3]
root        75     2  0  2016 ?        00:00:00 [md_misc/4]
root        76     2  0  2016 ?        00:00:00 [md_misc/5]
root        77     2  0  2016 ?        00:00:00 [linkwatch]
root        78     2  0  2016 ?        00:00:20 [khungtaskd]
root        79     2  0  2016 ?        00:00:43 [kswapd0]
root        80     2  0  2016 ?        00:00:00 [ksmd]
root        81     2  0  2016 ?        00:04:52 [khugepaged]
root        82     2  0  2016 ?        00:00:00 [aio/0]
root        83     2  0  2016 ?        00:00:00 [aio/1]
root        84     2  0  2016 ?        00:00:00 [aio/2]
root        85     2  0  2016 ?        00:00:00 [aio/3]
root        86     2  0  2016 ?        00:00:00 [aio/4]
root        87     2  0  2016 ?        00:00:00 [aio/5]
root        88     2  0  2016 ?        00:00:00 [crypto/0]
root        89     2  0  2016 ?        00:00:00 [crypto/1]
root        90     2  0  2016 ?        00:00:00 [crypto/2]
root        91     2  0  2016 ?        00:00:00 [crypto/3]
root        92     2  0  2016 ?        00:00:00 [crypto/4]
root        93     2  0  2016 ?        00:00:00 [crypto/5]
root        98     2  0  2016 ?        00:00:00 [kthrotld/0]
root        99     2  0  2016 ?        00:00:00 [kthrotld/1]
root       100     2  0  2016 ?        00:00:00 [kthrotld/2]
root       101     2  0  2016 ?        00:00:00 [kthrotld/3]
root       102     2  0  2016 ?        00:00:00 [kthrotld/4]
root       103     2  0  2016 ?        00:00:00 [kthrotld/5]
root       105     2  0  2016 ?        00:00:00 [kpsmoused]
root       106     2  0  2016 ?        00:00:00 [usbhid_resumer]
root       136     2  0  2016 ?        00:00:00 [kstriped]
root       312     2  0  2016 ?        00:00:00 [scsi_eh_0]
root       313     2  0  2016 ?        00:00:00 [scsi_eh_1]
root       345     2  0  2016 ?        00:00:00 [virtio-blk]
root       361     2  0  2016 ?        01:42:36 [jbd2/vda2-8]
root       362     2  0  2016 ?        00:00:00 [ext4-dio-unwrit]
root       453     1  0  2016 ?        00:00:00 /sbin/udevd -d
root       667     2  0  2016 ?        00:00:00 [virtio-net]
root       669     2  0  2016 ?        00:00:00 [vballoon]
root       908   453  0  2016 ?        00:00:00 /sbin/udevd -d
apache     920  6242  0 12:18 ?        00:00:01 /usr/sbin/httpd
root       940     2  0  2016 ?        00:00:00 [jbd2/vda1-8]
root       941     2  0  2016 ?        00:00:00 [ext4-dio-unwrit]
root       984     2  0  2016 ?        00:02:16 [kauditd]
root      1081     2  0  2016 ?        00:45:58 [flush-252:0]
root      1160     1  0  2016 ?        00:05:18 auditd
root      1178     1  0  2016 ?        00:00:00 /sbin/portreserve
root      1185     1  0  2016 ?        01:11:14 /sbin/rsyslogd -i /var/run/syslogd.pid -c 5
root      1214     1  0  2016 ?        00:51:06 irqbalance --pid=/var/run/irqbalance.pid
rpc       1228     1  0  2016 ?        00:00:58 rpcbind
rpcuser   1246     1  0  2016 ?        00:00:00 rpc.statd
dbus      1375     1  0  2016 ?        00:00:02 dbus-daemon --system
root      1404     1  0  2016 ?        00:00:00 /usr/sbin/acpid
68        1414     1  0  2016 ?        00:03:19 hald
root      1415  1414  0  2016 ?        00:00:00 hald-runner
root      1444  1415  0  2016 ?        00:00:00 hald-addon-input: Listening on /dev/input/event2 /dev/input/event0
68        1458  1415  0  2016 ?        00:00:00 hald-addon-acpi: listening on acpid socket /var/run/acpid.socket
root      1471   453  0  2016 ?        00:00:00 /sbin/udevd -d
root      1477     1  0  2016 ?        00:36:31 automount --pid-file /var/run/autofs.pid
root      1497     1  0  2016 ?        00:03:33 /usr/sbin/sshd
root      1532     1  0  2016 ?        00:00:00 /bin/sh /usr/bin/mysqld_safe --datadir=/var/lib/mysql --socket=/var/lib/mysql/mysql.sock --pid-file=/var/run/mysqld/mysqld.pid --basedir=/usr --user=mysql
mysql     1634  1532  0  2016 ?        03:38:59 /usr/libexec/mysqld --basedir=/usr --datadir=/var/lib/mysql --user=mysql --log-error=/var/log/mysqld.log --pid-file=/var/run/mysqld/mysqld.pid --socket=/var/lib/mysql/mysql.sock
root      1726     1  0  2016 ?        01:21:21 /usr/libexec/postfix/master
postfix   1735  1726  0  2016 ?        02:15:13 qmgr -l -t fifo -u
postfix   1748  1726  0 16:41 ?        00:00:00 pickup -l -t fifo -u
root      1750     1  0  2016 ?        00:04:56 crond
root      1761     1  0  2016 ?        00:00:00 /usr/sbin/atd
root      1808     1  0  2016 ?        00:00:00 /opt/dell/srvadmin/sbin/dsm_om_connsvcd -run
root      1809  1808  0  2016 ?        17:02:14 /opt/dell/srvadmin/sbin/dsm_om_connsvcd -run
root      1882     1  0  2016 tty2     00:00:00 /sbin/mingetty /dev/tty2
root      1884     1  0  2016 tty3     00:00:00 /sbin/mingetty /dev/tty3
root      1886     1  0  2016 tty4     00:00:00 /sbin/mingetty /dev/tty4
root      1888     1  0  2016 tty5     00:00:00 /sbin/mingetty /dev/tty5
root      1890     1  0  2016 tty6     00:00:00 /sbin/mingetty /dev/tty6
root      2284  1497  0 14:40 ?        00:00:00 sshd: root@pts/0
root      2398  2284  0 14:40 pts/0    00:00:00 -bash
root      2439  2398  0 14:40 pts/0    00:00:00 su - nagios
nagios    2440  2439  0 14:40 pts/0    00:00:00 -bash
postfix   2753  1726  0 16:42 ?        00:00:00 bounce -z -n defer -t unix -u
postfix   3359  1726  0 16:43 ?        00:00:00 error -n retry -t unix -u
apache    4255  6242  0 12:21 ?        00:00:01 /usr/sbin/httpd
nagios    4319     1  0  2017 ?        00:00:00 /usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg
postfix   5305  1726  0 16:44 ?        00:00:00 cleanup -z -t unix -u
postfix   5804  1726  0 16:45 ?        00:00:00 error -n retry -t unix -u
postfix   5805  1726  0 16:45 ?        00:00:00 error -n retry -t unix -u
postfix   5806  1726  0 16:45 ?        00:00:00 bounce -z -n defer -t unix -u
postfix   5807  1726  0 16:45 ?        00:00:00 error -n retry -t unix -u
postfix   5808  1726  0 16:45 ?        00:00:00 error -n retry -t unix -u
postfix   5809  1726  0 16:45 ?        00:00:00 bounce -z -n defer -t unix -u
postfix   5810  1726  0 16:45 ?        00:00:00 error -n retry -t unix -u
postfix   5811  1726  0 16:45 ?        00:00:00 bounce -z -n defer -t unix -u
postfix   5812  1726  0 16:45 ?        00:00:00 error -n retry -t unix -u
postfix   5813  1726  0 16:45 ?        00:00:00 bounce -z -n defer -t unix -u
postfix   5815  1726  0 16:45 ?        00:00:00 bounce -z -n defer -t unix -u
postfix   5816  1726  0 16:45 ?        00:00:00 error -n retry -t unix -u
postfix   5819  1726  0 16:45 ?        00:00:00 bounce -z -n defer -t unix -u
root      6040     2  0 Jan02 ?        00:00:00 [bluetooth]
root      6242     1  0  2016 ?        00:20:04 /usr/sbin/httpd
apache    6251  6242  0 Jan10 ?        00:00:05 /usr/sbin/httpd
apache    6257  6242  0 Jan10 ?        00:00:05 /usr/sbin/httpd
apache    6258  6242  0 Jan10 ?        00:00:04 /usr/sbin/httpd
nagios    6841 26464  0 16:46 ?        00:00:00 /usr/local/nagios/libexec/check_ping -H 10.102.3.75 -w 3000.0,80% -c 5000.0,100% -p 5
nagios    6842  6841  0 16:46 ?        00:00:00 /bin/ping -n -U -w 30 -c 5 10.102.3.75
nagios    6854 26461  0 16:46 ?        00:00:00 /usr/local/nagios/libexec/check_ping -H 10.54.205.61 -w 100.0,20% -c 500.0,60% -p 5
nagios    6855  6854  0 16:46 ?        00:00:00 /bin/ping -n -U -w 10 -c 5 10.54.205.61
nagios    6857 26468  0 16:46 ?        00:00:00 /usr/local/nagios/libexec/check_ping -H 10.54.170.61 -w 100.0,20% -c 500.0,60% -p 5
nagios    6858  6857  0 16:46 ?        00:00:00 /bin/ping -n -U -w 10 -c 5 10.54.170.61
nagios    6861  2440  1 16:46 pts/0    00:00:00 ps -ef --cols=300
ntp      11603     1  0  2016 ?        00:00:53 ntpd -u ntp:ntp -p /var/run/ntpd.pid -g
apache   12479  6242  0 Jan09 ?        00:00:11 /usr/sbin/httpd
root     15186     2  0  2016 ?        00:00:08 [rpciod/0]
root     15187     2  0  2016 ?        00:00:00 [rpciod/1]
root     15188     2  0  2016 ?        00:23:24 [rpciod/2]
root     15189     2  0  2016 ?        00:00:01 [rpciod/3]
root     15190     2  0  2016 ?        00:00:00 [rpciod/4]
root     15191     2  0  2016 ?        00:00:00 [rpciod/5]
root     15194     2  0  2016 ?        00:00:00 [kslowd000]
root     15195     2  0  2016 ?        00:00:00 [kslowd001]
root     15196     2  0  2016 ?        00:05:58 [nfsiod]
root     16723     2  0  2016 ?        00:00:00 [nfsv4.0-svc]
apache   22925  6242  0 14:58 ?        00:00:00 /usr/sbin/httpd
apache   22931  6242  0 14:58 ?        00:00:00 /usr/sbin/httpd
apache   22932  6242  0 14:58 ?        00:00:00 /usr/sbin/httpd
root     24946     1  0  2016 ?        00:00:00 /usr/sbin/console-kit-daemon --no-daemon
nagios   26458     1  1 16:01 ?        00:00:47 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios   26460 26458  0 16:01 ?        00:00:03 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios   26461 26458  0 16:01 ?        00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios   26462 26458  0 16:01 ?        00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios   26463 26458  0 16:01 ?        00:00:03 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios   26464 26458  0 16:01 ?        00:00:03 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios   26465 26458  0 16:01 ?        00:00:03 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios   26466 26458  0 16:01 ?        00:00:03 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios   26467 26458  0 16:01 ?        00:00:03 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios   26468 26458  0 16:01 ?        00:00:02 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
nagios   26469 26458  0 16:01 ?        00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
root     26617     1  0  2016 tty1     00:00:00 /sbin/mingetty /dev/tty1
apache   32603  6242  0 12:17 ?        00:00:01 /usr/sbin/httpd

octom
 
Posts: 4
Joined: Mon May 19, 2014 11:43 am

Re: Passive checks work once then timeout for the next 2 min

Postby cdienger » Fri Jan 12, 2018 3:45 pm

Was the tcpdump taken on the Core side or client side? If Core, try running it on the client side to see if it's making any attempt to connect to Core or if you see anything else that's odd.
cdienger
Support Tech
 
Posts: 842
Joined: Tue Feb 07, 2017 11:26 am

Re: Passive checks work once then timeout for the next 2 min

Postby octom » Thu Jan 18, 2018 11:32 am

Captures were taken on both the Core side and the hypervisor between the Core and the client. During the two-minute "radio silence" the hypervisor sees traffic but the Core VM does not.
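
A small sketch of how the two captures could be compared, by counting the connection attempts (TCP SYNs) to the NSCA port that each side saw. `count_syns` is a hypothetical helper, the capture file names in the usage note are placeholders, and the match assumes the text output format of tcpdump 4.x, where a SYN is printed as `Flags [S],`.

```shell
#!/usr/bin/env bash
# Count TCP SYN packets addressed to a given destination port in
# tcpdump text output read from stdin.
# Assumption: tcpdump 4.x style lines such as
#   16:46:01.000001 IP 10.1.2.3.45678 > 10.9.8.7.5667: Flags [S], seq 1, ...
count_syns() {
    port=${1:-5667}
    # The destination address is followed by ":", so ".PORT: Flags [S],"
    # matches only SYNs sent to that port (SYN-ACKs print as [S.]).
    grep -c "\.$port: Flags \[S\],"
}
```

Usage would look like `tcpdump -nn -r hypervisor.pcap 'tcp port 5667' | count_syns 5667`, then the same for the capture taken on the Core VM. A higher count on the hypervisor than on the VM for the same period would be consistent with the SYNs being dropped between the two.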
octom
 
Posts: 4
Joined: Mon May 19, 2014 11:43 am

Re: Passive checks work once then timeout for the next 2 min

Postby cdienger » Fri Jan 19, 2018 4:52 pm

I'd agree then that this appears to be a networking issue. Hopefully the networking team will be able to help track down these missing packets.
cdienger
Support Tech
 
Posts: 842
Joined: Tue Feb 07, 2017 11:26 am

