Page 1 of 1

Weird Issues with Nagios

Posted: Sat May 07, 2016 8:50 am
by soamz
I have a server, which has 2 ports only.
1 port is connected directly to the router to give it the public IP access.
2nd port is connected to my access network switch, which has more than 100+ devices, which are on private IP, 10.10.10.xxx , 10.10.11.xxx


Now in the server, all the devices show up perfectly fine, if I do arp -a command.

But if I try to ping any random devices, it doesnt ping. Then again it starts pinging after 5 mins, again stops, again pings in 1 day. Its all random ghost!

What could be the issue.

For example, im able to ping 10.10.10.23 device from the command line.
But Nagios tells it as offline.

Im on 4.1.1 nagios installed on ubuntu server latest.

Re: Weird Issues with Nagios

Posted: Sun May 08, 2016 7:33 pm
by Box293
Is your server low on memory or disk space?

Code: Select all

free -m
df -h
df -i
What is the output of:

Code: Select all

ps -ef | grep nagios.cfg | grep -v grep
Can you please post your nagios.cfg file.

On your Nagios host, in multiple SSH sessions, have you kept a ping running continually on a bunch of devices to see if there is actually a problem occurring on your network?

Re: Weird Issues with Nagios

Posted: Mon May 09, 2016 5:44 am
by soamz

Code: Select all

root@jetnms:~# free -m
             total       used       free     shared    buffers     cached
Mem:         16043       6070       9972        120        204       4765
-/+ buffers/cache:       1101      14941
Swap:        16376          0      16376
root@jetnms:~# 

Code: Select all

root@jetnms:~# df -h
Filesystem      Size  Used Avail Use% Mounted on
udev            7.9G  4.0K  7.9G   1% /dev
tmpfs           1.6G  584K  1.6G   1% /run
/dev/sda1       443G   10G  410G   3% /
none            4.0K     0  4.0K   0% /sys/fs/cgroup
none            5.0M     0  5.0M   0% /run/lock
none            7.9G  4.0K  7.9G   1% /run/shm
none            100M     0  100M   0% /run/user
root@jetnms:~# 

Code: Select all

root@jetnms:~# df -i
Filesystem       Inodes  IUsed    IFree IUse% Mounted on
udev            2050846    445  2050401    1% /dev
tmpfs           2053530    438  2053092    1% /run
/dev/sda1      29450240 163245 29286995    1% /
none            2053530      2  2053528    1% /sys/fs/cgroup
none            2053530      6  2053524    1% /run/lock
none            2053530      2  2053528    1% /run/shm
none            2053530      2  2053528    1% /run/user
root@jetnms:~# 

Re: Weird Issues with Nagios

Posted: Mon May 09, 2016 5:45 am
by soamz

Code: Select all

root@jetnms:~# ps -ef | grep nagios.cfg | grep -v grep
nagios   19590     1  0 May08 ?        00:00:39 bin/nagios etc/nagios.cfg
nagios   19604 19590  0 May08 ?        00:00:03 bin/nagios etc/nagios.cfg
root@jetnms:~# 

And attached my /etc/nagios4/conf.d/jetnms.cfg

Re: Weird Issues with Nagios

Posted: Mon May 09, 2016 1:27 pm
by rkennedy
What is the output of route -n? It sounds like a routing issue more than a Nagios problem at this point.

Re: Weird Issues with Nagios

Posted: Mon May 09, 2016 2:02 pm
by soamz
rkennedy wrote:What is the output of route -n? It sounds like a routing issue more than a Nagios problem at this point.

Code: Select all

root@jetnms:~# route -n
Kernel IP routing table
Destination     Gateway         Genmask         Flags Metric Ref    Use Iface
0.0.0.0         103.194.232.1   0.0.0.0         UG    0      0        0 eth0
7.7.7.0         0.0.0.0         255.255.255.0   U     0      0        0 eth1
10.10.10.0      0.0.0.0         255.255.255.0   U     0      0        0 eth1
10.10.11.0      0.0.0.0         255.255.255.0   U     0      0        0 eth1
103.194.232.0   0.0.0.0         255.255.255.240 U     0      0        0 eth0

Re: Weird Issues with Nagios

Posted: Mon May 09, 2016 4:30 pm
by rkennedy
Going to close this one up so we aren't working on the same issue at play as here -
https://support.nagios.com/forum/viewto ... 999#bottom

We will continue in that thread since these are related.