Page 1 of 1
Weird Issues with Nagios
Posted: Sat May 07, 2016 8:50 am
by soamz
I have a server, which has 2 ports only.
1 port is connected directly to the router to give it the public IP access.
2nd port is connected to my access network switch, which has more than 100+ devices, which are on private IP, 10.10.10.xxx , 10.10.11.xxx
Now in the server, all the devices show up perfectly fine, if I do arp -a command.
But if I try to ping any random devices, it doesnt ping. Then again it starts pinging after 5 mins, again stops, again pings in 1 day. Its all random ghost!
What could be the issue.
For example, im able to ping 10.10.10.23 device from the command line.
But Nagios tells it as offline.
Im on 4.1.1 nagios installed on ubuntu server latest.
Re: Weird Issues with Nagios
Posted: Sun May 08, 2016 7:33 pm
by Box293
Is your server low on memory or disk space?
What is the output of:
Code: Select all
ps -ef | grep nagios.cfg | grep -v grep
Can you please post your nagios.cfg file.
On your Nagios host, in multiple SSH sessions, have you kept a ping running continually on a bunch of devices to see if there is actually a problem occurring on your network?
Re: Weird Issues with Nagios
Posted: Mon May 09, 2016 5:44 am
by soamz
Code: Select all
root@jetnms:~# free -m
total used free shared buffers cached
Mem: 16043 6070 9972 120 204 4765
-/+ buffers/cache: 1101 14941
Swap: 16376 0 16376
root@jetnms:~#
Code: Select all
root@jetnms:~# df -h
Filesystem Size Used Avail Use% Mounted on
udev 7.9G 4.0K 7.9G 1% /dev
tmpfs 1.6G 584K 1.6G 1% /run
/dev/sda1 443G 10G 410G 3% /
none 4.0K 0 4.0K 0% /sys/fs/cgroup
none 5.0M 0 5.0M 0% /run/lock
none 7.9G 4.0K 7.9G 1% /run/shm
none 100M 0 100M 0% /run/user
root@jetnms:~#
Code: Select all
root@jetnms:~# df -i
Filesystem Inodes IUsed IFree IUse% Mounted on
udev 2050846 445 2050401 1% /dev
tmpfs 2053530 438 2053092 1% /run
/dev/sda1 29450240 163245 29286995 1% /
none 2053530 2 2053528 1% /sys/fs/cgroup
none 2053530 6 2053524 1% /run/lock
none 2053530 2 2053528 1% /run/shm
none 2053530 2 2053528 1% /run/user
root@jetnms:~#
Re: Weird Issues with Nagios
Posted: Mon May 09, 2016 5:45 am
by soamz
Code: Select all
root@jetnms:~# ps -ef | grep nagios.cfg | grep -v grep
nagios 19590 1 0 May08 ? 00:00:39 bin/nagios etc/nagios.cfg
nagios 19604 19590 0 May08 ? 00:00:03 bin/nagios etc/nagios.cfg
root@jetnms:~#
And attached my /etc/nagios4/conf.d/jetnms.cfg
Re: Weird Issues with Nagios
Posted: Mon May 09, 2016 1:27 pm
by rkennedy
What is the output of route -n? It sounds like a routing issue more than a Nagios problem at this point.
Re: Weird Issues with Nagios
Posted: Mon May 09, 2016 2:02 pm
by soamz
rkennedy wrote:What is the output of route -n? It sounds like a routing issue more than a Nagios problem at this point.
Code: Select all
root@jetnms:~# route -n
Kernel IP routing table
Destination Gateway Genmask Flags Metric Ref Use Iface
0.0.0.0 103.194.232.1 0.0.0.0 UG 0 0 0 eth0
7.7.7.0 0.0.0.0 255.255.255.0 U 0 0 0 eth1
10.10.10.0 0.0.0.0 255.255.255.0 U 0 0 0 eth1
10.10.11.0 0.0.0.0 255.255.255.0 U 0 0 0 eth1
103.194.232.0 0.0.0.0 255.255.255.240 U 0 0 0 eth0
Re: Weird Issues with Nagios
Posted: Mon May 09, 2016 4:30 pm
by rkennedy
Going to close this one up so we aren't working on the same issue at play as here -
https://support.nagios.com/forum/viewto ... 999#bottom
We will continue in that thread since these are related.