I'm working on a new Nagios install: we took a backup from one server and restored it on another. We have been putting out fires along the way, and the most recent one has me stumped. Nagios spawns an ever-increasing number of Nagios processes over time until the box succumbs to the load. The highest count I've seen is 222. Below is the current process list, followed by a log excerpt:
$ ps aux | grep nagios.cfg | grep -v grep
nagios 1497 0.0 0.1 156352 39852 ? S 14:35 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1500 0.0 0.1 156352 39852 ? S 14:35 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1808 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1836 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1837 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1838 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1840 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1843 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1845 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1847 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1849 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1851 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1852 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1855 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 1857 0.0 0.1 156352 40596 ? S 15:39 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 4888 5.5 0.1 156340 41572 ? Ssl 13:42 6:31 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 15966 0.0 0.1 156352 38660 ? S 14:00 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 30058 0.0 0.1 156344 40412 ? S 15:30 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 30207 0.0 0.1 156352 39944 ? S 14:28 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 31757 0.0 0.1 156352 39932 ? S 14:31 0:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg

Jan 22 15:45:17 servername nagios: SERVICE ALERT: XXXXXXX;Portal;CRITICAL;SOFT;2;No process matching Portal.exe found : CRITICAL
Jan 22 15:45:17 servername nagios: SERVICE ALERT: XXXXXXX;Portal;CRITICAL;SOFT;2;No process matching Portal.exe found : CRITICAL

Any advice on how to proceed with troubleshooting would be appreciated.
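In case it helps narrow things down, here is a minimal sketch of how the processes could be grouped by parent PID, since the `ps aux` output above does not show parentage. The function name `count_by_parent` is made up for illustration, and the sketch assumes a `ps` that accepts `-e` and `-o pid=,ppid=,args=` style columns (procps does). It has not been run against this box, so treat it as a starting point rather than a diagnosis.

```shell
#!/bin/sh
# count_by_parent (hypothetical helper name):
# reads "PID PPID ARGS..." lines on stdin, keeps only those whose command
# line mentions nagios.cfg, and prints "<count> <ppid>" per parent PID,
# largest count first.
count_by_parent() {
    awk '/nagios\.cfg/ { n[$2]++ } END { for (p in n) print n[p], p }' | sort -rn
}

# Live usage: list every process with its parent PID and full command line.
# The trailing "=" on each column suppresses the header row.
if command -v ps >/dev/null 2>&1; then
    ps -e -o pid= -o ppid= -o args= | count_by_parent
fi
```

If most of the count sits under a single parent PID, that would suggest one daemon accumulating children; many entries whose parent is PID 1 would instead point toward separately started or orphaned processes. Either reading is an assumption to verify, not a conclusion.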
Thanks,
nseltzer