Multiple instances f same nagios processes

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
Locked
anuragvidyarthi
Posts: 4
Joined: Fri Jan 27, 2012 8:56 am

Multiple instances f same nagios processes

Post by anuragvidyarthi »

Hi,

We have Nagios core 3.0 running in one of the local zones of solaris 10 machine ( on sparc).
I noticed that as soon as I start the Nagios process by following command :

/usr/local/nagios/bin/nagios -d -d /usr/local/nagios/etc/nagios.cfg

I see that it has spawned many processes:
( which I verify by following command : ps -elf | grep /usr/local/nagios/bin/nagios )

F S UID PID PPID C PRI NI ADDR SZ WCHAN STIME TTY TIME CMD
0 S radmin 1994 16078 0 40 20 ? 1039 ? 14:54:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 4575 16078 0 40 20 ? 1039 ? 14:57:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 11009 16078 0 40 20 ? 1039 ? 15:05:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 11021 16078 0 40 20 ? 1039 ? 15:05:03 ? 0:00 /usr/local/nagios/bin/nagios -d /us

PS : 16078 is process id of zsched( so it is fine ).

and these lines keep moving up and down ( but increasing ).

I checked on some other similar Nagios installation but found single instance of nagios running there.
Has anybody got the idea why I am having multiple instances of nagios.

I checked the log and there is no error or reference of multiple instances.

System is running fine but , I am still not able to solve this mystry ? Is it some kind of bug ?

Best Regards,
Anurag Vidyarthi
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Multiple instances f same nagios processes

Post by mguthrie »

/usr/local/nagios/bin/nagios -d -d /usr/local/nagios/etc/nagios.cfg
Should just be:
/usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
Also, the nagios process forks itself to run checks and subprocesses, so as long as you don't have multiple parent processes, you should be fine.
anuragvidyarthi
Posts: 4
Joined: Fri Jan 27, 2012 8:56 am

Re: Multiple instances f same nagios processes

Post by anuragvidyarthi »

Hi,

that was typing mistake, it has only one '-d' :

/usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg

I got your point that it forks other subprocesses. But why my other nagios setups do not show such behaviour.
Is their any setting which activate such behaviour ?

PS ! Everything is fine with nagios monitoring but the number of sub-processes are just going up and up.

proddmin@nagios:~$ ps -elf | head -1 && ps -elf | grep /usr/local/nagios/bin/nagios | sort +12 && ps -elf | grep /usr/local/nagios/bin/nagios | wc -l F S UID PID PPID C PRI NI ADDR SZ WCHAN STIME TTY TIME CMD
0 S radmin 1994 16078 0 40 20 ? 1039 ? Jan 27 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 4575 16078 0 40 20 ? 1039 ? Jan 27 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 16310 16078 0 40 20 ? 1039 ? Jan 27 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 18247 16078 0 40 20 ? 1039 ? Jan 27 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19798 16078 0 40 20 ? 1039 ? Jan 27 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 6402 16078 0 47 20 ? 1041 ? Jan 27 ? 11:28 /usr/local/nagios/bin/nagios -d /us
0 S radmin 2196 16078 0 40 20 ? 1039 ? Jan 28 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 4362 16078 0 40 20 ? 1039 ? Jan 28 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 6718 16078 0 40 20 ? 1039 ? Jan 28 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 12558 16078 0 40 20 ? 1039 ? Jan 28 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 14956 16078 0 40 20 ? 1039 ? Jan 28 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 16325 16078 0 40 20 ? 1039 ? Jan 28 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 21426 16078 0 40 20 ? 1039 ? Jan 28 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 22965 16078 0 40 20 ? 1039 ? Jan 28 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 25190 16078 0 40 20 ? 1039 ? Jan 28 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 10677 16078 0 40 20 ? 1039 ? Jan 29 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 28751 16078 0 40 20 ? 1039 ? Jan 29 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 446 16078 0 40 20 ? 1039 ? 06:12:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 6058 16078 0 40 20 ? 1039 ? 03:54:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 9397 16078 0 40 20 ? 1039 ? 01:31:44 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 15569 16078 0 40 20 ? 1039 ? 01:39:06 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 16674 16078 0 40 20 ? 1039 ? 20:11:45 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19151 16078 0 40 20 ? 1039 ? 08:23:49 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19161 16078 0 40 20 ? 1039 ? 08:23:50 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19173 16078 0 40 20 ? 1039 ? 08:23:51 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19205 16078 0 40 20 ? 1039 ? 08:23:51 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19209 16078 0 40 20 ? 1039 ? 08:23:51 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19321 16078 0 40 20 ? 1039 ? 08:24:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19325 16078 0 40 20 ? 1039 ? 08:24:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19329 16078 0 40 20 ? 1039 ? 08:24:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19337 16078 0 40 20 ? 1039 ? 08:24:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19341 16078 0 40 20 ? 1039 ? 08:24:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19345 16078 0 40 20 ? 1039 ? 08:24:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19349 16078 0 40 20 ? 1039 ? 08:24:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19353 16078 0 40 20 ? 1039 ? 08:24:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19357 16078 0 40 20 ? 1039 ? 08:24:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19361 16078 0 40 20 ? 1039 ? 08:24:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19505 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19509 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19513 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19517 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19521 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19525 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19529 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19533 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19537 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19541 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19545 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19549 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19553 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19557 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19561 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19565 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19569 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19573 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19577 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19581 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19585 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19589 16078 0 40 20 ? 1039 ? 08:24:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19703 16078 0 40 20 ? 1039 ? 08:24:17 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19723 16078 0 40 20 ? 1039 ? 08:24:20 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19745 16078 0 40 20 ? 1039 ? 08:24:21 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19757 16078 0 40 20 ? 1039 ? 08:24:21 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19854 16078 0 40 20 ? 1039 ? 08:24:31 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19859 16078 0 40 20 ? 1039 ? 08:24:31 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19864 16078 0 40 20 ? 1039 ? 08:24:31 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19869 16078 0 40 20 ? 1039 ? 08:24:31 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19894 16078 0 40 20 ? 1039 ? 08:24:33 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 27132 16078 0 40 20 ? 1039 ? 00:04:01 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 28106 16078 0 40 20 ? 1039 ? 14:56:11 ? 0:00 /usr/local/nagios/bin/nagios -d /us
0 S radmin 19932 16127 0 70 20 ? 357 ? 08:24:35 pts/2 0:00 grep /usr/local/nagios/bin/nagios

Now we have total 71 processes.

proddmin@nagios:~$ ps -ef | grep 16078
root 16078 16078 0 Sep 10 ? 0:00 zsched

Best Regards,
Anurag
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Multiple instances f same nagios processes

Post by mguthrie »

I'm noticing you have the owner radmin for all of these processes. The nagios user should probably be the only owner of the nagios process. If either the radmin or nagios users don't have file permission to control the lock file, a new instance will probably just get spawned each time a restart command is issued to nagios. I'm guessing the answer is in the process ownership.
anuragvidyarthi
Posts: 4
Joined: Fri Jan 27, 2012 8:56 am

Re: Multiple instances f same nagios processes

Post by anuragvidyarthi »

Thanks for reply !

Nagios is installed in radmin account i.e everything is done/managed by radmin.
We do not have other user who is owning any nagios process.

BR,
Anurag
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Multiple instances f same nagios processes

Post by mguthrie »

Ok, if you're using the /etc/init.d/nagios script to start and stop nagios (or service nagios restart), you may want to review the init script and make sure that user has permissions to control the lock file. If for some reason a user could restart the nagios process, but didn't have permission to read or create a lock file, that could cause multiple instances.
anuragvidyarthi
Posts: 4
Joined: Fri Jan 27, 2012 8:56 am

Re: Multiple instances f same nagios processes

Post by anuragvidyarthi »

We use SMF to manage the nagios start/stop and that menifest has following entry to start and stop
/reuters/etc/smf_nagios start
/reuters/etc/smf_nagios stop

I checked that radmin has owner of this file and directory. file /reuters/etc/smf_nagios actually calls the following command :
/usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg

radmin has all permissions on :
/usr/local/nagios/bin/nagios
/usr/local/nagios/etc/nagios.cfg

Still I checked /etc/init.d/nagios and found that radmin does not have any permission on it and only root who has all permissions on it.
Locked