Nagios distributed monitoring

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
whitest
Posts: 107
Joined: Tue Dec 30, 2014 8:16 am

Re: Nagios distributed monitoring

Post by whitest »

tmcdonald or abrist,
how many nagiosfusion-servers I need to pelace my current solution? Now I have 1 Master-server in Head-office and 4 Slave-servers in regions.
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Nagios distributed monitoring

Post by tmcdonald »

whitest wrote:tmcdonald or abrist,
how many nagiosfusion-servers I need to pelace my current solution? Now I have 1 Master-server in Head-office and 4 Slave-servers in regions.
How many total hosts and services do you have across those machines? Really since Fusion is only displaying data and not running checks it should not matter and 1 Fusion machine will suffice, but if you have a crazy amount (like 1 million objects) it might get sluggish.
Former Nagios employee
whitest
Posts: 107
Joined: Tue Dec 30, 2014 8:16 am

Re: Nagios distributed monitoring

Post by whitest »

Luckly there are not crazy amount =)
It wiil be 7-8 regions. 2-3k services on each.
So, I need I Fushion as Central desktop and 7-8 Nagios Cores in regions?
klajosh2
Posts: 38
Joined: Thu Jan 16, 2014 5:22 am

Re: Nagios distributed monitoring

Post by klajosh2 »

But with this setup you would loose the feature of the central configuration.
You have to maintain 7-8 nagios instances + nagiosfusion. Is this the road you want to hit?
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Nagios distributed monitoring

Post by abrist »

klajosh2 wrote:But with this setup you would loose the feature of the central configuration.
Indeed. Mod_gearman currently is the only solid way to really run real distributed checks with 1 central config management server. Otherwise, you will need to maintain multiple separate configurations. Though, with enough forethought, a tool like puppet/chef and some clever scripting could make the management of multiple configs a breeze.
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
whitest
Posts: 107
Joined: Tue Dec 30, 2014 8:16 am

Re: Nagios distributed monitoring

Post by whitest »

klajosh2 wrote:But with this setup you would loose the feature of the central configuration.
You have to maintain 7-8 nagios instances + nagiosfusion. Is this the road you want to hit?
My client wanted to have sepatare nagios for each region for active checks, notifies and dashboard (for local stuff). Central Nagios checks services only in head-office and recieves check-results from all region-nagioses. Central Nagios displays all network view for stuff in head-office.

I used scripting with Rsync and Passwordless SSH (in /etc/init.d/nagios) to acheive central configuration. Config files has been organized accordingly.
jdalrymple
Skynet Drone
Posts: 2620
Joined: Wed Feb 11, 2015 1:56 pm

Re: Nagios distributed monitoring

Post by jdalrymple »

Going back to the original point of contention - I'd argue that mod_gearman is perfectly stable. We have many clients running gearman workers that are processing as many checks as your entire environment put together. It has proven pretty reliable for us. If you're experiencing mod_gearman crashes I suggest troubleshooting there and resolving that. As abrist said though - Fusion may be an even more appropriate solution for you depending on your use case.
whitest
Posts: 107
Joined: Tue Dec 30, 2014 8:16 am

Re: Nagios distributed monitoring

Post by whitest »

jdalrymple wrote:Going back to the original point of contention - I'd argue that mod_gearman is perfectly stable. We have many clients running gearman workers that are processing as many checks as your entire environment put together. It has proven pretty reliable for us. If you're experiencing mod_gearman crashes I suggest troubleshooting there and resolving that. As abrist said though - Fusion may be an even more appropriate solution for you depending on your use case.
Does mod_gearman support Nagios 4.0 - Nagios 4.1 properly ?
I realy many times experienced orhanted checks in mod_gearman =(
jdalrymple
Skynet Drone
Posts: 2620
Joined: Wed Feb 11, 2015 1:56 pm

Re: Nagios distributed monitoring

Post by jdalrymple »

Yes - but it's version dependent. Described here.
klajosh2
Posts: 38
Joined: Thu Jan 16, 2014 5:22 am

Re: Nagios distributed monitoring

Post by klajosh2 »

jdalrymple wrote:Going back to the original point of contention - I'd argue that mod_gearman is perfectly stable. We have many clients running gearman workers that are processing as many checks as your entire environment put together. It has proven pretty reliable for us. If you're experiencing mod_gearman crashes I suggest troubleshooting there and resolving that. As abrist said though - Fusion may be an even more appropriate solution for you depending on your use case.
Maybe this worth another thread: What is the best way to debug mod_gearman? I am asking because when crash comes I am usually not sitting in front of the server checking log files.
What I noticed - what is very strange actually - is that I do not see orphaned checks just checks are not executed. only thing helps is to restart main nagios process and gearmand service.
After this checks are starting. I am using nagios core 3.5.1 and gearmand 0.33.

klajosh
Locked