Load Balancing Nagios XI

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
rbuckle
Posts: 146
Joined: Tue Aug 04, 2015 12:25 pm
Location: Idaho
Contact:

Load Balancing Nagios XI

Post by rbuckle »

Is it currently possible to load balance Nagio XI checks across hosts? can you make a cluster?

Basically my host can run up to 100% load across its CPU cores, I would like to know if i can add another machine to load balance the nagios core checks? it doesnt really need to run XI, just take some of the load.
----------------------------------
Running:
Installed Version: 5.8.3
OS: CentOS 7
Nagios XI
Enterprise License
User avatar
cdienger
Support Tech
Posts: 5045
Joined: Tue Feb 07, 2017 11:26 am

Re: Load Balancing Nagios XI

Post by cdienger »

Mod-gearman can be used to distribute checks among hosts and take some of the load off of the XI server. See https://assets.nagios.com/downloads/nag ... ios_XI.pdf.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
rbuckle
Posts: 146
Joined: Tue Aug 04, 2015 12:25 pm
Location: Idaho
Contact:

Re: Load Balancing Nagios XI

Post by rbuckle »

i get a lot of:

CRITICAL: Return code of 127 is out of bounds. Make sure the plugin you're trying to run actually exists. (worker: w1)

do i need nagios core installed on the workers?
----------------------------------
Running:
Installed Version: 5.8.3
OS: CentOS 7
Nagios XI
Enterprise License
rbuckle
Posts: 146
Joined: Tue Aug 04, 2015 12:25 pm
Location: Idaho
Contact:

Re: Load Balancing Nagios XI

Post by rbuckle »

And i re-read through the document and installed the plugins, still get the error
----------------------------------
Running:
Installed Version: 5.8.3
OS: CentOS 7
Nagios XI
Enterprise License
ssax
Dreams In Code
Posts: 7682
Joined: Wed Feb 11, 2015 12:54 pm

Re: Load Balancing Nagios XI

Post by ssax »

What are the permissions of the directory and files?

Code: Select all

ls -ld /usr/local/nagios/libexec
ls -l /usr/local/nagios/libexec
Did you restart the gearman worker service after installing the plugins?

Code: Select all

service mod-gearman-worker restart
rbuckle
Posts: 146
Joined: Tue Aug 04, 2015 12:25 pm
Location: Idaho
Contact:

Re: Load Balancing Nagios XI

Post by rbuckle »

and i have restarted the worker service

The permissions are all

root:root

drwxr-xr-x 2 root root 4096 Apr 16 22:14 /usr/local/nagios/libexec

-rwxr-xr-x 1 root root 240152 Apr 16 22:14 check_apt
-rwxr-xr-x 1 root root 2354 Apr 16 22:14 check_breeze
-rwxr-xr-x 1 root root 245216 Apr 16 22:14 check_by_ssh
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_clamd -> check_tcp
-rwxr-xr-x 1 root root 182768 Apr 16 22:14 check_cluster
-rwxr-xr-x 1 root root 246592 Apr 16 22:14 check_dbi
-r-sr-xr-x 1 root root 247008 Apr 16 22:14 check_dhcp
-rwxr-xr-x 1 root root 232960 Apr 16 22:14 check_dig
-rwxr-xr-x 1 root root 250632 Apr 16 22:14 check_disk
-rwxr-xr-x 1 root root 9477 Apr 16 22:14 check_disk_smb
-rwxr-xr-x 1 root root 260200 Apr 16 22:14 check_dns
-rwxr-xr-x 1 root root 136512 Apr 16 22:14 check_dummy
-rwxr-xr-x 1 root root 3868 Apr 16 22:14 check_file_age
-rwxr-xr-x 1 root root 6512 Apr 16 22:14 check_flexlm
-rwxr-xr-x 1 root root 242600 Apr 16 22:14 check_fping
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_ftp -> check_tcp
-rwxr-xr-x 1 root root 205680 Apr 16 22:14 check_game
-rwxr-xr-x 1 root root 235048 Apr 16 22:14 check_hpjd
-rwxr-xr-x 1 root root 373712 Apr 16 22:14 check_http
-r-sr-xr-x 1 root root 258528 Apr 16 22:14 check_icmp
-rwxr-xr-x 1 root root 190232 Apr 16 22:14 check_ide_smart
-rwxr-xr-x 1 root root 15283 Apr 16 22:14 check_ifoperstatus
-rwxr-xr-x 1 root root 13435 Apr 16 22:14 check_ifstatus
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_imap -> check_tcp
-rwxr-xr-x 1 root root 6992 Apr 16 22:14 check_ircd
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_jabber -> check_tcp
-rwxr-xr-x 1 root root 220296 Apr 16 22:14 check_ldap
lrwxrwxrwx 1 root root 10 Apr 16 22:14 check_ldaps -> check_ldap
-rwxr-xr-x 1 root root 205752 Apr 16 22:14 check_load
-rwxr-xr-x 1 root root 6673 Apr 16 22:14 check_log
-rwxr-xr-x 1 root root 22741 Apr 16 22:14 check_mailq
-rwxr-xr-x 1 root root 192672 Apr 16 22:14 check_mrtg
-rwxr-xr-x 1 root root 194648 Apr 16 22:14 check_mrtgtraf
-rwxr-xr-x 1 root root 237248 Apr 16 22:14 check_mysql
-rwxr-xr-x 1 root root 222936 Apr 16 22:14 check_mysql_query
-rwxr-xr-x 1 root root 206872 Apr 16 22:14 check_nagios
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_nntp -> check_tcp
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_nntps -> check_tcp
-rwxr-xr-x 1 root root 249456 Apr 16 22:14 check_nt
-rwxr-xr-x 1 root root 256936 Apr 16 22:14 check_ntp
-rwxr-xr-x 1 root root 242096 Apr 16 22:14 check_ntp_peer
-rwxr-xr-x 1 root root 235504 Apr 16 22:14 check_ntp_time
-rwxr-xr-x 1 root root 264288 Apr 16 22:14 check_nwstat
-rwxr-xr-x 1 root root 9461 Apr 16 22:14 check_oracle
-rwxr-xr-x 1 root root 218640 Apr 16 22:14 check_overcr
-rwxr-xr-x 1 root root 231144 Apr 16 22:14 check_pgsql
-rwxr-xr-x 1 root root 249176 Apr 16 22:14 check_ping
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_pop -> check_tcp
-rwxr-xr-x 1 root root 248064 Apr 16 22:14 check_procs
-rwxr-xr-x 1 root root 215520 Apr 16 22:14 check_radius
-rwxr-xr-x 1 root root 215600 Apr 16 22:14 check_real
-rwxr-xr-x 1 root root 9687 Apr 16 22:14 check_rpc
-rwxr-xr-x 1 root root 1537 Apr 16 22:14 check_sensors
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_simap -> check_tcp
-rwxr-xr-x 1 root root 277584 Apr 16 22:14 check_smtp
-rwxr-xr-x 1 root root 292824 Apr 16 22:14 check_snmp
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_spop -> check_tcp
-rwxr-xr-x 1 root root 209144 Apr 16 22:14 check_ssh
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_ssmtp -> check_tcp
-rwxr-xr-x 1 root root 185528 Apr 16 22:14 check_swap
-rwxr-xr-x 1 root root 262704 Apr 16 22:14 check_tcp
-rwxr-xr-x 1 root root 209200 Apr 16 22:14 check_time
lrwxrwxrwx 1 root root 9 Apr 16 22:14 check_udp -> check_tcp
-rwxr-xr-x 1 root root 226192 Apr 16 22:14 check_ups
-rwxr-xr-x 1 root root 181192 Apr 16 22:14 check_uptime
-rwxr-xr-x 1 root root 175336 Apr 16 22:14 check_users
-rwxr-xr-x 1 root root 3040 Apr 16 22:14 check_wave
-rwxr-xr-x 1 root root 172552 Apr 16 22:14 negate
-rwxr-xr-x 1 root root 167232 Apr 16 22:14 urlize
-rwxr-xr-x 1 root root 1954 Apr 16 22:14 utils.pm
-rwxr-xr-x 1 root root 2791 Apr 16 22:14 utils.sh
----------------------------------
Running:
Installed Version: 5.8.3
OS: CentOS 7
Nagios XI
Enterprise License
ssax
Dreams In Code
Posts: 7682
Joined: Wed Feb 11, 2015 12:54 pm

Re: Load Balancing Nagios XI

Post by ssax »

Needs proper permissions, please run these commands:

Code: Select all

chown -R nagios.nagios /usr/local/nagios/libexec
service mod-gearman-worker restart
rbuckle
Posts: 146
Joined: Tue Aug 04, 2015 12:25 pm
Location: Idaho
Contact:

Re: Load Balancing Nagios XI

Post by rbuckle »

Still errors
You do not have the required permissions to view the files attached to this post.
----------------------------------
Running:
Installed Version: 5.8.3
OS: CentOS 7
Nagios XI
Enterprise License
ssax
Dreams In Code
Posts: 7682
Joined: Wed Feb 11, 2015 12:54 pm

Re: Load Balancing Nagios XI

Post by ssax »

Looks like it's working for some of them, you need to make sure that those plugins used by those checks are on the workers.

What are you seeing in your modgearman logs in /var/log/mod_gearman2/ or /var/log/mod_gearman/?
rbuckle
Posts: 146
Joined: Tue Aug 04, 2015 12:25 pm
Location: Idaho
Contact:

Re: Load Balancing Nagios XI

Post by rbuckle »

should be:

# Path to the logfile.
logfile=/var/lib/log/mod_gearman/mod_gearman_worker.log


in the worker.conf but there is no folder in logs
----------------------------------
Running:
Installed Version: 5.8.3
OS: CentOS 7
Nagios XI
Enterprise License
Locked