We are running Nagios XI 5.8.6 and the monitoring engine keeps dying randomly. Im not finding errors in the nagios log?
Ive done some digging in the forum but all seem to be a case by case basis.
Monitoring Engines keeps dying randomly
Re: Monitoring Engines keeps dying randomly
Hello @isadmin
Thanks for reaching out, and want to take a look at the System Profile so we can see what is going on.
To send us your system profile.
Perry
Thanks for reaching out, and want to take a look at the System Profile so we can see what is going on.
Code: Select all
journalctl -u nagios.service -o verbose > /tmp/journal.txt
- Login to the Nagios XI GUI using a web browser.
- Click the "Admin" > "System Profile" Menu
- Click the "Download Profile" button
- Save the profile.zip file and share both '/tmp/journal.txt and profile.zip in a private message
Perry
Re: Monitoring Engines keeps dying randomly
Perry
I sent the journal via PM but Profile is over 20mb and cannnot be sent via PM. Is there another way I can get that file to you?
I sent the journal via PM but Profile is over 20mb and cannnot be sent via PM. Is there another way I can get that file to you?
Re: Monitoring Engines keeps dying randomly
Perry on a separate note we have to keep deleting the /var/log/php-fpm/www-error.log it keeps filling the drive.
Re: Monitoring Engines keeps dying randomly
Hello @isadmin
Thanks for following up, please use the split command and [PM] them separately.
Please send the 'systemprofileincremented_alphabet file in separate PM.
I will look into the '/var/log/php-fpm/www-error.log' issue and follow up.
Thanks,
Perry
Thanks for following up, please use the split command and [PM] them separately.
Code: Select all
split -b 45M profile.zip /tmp/systemprofile
I will look into the '/var/log/php-fpm/www-error.log' issue and follow up.
Thanks,
Perry
Re: Monitoring Engines keeps dying randomly
Perry I have PM you all 3 files for the Profile.
Thanks
Thanks
Re: Monitoring Engines keeps dying randomly
Hello @isadmin
Thanks for sending over the System Profile, in review we see issues where Performance data is going into timeout and then "sig error".
Here is a support article that walks you through the crucial points on how to optimize.
Change the "load_threshold" to 20 in the "/usr/local/nagios/etc/pnp/npcd.cfg" file:
and restart npcd:
I am not entirely clear why we see increased logging in the '/var/log/php-fpm/www-error.log.' Perhaps an admin turned (debug or verbose) logging on. Option to verify the configuration for the logging and either disable logging or implement logrotation on it. This will point you to the config location:
If changes are made please bounce the httpd (apache) services.
Thanks,
Perry
Thanks for sending over the System Profile, in review we see issues where Performance data is going into timeout and then "sig error".
Here is a support article that walks you through the crucial points on how to optimize.
Change the "load_threshold" to 20 in the "/usr/local/nagios/etc/pnp/npcd.cfg" file:
Code: Select all
load_threshold = 20.0
Code: Select all
service npcd restart
Code: Select all
grep -Eir 'www-error' /etc/httpd/ /etc/php* -A 2
Thanks,
Perry
Re: Monitoring Engines keeps dying randomly
Thanks Perry I made the changes and turned off logging in the cfg.
;php_admin_value[sendmail_path] = /usr/sbin/sendmail -t -i -f www@my.domain.com
;php_flag[display_errors] = off
;php_admin_value[error_log] = /var/log/php-fpm/www-error.log
;php_admin_flag[log_errors] = on
;php_admin_value[memory_limit] = 128M
We will monitor and see. It usually crashes every few days or so.
;php_admin_value[sendmail_path] = /usr/sbin/sendmail -t -i -f www@my.domain.com
;php_flag[display_errors] = off
;php_admin_value[error_log] = /var/log/php-fpm/www-error.log
;php_admin_flag[log_errors] = on
;php_admin_value[memory_limit] = 128M
We will monitor and see. It usually crashes every few days or so.
Re: Monitoring Engines keeps dying randomly
Hello @isadmin
Thanks for following, please let us know how things are looking in a couple of days.
Regards,
Perry
Thanks for following, please let us know how things are looking in a couple of days.
Regards,
Perry