
Error log show different type of errors

Posted: Mon Aug 19, 2019 7:41 am
by benningtonr
Before the last update to 5.6.3 (I am on 5.6.5 now), my logs looked normal. If there was an error, it would show once, and then again on the next check. Now a single error is in there 3-4 times, along with wproc stuff. Can someone let me know if this is the new normal, or if something is not configured correctly?

Thanks

Re: Error log show different type of errors

Posted: Mon Aug 19, 2019 11:29 am
by benjaminsmith
Hello @benningtonr,

Thanks for providing some details about the issue. I have a couple other questions.

1. Are you having any performance issues (e.g., high CPU load) or network issues with your server?

2. I noticed one of your hosts is timing out (see image). Try increasing the timeout value in the check command for this host as 30 seconds may not be enough time to get valid results.
[Image: host-time-out.png]
If you're still having issues, please send us your system profile to review.

To send us your system profile:
1. Log in to the Nagios XI GUI using a web browser.
2. Click "Admin" > "System Profile".
3. Click the "Download Profile" button.
4. Save the profile.zip file, share it in a private message, and then reply to this post to bring it up in the queue.

Re: Error log show different type of errors

Posted: Mon Aug 19, 2019 11:41 am
by benningtonr
The timeout is valid; Salesforce was making updates, which caused that particular URL to be unresponsive. My question has to do with all the extra log entries. Six months ago, before I updated anything, I would get 5 critical errors, and then the next log entry would be about Nagios emailing the contacts who need to be aware of the URL being down. Now it seems I am getting pages of these wproc and kill notices. It is so cluttered that I was wondering whether this is the new normal or whether something is not configured correctly. I don't want to see 5 entries for one event, just the one "hey, this is critical" and that is all.

As far as the Nagios load goes, here are some clips. It looks fine to me, but you guys know a lot more about it than I do.

Thanks again for any and all assistance.

If you still need the config, let me know and I will get it sent out.

Re: Error log show different type of errors

Posted: Mon Aug 19, 2019 1:17 pm
by benjaminsmith
Hello @benningtonr,

That does look normal, and the entries you are seeing in the Event Log are normal when a host or service check times out. We can take a look at your configuration file to check the debug level and settings.
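
If the worker detail lines make the log hard to scan in the meantime, one option is to filter them out when reading it. This is only a sketch: `hide_wproc` is a made-up helper name, and it assumes the extra entries all contain the literal text `wproc:`.

```shell
#!/bin/sh
# Hypothetical helper: print a Nagios log without the worker-process lines.
# Assumption: the noisy entries all contain the literal text "wproc:".
hide_wproc() {
    # -v inverts the match, so only lines WITHOUT "wproc:" are printed.
    grep -v 'wproc:' "$1"
}
```

For example, `hide_wproc /usr/local/nagios/var/nagios.log | tail -n 50`, assuming the log is in the default Nagios XI location. This only changes what you see on screen; nothing is removed from the log itself.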

Is Salesforce still making updates? If no, how do the logs look right now?

Also, please post the Nagios configuration file located at the path below (or send us your system profile):

Code:

/usr/local/nagios/etc/nagios.cfg
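
If you would rather check the debug settings yourself first, something along these lines should print just the relevant directives. Treat it as a rough sketch: `show_debug_settings` is a hypothetical helper name, and it assumes the directives appear uncommented in the usual `name=value` form.

```shell
#!/bin/sh
# Hypothetical helper: list the active debug_* directives in a Nagios config.
# Lines starting with "#" are comments and are skipped by the anchored pattern.
show_debug_settings() {
    grep -E '^[[:space:]]*debug_(level|verbosity|file)[[:space:]]*=' "$1"
}
```

Run it as `show_debug_settings /usr/local/nagios/etc/nagios.cfg`. A `debug_level` of 0 means debug logging is turned off.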

Re: Error log show different type of errors

Posted: Mon Aug 19, 2019 1:22 pm
by benningtonr
No, they have corrected the issue, so the logs look fine now. It just seems like a lot of entries just to tell me something is down. Here is the profile, if you could take a look to make sure we are all good. Thanks.

Re: Error log show different type of errors

Posted: Mon Aug 19, 2019 1:24 pm
by benningtonr
I sent the file via PM.

Re: Error log show different type of errors

Posted: Mon Aug 19, 2019 2:59 pm
by benjaminsmith
Thanks for sending over the system profile. The Event Log did clear up; however, I did notice some errors in the database log.
180712 23:00:03 [ERROR] Got error 127 when reading table './nagios/nagios_servicestatus'
180712 23:00:03 [ERROR] Got error 127 when reading table './nagios/nagios_servicestatus'
180712 23:00:03 [ERROR] /usr/libexec/mysqld: Incorrect key file for table './nagios/nagios_servicestatus.MYI'; try to repair it
Let's go ahead and run the database repair script. Log in as root to the terminal and run the following command:

Code:

/usr/local/nagiosxi/scripts/repair_databases.sh
See: Repairing The Nagios XI Databases

After running the script, you can check the database logs for any errors:

Code:

tail /var/log/mysqld.log
See: Nagios XI Log Locations and Descriptions
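
If the log is long, counting the error lines before and after the repair can be easier than reading the tail. This is a minimal sketch, assuming errors are tagged with the literal text `[ERROR]` as in the excerpt above; `count_db_errors` is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: count lines tagged [ERROR] in a MySQL log file.
count_db_errors() {
    # -F treats the pattern as a fixed string; -c prints the match count.
    # grep exits 1 when the count is 0, which is not a failure here, so
    # only other exit codes (e.g. unreadable file) are passed through.
    grep -cF '[ERROR]' "$1" || [ "$?" -eq 1 ]
}
```

For example, `count_db_errors /var/log/mysqld.log`. Keep in mind that old entries stay in the log, so a non-zero count after the repair is only a concern if new lines with fresh timestamps keep appearing.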

The other thing I noticed is that you have used about 70% of the disk space on the main partition. This is not an immediate concern, but at some point you may have to increase the disk space. If you haven't already, you can set up a check to monitor this.
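
For a quick manual look at the same number, a sketch like the one below reads the usage percentage from `df`. The function names and the threshold are illustrative assumptions, not part of Nagios XI; for ongoing monitoring, the stock `check_disk` plugin is the usual tool.

```shell
#!/bin/sh
# Hypothetical helpers for a one-off disk usage check.

# Print the used percentage (digits only) for a mount point, default "/".
usage_pct() {
    # df -P keeps each filesystem on one line; field 5 is the capacity column.
    df -P "${1:-/}" | awk 'NR==2 { sub(/%/, "", $5); print $5 }'
}

# $1 = mount point, $2 = warning threshold in percent.
# Returns 1 at or above the threshold, mirroring the Nagios WARNING code.
check_usage() {
    pct=$(usage_pct "$1")
    if [ "$pct" -ge "$2" ]; then
        echo "WARNING: $1 is ${pct}% full"
        return 1
    fi
    echo "OK: $1 is ${pct}% full"
}
```

For example, `check_usage / 80` reports whether the main partition has reached 80%.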