5.7.2 Monitoring Engine Stops reason wproc: iocache_read

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Ludwik.Warszawa
Posts: 5
Joined: Wed Apr 22, 2020 6:51 am

5.7.2 Monitoring Engine Stops reason wproc: iocache_read

Post by Ludwik.Warszawa »

Hello,

I'm seeing strange behavior with Nagios XI 5.7.2: the monitoring engine stops. It just dies.

It happens at various times (usually in the evening), and the monitoring engine stays in this state until 5:00 AM.

There is unusual output in the logs (many, many lines):

Code:

[1596131239] wproc: iocache_read() from Core Worker 32513 returned -1: Bad address
[1596131239] wproc: iocache_read() from Core Worker 32513 returned -1: Bad address
[1596131239] wproc: iocache_read() from Core Worker 32513 returned -1: Bad address

The watchdog Nagios (another Nagios instance watching this one) does not report anything, not anymore.

This has been happening since the update to 5.6.14. Sometimes there was a SIGTERM in the log as the last line, so I wrote a script that checks the content of the last line and notifies me when a SIGTERM occurs, but the recent events are without that.
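
For reference, a check along those lines could look like the sketch below. It is not the original script, only a minimal illustration: it counts the iocache_read errors, converts the first timestamp to UTC, and reports whether the last log line mentions SIGTERM. The function names are made up for this example, and the log path /usr/local/nagios/var/nagios.log is an assumption (adjust it if your install differs).

```python
import re
from datetime import datetime, timezone

# Assumed log location; change it if your installation differs.
DEFAULT_LOG = "/usr/local/nagios/var/nagios.log"

# Matches lines such as:
# [1596131239] wproc: iocache_read() from Core Worker 32513 returned -1: Bad address
IOCACHE_RE = re.compile(
    r"^\[(\d+)\] wproc: iocache_read\(\) from Core Worker (\d+) returned -1"
)


def summarize(lines):
    """Count iocache_read errors and check the last non-empty line for SIGTERM."""
    count = 0
    first_ts = None
    last_line = ""
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        last_line = line
        match = IOCACHE_RE.match(line)
        if match:
            count += 1
            if first_ts is None:
                first_ts = int(match.group(1))
    first_seen = None
    if first_ts is not None:
        # Nagios log timestamps are Unix epoch seconds.
        first_seen = datetime.fromtimestamp(first_ts, tz=timezone.utc).isoformat()
    return {
        "iocache_errors": count,
        "first_seen_utc": first_seen,
        "sigterm_last": "sigterm" in last_line.lower(),
    }


def summarize_file(path=DEFAULT_LOG):
    """Run summarize() over a log file on disk."""
    with open(path, errors="replace") as fh:
        return summarize(fh)
```

Calling summarize_file() from a cron job and sending a notification when "iocache_errors" is non-zero would also cover the recent events that have no SIGTERM line.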

We (my coworker Dariusz and I) migrated from 5.6.14 to 5.7.0, then to 5.7.1, and finally to 5.7.2, so we are always up to date.

I've read about a solution, but it is for 5.6.x, not 5.7.x. Any hints?

benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: 5.7.2 Monitoring Engine Stops reason wproc: iocache_rea

Post by benjaminsmith »

Hi Ludwik,

What is the total check load of the server (hosts + services)? I'd like to get a system profile to review the logs.

To send us your system profile:
1. Log in to the Nagios XI GUI using a web browser.
2. Click the "Admin" > "System Profile" menu.
3. Click the "Download Profile" button.
4. Save the profile.zip file and share it in a private message or upload it to the post/ticket, then reply to this post to bring it up in the queue.

In the meantime, let's increase the number of processes allowed. Edit the /etc/security/limits.conf file and add the following to the bottom of the file.

Code:

*   soft   nproc  262144
*   hard   nproc  262144

Save the change and reboot the server for the change to take effect, and let me know if you notice any improvement.
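
After the reboot, one way to confirm that the new limit is active is to read it back from a running process. Below is a minimal Python 3 sketch, assuming a Linux host; the function names are illustrative and not part of Nagios XI. Limits are per user session, so run it under the account whose limit you want to check.

```python
import resource

TARGET = 262144  # the nproc value added to /etc/security/limits.conf


def nproc_limits():
    """Return the (soft, hard) process-count limits of the current process."""
    return resource.getrlimit(resource.RLIMIT_NPROC)


def fmt(limit):
    """Render a limit, showing RLIM_INFINITY as 'unlimited'."""
    return "unlimited" if limit == resource.RLIM_INFINITY else str(limit)


def meets_target(limit, target=TARGET):
    """True if the limit is unlimited or at least the target value."""
    return limit == resource.RLIM_INFINITY or limit >= target


def report():
    """Build a one-line summary of the current nproc limits."""
    soft, hard = nproc_limits()
    ok = meets_target(soft) and meets_target(hard)
    return f"nproc soft={fmt(soft)} hard={fmt(hard)} target_met={ok}"
```

Printing report() in a fresh login session shows whether both the soft and hard values have reached 262144.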

Reference:

How to set nproc (Hard and Soft) Values in CentOS / RHEL 5,6,7