ncpa 2.3.1-1 on RHEL7

Posted: Tue Mar 16, 2021 1:34 pm
by cstuart
I noticed that, for some of my hosts, ncpa checks from the XI server are timing out. The common thread for those hosts is that they're running ncpa-2.3.1-1 and the others are running 2.2.2-1.

Are there any known issues?

Re: ncpa 2.3.1-1 on RHEL7

Posted: Tue Mar 16, 2021 2:01 pm
by cstuart
The host shows as up in XI, but all of the service checks come back with "(Service check timed out after 60.01 seconds)".

Re: ncpa 2.3.1-1 on RHEL7

Posted: Wed Mar 17, 2021 7:47 am
by cstuart
I downgraded to 2.3.0 and it still failed to respond. I downgraded again, this time to 2.2.1, and it came back.

Re: ncpa 2.3.1-1 on RHEL7

Posted: Wed Mar 17, 2021 10:54 am
by lmiltchev
What are the operating systems/distros/architectures of the remote machines (clients)? What is the NCPA agent's version that you are running on them?

Code: Select all

/usr/local/nagios/libexec/check_ncpa.py -H <client ip> -t <token> -M 'system/agent_version'
What kind of checks are the failing ones? Can you show us an example of the check that is timing out?
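
If several clients are involved, the version query above can be wrapped in a loop so the working and failing hosts are easy to compare. This is only a rough sketch, not an official tool: the `report_agent_versions` function, the `NCPA_CHECK` override variable, and the one-address-per-line hosts file are all made up for illustration. The plugin path and flags are the ones from the command above.

```shell
#!/bin/sh
# Sketch: print the NCPA agent version reported by each host in a list.
# Assumptions (not from the thread): NCPA_CHECK is a hypothetical override
# for the plugin path, and the hosts file holds one address per line.

NCPA_CHECK="${NCPA_CHECK:-/usr/local/nagios/libexec/check_ncpa.py}"

report_agent_versions() {
    token="$1"
    hosts_file="$2"
    while IFS= read -r host; do
        # Skip blank lines and comment lines in the hosts file.
        case "$host" in ''|'#'*) continue ;; esac
        # </dev/null keeps the plugin from consuming the rest of the hosts file.
        result=$("$NCPA_CHECK" -H "$host" -t "$token" -M 'system/agent_version' 2>&1 </dev/null)
        printf '%s: %s\n' "$host" "$result"
    done < "$hosts_file"
}

# Example call (hypothetical file name):
#   report_agent_versions '<token>' ./ncpa_hosts.txt
```

A host that hangs will hold up the loop until the plugin gives up, which at least shows which agents are not answering.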

Re: ncpa 2.3.1-1 on RHEL7

Posted: Wed Apr 14, 2021 1:36 pm
by cstuart
Sorry for the delay. It's still happening on 2.3.1.

All checks are doing this: CPU Usage, Disk Usage on..., Memory Usage, etc. They all come back with:

Service check timed out after 60.01 seconds

If I roll it back to 2.2.2 it works.

With either version, the host status is fine; it is the checks/services that time out.

Re: ncpa 2.3.1-1 on RHEL7

Posted: Wed Apr 14, 2021 2:47 pm
by cstuart
All hosts are RHEL7. The agent that works is 2.2.2. Once upgraded to 2.3.x, host checks succeed, but all service checks on the host time out.

Re: ncpa 2.3.1-1 on RHEL7

Posted: Thu Apr 15, 2021 12:01 pm
by ssax
Upgrade one of the systems to NCPA 2.3.1, then edit the ncpa.cfg on the remote system and under the [listener] section change this:

Code: Select all

loglevel = info
To this:

Code: Select all

loglevel = debug
Then restart the ncpa_listener service:

Code: Select all

systemctl restart ncpa_listener
Then run this check from an SSH session on the XI server and attach the full output of the command:

Code: Select all

/usr/local/nagios/libexec/check_ncpa.py -H X.X.X.X -t 'yourtoken' -M 'system/agent_version' -D -v
Then attach your ncpa_listener.log from the remote system after it times out.
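
For anyone who would rather script the loglevel change than edit the file by hand, something like the following should work. It is a minimal sketch under assumptions: `set_listener_loglevel` is a hypothetical helper, and the config path in the usage comment is the usual Linux install location, which the thread itself does not confirm. The function keeps a `.bak` copy and only touches the `loglevel` line inside the `[listener]` section.

```shell
#!/bin/sh
# Sketch: change loglevel in the [listener] section of ncpa.cfg.
# set_listener_loglevel is a hypothetical helper, not part of NCPA.

set_listener_loglevel() {
    cfg="$1"
    level="$2"
    # Keep a backup so the original setting can be restored afterwards.
    cp -p "$cfg" "$cfg.bak" || return 1
    # Limit the substitution to the lines from "[listener]" up to the next
    # section header, so loglevel in other sections is left alone.
    sed "/^\[listener\]/,/^\[/ s/^loglevel[[:space:]]*=.*/loglevel = $level/" \
        "$cfg.bak" > "$cfg"
}

# Usage on the remote system, as root. The path is an assumption; adjust it
# if ncpa.cfg lives elsewhere on your install:
#   set_listener_loglevel /usr/local/ncpa/etc/ncpa.cfg debug
#   systemctl restart ncpa_listener
```

Once the log has been collected, running the same function with `info` and restarting the service puts the original level back.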