Page 1 of 1

NagiosXI dedicating unresponsive windows server

Posted: Wed Jun 19, 2019 9:28 pm
by imadc
Hello Support

When Host Windows Server with NCPA Agent is undergoing 100% CPU usage

Is there a way for NagiosXI to alert us if a windows server is not responding since ping and telnet are working normally in this case?

or Is there a way to that NagiosXI know that no available resources for an NCPA agent to run on a windows server?

Thanks
Insurity

Re: NagiosXI dedicating unresponsive windows server

Posted: Thu Jun 20, 2019 11:15 am
by lmiltchev
Don't NCPA checks fail when there are "no available resources for an NCPA agent to run on a windows server"? Do you get notifications for the services in hard critical state? What is the status of the "CPU Usage" service?

Re: NagiosXI dedicating unresponsive windows server

Posted: Thu Jun 20, 2019 1:12 pm
by imadc
So what I am getting with 100% CPU on the host

is all NCPA checks returning delayed feed back of - (Service check timed out after 60.01 seconds)

Is there another way to get notified earlier on such use-case? for example another check not through NCPA?

Below is the email that is sent which is not clear that the CPU is 100% and that is why the agent wasnt able to processes monitoring
***** Nagios XI Alert *****
Nagios has detected a problem with this service.
Notification Type: PROBLEM
Service: Srv H Grp - Disk Usage on C:/
Host:
Address:
State: CRITICAL
Info:
(Service check timed out after 61.01 seconds)
Date/Time: 2019-06-20 13:54:5
As for the PING it's status is OK

Thanks
Insurity

Re: NagiosXI dedicating unresponsive windows server

Posted: Thu Jun 20, 2019 2:14 pm
by lmiltchev
Is there another way to get notified earlier on such use-case? for example another check not through NCPA?
Checking internals (various metrics on your server) would always require some type of agent. Even with "agentless" monitoring, e.g. using wmi or snmp, the system would need to have enough resources to perform the checks... NCPA does not require lots of CPU power to run, so I don't think using a different agent would solve the issue.

I would recommend that you find out what is causing the CPU spikes on the Windows server, and fix the issue, so that you could successfully monitor various metrics - disk usage, memory, etc.

Are you using active or passive NCPA checks? Can you post the ncpa_listener.log, ncpa_passive.log, win32service_ncpalistener.log, and win32service_ncpapassive.log on the forum. Please remove/obfuscate sensitive data.

Re: NagiosXI dedicating unresponsive windows server

Posted: Fri Jun 21, 2019 8:40 am
by imadc
Thanks

I will give a try through WMI call since the agent didnt have resources to run in this use case

Re: NagiosXI dedicating unresponsive windows server

Posted: Fri Jun 21, 2019 8:59 am
by lmiltchev
Sounds good. We will keep the thread for the time being. Thank you!