Intermittent CPU usage alerts saying "No answer from host"

Posted: Tue Oct 23, 2018 2:28 pm
by pbsindian
Hi Team,
We are monitoring all our Unix and Windows servers using SNMP. Intermittently we see a "No answer from host" error, and only for the CPU usage check. In some cases the check fails a couple of times and then recovers; in other cases it fails for longer, sends a notification, and recovers later.

Please let us know what could be causing this and how to fix it.


Type              Date / Time          Information
Service Recovery  2018-10-23 04:32:00  SERVICE ALERT: whcinsXXX01;CPU Usage;OK;HARD;1;120 CPU, average load 1.0% < 89% : OK
Service Unknown   2018-10-23 04:30:58  SERVICE ALERT: whcinsXXX01;CPU Usage;UNKNOWN;SOFT;2;No answer from host
Service Unknown   2018-10-23 04:29:54  SERVICE ALERT: whcinsXXX01;CPU Usage;UNKNOWN;SOFT;1;No answer from host


Thanks,
Bhargava

Re: Intermittent CPU usage alerts saying "No answer from hos

Posted: Wed Oct 24, 2018 4:22 pm
by cdienger
The plugin could be timing out. You can increase the default 5-second timeout with the "-t" option. For example, adding "-t 30" to the check sets the timeout to 30 seconds.
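
To judge whether a longer timeout actually helps, it may be useful to count the "No answer from host" events before and after the change. The following is only a minimal Python sketch, assuming alert lines in the semicolon-separated form shown in the first post (host;service;state;state type;attempt;output). The function name count_no_answer is hypothetical and not part of any Nagios tooling.

```python
from collections import Counter


def count_no_answer(lines):
    """Count UNKNOWN 'No answer from host' alerts per (host, service).

    Assumes each relevant line contains 'SERVICE ALERT: ' followed by
    host;service;state;state_type;attempt;output, as in the first post.
    """
    counts = Counter()
    for line in lines:
        # Keep only the part after the "SERVICE ALERT: " marker.
        _, marker, payload = line.partition("SERVICE ALERT: ")
        if not marker:
            continue  # not a service alert line
        # Split into at most 6 fields so semicolons in the output survive.
        fields = payload.strip().split(";", 5)
        if len(fields) < 6:
            continue  # unexpected layout, skip rather than guess
        host, service, state, _state_type, _attempt, output = fields
        if state == "UNKNOWN" and "No answer from host" in output:
            counts[(host, service)] += 1
    return counts


# Sample lines copied from the first post.
sample = [
    "Service Recovery 2018-10-23 04:32:00 SERVICE ALERT: whcinsXXX01;CPU Usage;OK;HARD;1;120 CPU, average load 1.0% < 89% : OK",
    "Service Unknown 2018-10-23 04:30:58 SERVICE ALERT: whcinsXXX01;CPU Usage;UNKNOWN;SOFT;2;No answer from host",
    "Service Unknown 2018-10-23 04:29:54 SERVICE ALERT: whcinsXXX01;CPU Usage;UNKNOWN;SOFT;1;No answer from host",
]
no_answer_counts = count_no_answer(sample)
```

Running this over a few days of alerts before and after adding "-t 30" would give a rough comparison. If the counts do not drop, the timeout is probably not the only cause.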

Re: Intermittent CPU usage alerts saying "No answer from hos

Posted: Wed Oct 24, 2018 4:57 pm
by pbsindian
Sure, thank you. We will add the timeout parameter and monitor for a couple of days.

Thanks,
Bhargava

Re: Intermittent CPU usage alerts saying "No answer from hos

Posted: Thu Oct 25, 2018 12:13 pm
by cdienger
Sounds good! We'll be here :)