No_Response from Remote Host

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
User avatar
Kriyeshh
Posts: 18
Joined: Wed May 13, 2015 5:15 pm
Location: India

Re: No_Response from Remote Host

Post by Kriyeshh »

Yes Mr.Jdalrymple.

The outage happening is around for 4 minutes for mount services alone and the service is been checked every minute with an with max_check_attempt set as 3.
So what's happening here is after 3rd recheck attempt nagios is sending an alert... That is on 3rd minute.

Meanwhile on the 4th second the service gets recovered and again recovery notification is been sent to us. From which we can find on 4th minute the service is Ok. So as you suggested if i increase the max_check_attempt to 5, i may get a temporary solution as service is recovered on 4th minute.

Am i right?

Meanwhile let me find the cron status and process status as static fix!!
Cheers,
-Kriyeshh
jdalrymple
Skynet Drone
Posts: 2620
Joined: Wed Feb 11, 2015 1:56 pm

Re: No_Response from Remote Host

Post by jdalrymple »

Kriyeshh wrote:So as you suggested if i increase the max_check_attempt to 5, i may get a temporary solution as service is recovered on 4th minute.

Am i right?
Yup
User avatar
Kriyeshh
Posts: 18
Joined: Wed May 13, 2015 5:15 pm
Location: India

Re: No_Response from Remote Host

Post by Kriyeshh »

Thanks Mr.Jdalrymple.
Will make the changes and will come back along with the Process and Cron list.
Cheers,
-Kriyeshh
User avatar
hsmith
Agent Smith
Posts: 3539
Joined: Thu Jul 30, 2015 11:09 am
Location: 127.0.0.1
Contact:

Re: No_Response from Remote Host

Post by hsmith »

Thanks, keep us posted.
Former Nagios Employee.
me.
User avatar
Kriyeshh
Posts: 18
Joined: Wed May 13, 2015 5:15 pm
Location: India

Re: No_Response from Remote Host

Post by Kriyeshh »

Hi Friends,

I updated my cfg file and extended the max_check_attempts value to 5, which stopped my alert for time being.
But anyhow i have a service_out for 3 minutes as mentioned before.
And suggested by Mr.Tgriep i ran over my system processes and found that DATABASE_backup is been executed on that particular time.
The DB process is one of the essential process so that i cannot quit it, other hand i want to fix this service_out issue too.
Please suggest how to move forward.
Cheers,
-Kriyeshh
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: No_Response from Remote Host

Post by tgriep »

At this point you have to talk to the manufacturer of the device to find out why it doesn't respond to SNMP polls while the backup is happening.
Be sure to check out our Knowledgebase for helpful articles and solutions!
User avatar
Kriyeshh
Posts: 18
Joined: Wed May 13, 2015 5:15 pm
Location: India

Re: No_Response from Remote Host

Post by Kriyeshh »

Mr.tgriep what if the infrastructure is cloud? Should i approach my Cloud service provider?
Cheers,
-Kriyeshh
User avatar
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: No_Response from Remote Host

Post by lmiltchev »

To be honest with you, I am not sure what else you can do. Have you tried determining what is the CPU & Memory utilization during these 3-5 minutes, while the database backup is running?
Be sure to check out our Knowledgebase for helpful articles and solutions!
User avatar
Kriyeshh
Posts: 18
Joined: Wed May 13, 2015 5:15 pm
Location: India

Re: No_Response from Remote Host

Post by Kriyeshh »

Hi Mr. lmiltchev,
For you information,I have already tried CPU and Memory utilization for the quoted downtime and its being already shared. To be open a database based backup process is running on these minutes, according to system process output summary.

Currently i have extended the max_check_attempt and i dint get any alerts but still have timeout in my logs.
Should i go for priority or something for Nagios service on that server?
Cheers,
-Kriyeshh
jdalrymple
Skynet Drone
Posts: 2620
Joined: Wed Feb 11, 2015 1:56 pm

Re: No_Response from Remote Host

Post by jdalrymple »

Kriyeshh wrote:Should i go for priority
IMO this is a next best option.

Code: Select all

nice /path/to/database/backup/script.sh
should be adequate.
Locked