Page 1 of 1

Frequently Getting alerts

Posted: Fri Aug 31, 2018 5:57 am
by Mani.Murugesan
Hello Team,

one windows server we are getting alerts frequently and the alert is showing the error " WARNING:No data was received from host!"
That server is a backup server but backup scheduling time and alert time is not matching . sometimes alerts time is matched with backup time .

Every alternative days we are getting alerts frequently .
For example today I received alerts for H drive and after sometime it's started working and tomorrow I will receive alerts for I drive.

Could you please help me to resolve it ?

Re: Frequently Getting alerts

Posted: Fri Aug 31, 2018 11:01 am
by lmiltchev
Can you show us how the "failing" services are configured? What is the version of the NSClient++ agent that you are using on the Windows machines? Have you tried restarting the NSClient++ service to see if this is going to resolve the issue? Can you check the nsclient.log for any warnings or errors pertaining to these checks?

Re: Frequently Getting alerts

Posted: Mon Sep 03, 2018 3:28 am
by Mani.Murugesan
Hello,

PFB Screen shot for the service configuration and state history report for failing services.

As per nagios recommendations for all the failing services we have set the Check interval - 5, retry interval -1, max check attempts -5.

Yes we have already tried restarting nsclient++ service but issue not resolved

nsclient version:0.3.9.328

Re: Frequently Getting alerts

Posted: Tue Sep 04, 2018 11:24 am
by lmiltchev
Maybe your Windows server gets really busy, especially during backups, and your check is timing out. Try the following:

1. Increase the "default" value of 30 for the timeout under the [/settings/default] section in the nsclient.ini file on the Windows machine:

Code: Select all

timeout = 60
Save, exit, and restart the nsclient++ service.

2. Modify your "Drive H: Disk Usage" check by adding a timeout to it.

Example:
/usr/local/nagios/libexec/check_nt -H x.x.x.x -s "password" -p 12489 -t 59 -v USEDDISKSPACE -l H -w 80 -c 90
Note: I used a timeout of 59 seconds as the default service timeout is set at 60 in the nagios.cfg file anyway. :)

Code: Select all

service_check_timeout=60
Let us know if increasing the timeout helps.

Note: If 60 seconds is still not enough, you may need to increase it even more, but you would need to change the service_check_timeout value in the nagios.cfg file too.

Re: Frequently Getting alerts

Posted: Mon Sep 10, 2018 8:14 am
by Mani.Murugesan
Hello,

Now that server is working fine since few days. so we will keep it in under observation . if any issue will come I will try to increase the timeout and check it out.

Re: Frequently Getting alerts

Posted: Mon Sep 10, 2018 10:02 am
by lmiltchev
Sounds good! Let us know if you want us to keep the topic open for the time being. Thank you!

Re: Frequently Getting alerts

Posted: Mon Sep 10, 2018 1:04 pm
by Mani.Murugesan
yes...keep the topic open for the time being . let you know once it's resolved completely

Re: Frequently Getting alerts

Posted: Mon Sep 10, 2018 1:30 pm
by lmiltchev
Noted.