Hello team
We have few servers i.e. Windows servers.
Two of servers were restarted on 9th Mar.
Server name: Laanice1 restarted around 1733 GMT
Laanice2 restarted around 1933 GMT
Monthly report that nagios sends out show 100% Time up. We trying to understand why it is showing 100% uptime despite the fact that servers were restarted.
Let me know should you need any more details from Nagios. Please find word doc attach.
Thanks.
MK
Hostreport uptime 100% despite servers were restarted
-
mkhan12282
- Posts: 28
- Joined: Sun May 31, 2015 3:02 pm
Hostreport uptime 100% despite servers were restarted
- Attachments
-
Nagios report.docx- (216.43 KiB) Downloaded 259 times
Re: Hostreport uptime 100% despite servers were restarted
Those results are expected if you had scheduled downtime for these servers for maintenance or whatever the reason. Did you schedule downtime for theses servers during the times you mentioned?
Be sure to check out the Knowledgebase for helpful articles and solutions!
-
mkhan12282
- Posts: 28
- Joined: Sun May 31, 2015 3:02 pm
Re: Hostreport uptime 100% despite servers were restarted
Hi
Sorry for the delay in replying back as i was checking if downtime was scheduled.
I can confirm, downtime was not scheduled in nagios.
Would it matter for Nagios reporting if servers got back online within 10 checks duration of 10 minutes? Say ping check, we have this set as followed.
Max check attemps: 10
Normal check interval: 3 min
Retry check Interval: 1 min
Please let me know.
Thanks.
MK
Sorry for the delay in replying back as i was checking if downtime was scheduled.
I can confirm, downtime was not scheduled in nagios.
Would it matter for Nagios reporting if servers got back online within 10 checks duration of 10 minutes? Say ping check, we have this set as followed.
Max check attemps: 10
Normal check interval: 3 min
Retry check Interval: 1 min
Please let me know.
Thanks.
MK
Re: Hostreport uptime 100% despite servers were restarted
your host is rebooting between checks and as such is not being noticed by nagios
if you check once every 3 minutes and your server takes less than 3 minutes to reboot nagios doesnt know and if you use soft then hard states it has even more time
if you check once every 3 minutes and your server takes less than 3 minutes to reboot nagios doesnt know and if you use soft then hard states it has even more time
Looking forward to seeing you all at #NagiosCon2019?
-Dedicated Lover of Nconf,PNP4Nagios and Nagvis
-Dedicated Lover of Nconf,PNP4Nagios and Nagvis
Re: Hostreport uptime 100% despite servers were restarted
nozlaf is right. A scheduled reboot is also not considered downtime for most people. You have a 3 minute check interval and max check attempts set to 10 with 1 minute retries. That means your machine has to be unresponsive for up to 3 minutes plus (10-1) x 1 minutes = 12 minutes before Nagios will alert. My recommendation would be to increase check_interval to five minutes and decrease max check attempts to 3. This means that you would only wait up to 5 minutes plus (3-1) x 1 minutes = 7 minutes for a notification. if the machine is unresponsive.
Eric Loyd • http://everwatch.global • 844.240.EVER • @EricLoyd
I'm a Nagios Fanatic! • Join our public Nagios Discord Server!
Re: Hostreport uptime 100% despite servers were restarted
Thanks @nozlaf & @eloyd!
@mkhan12282 - they are both correct. Let us know if you have any further questions.
@mkhan12282 - they are both correct. Let us know if you have any further questions.
Former Nagios Employee
-
mkhan12282
- Posts: 28
- Joined: Sun May 31, 2015 3:02 pm
Re: Hostreport uptime 100% despite servers were restarted
A big Thank You to all of you.
We can close this ticket
Regards
MK
We can close this ticket
Regards
MK