Page 1 of 1

Some disk checks falsely failing on one server

Posted: Thu Sep 05, 2024 2:58 pm
by adamsj
Hi, folks,

We've got one Linux node which has nine disk checks on it. Of those nine, two are returning good results and seven are returning "ERROR: Description/Type table : No response from remote host <hostname>". According to inspecting the node, all nine should return good.

I'm puzzled by this, since it's only a few disk checks and only on one node. I've tried restarting the Nagios service, forcing the checks manually, and bouncing the Nagios app node. Nothing helped.

Your thoughts?

Thanks,

John A

Re: Some disk checks falsely failing on one server

Posted: Fri Sep 06, 2024 10:46 am
by bbahn
Hello @adamsj,

It seems that your check is likely being blocked by a firewall. Can you check this with telnet/traceroute?

Re: Some disk checks falsely failing on one server

Posted: Wed Sep 18, 2024 4:59 pm
by adamsj
bbahn wrote: Fri Sep 06, 2024 10:46 am Hello @adamsj,

It seems that your check is likely being blocked by a firewall. Can you check this with telnet/traceroute?
Hi, bbahn,

It turns out that somehow--and I'd love to know how--when we adjust a service for a host, some of that hosts entries in /usr/local/nagios/etc/services revert to an old SNMP community string which I thought we'd gotten flushed from the system.

Thanks,

John A

Re: Some disk checks falsely failing on one server

Posted: Thu Sep 19, 2024 10:23 am
by bbahn
Can you explicitly explain what change you made and what change you saw?