Host Status: DOWN (for 0d 1h 11m 6s)

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
prasanthaws1
Posts: 10
Joined: Mon May 13, 2019 1:16 am

Host Status: DOWN (for 0d 1h 11m 6s)

Post by prasanthaws1 »

split from https://support.nagios.com/forum/viewto ... =7&t=53997

i nagios core all systems setups are fine ,
but suddenly two systems are gone down ,it is showing error as
Host Status: DOWN (for 0d 1h 11m 6s)
Status Information: (Host check timed out after 30.01 seconds)
Current Load

CRITICAL 06-07-2019 08:26:17 3d 2h 55m 14s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
Current Users
CRITICAL 06-07-2019 08:25:54 3d 1h 35m 21s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
PING
CRITICAL 06-07-2019 08:21:52 3d 1h 38m 27s 20/20 CRITICAL - Plugin timed out
Root Partition
OK 06-07-2019 08:23:55 14d 18h 29m 11s 1/20 DISK OK - free space: / 3952 MB (40.14% inode=86%):
SSH
CRITICAL 06-07-2019 08:22:10 3d 1h 39m 8s 20/20 CRITICAL - Socket timeout
Total Processes
CRITICAL 06-07-2019 08:24:43 14d 18h 31m 7s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
disk
CRITICAL 06-07-2019 08:24:43 3d 1h 37m 46s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
mem
CRITICAL 06-07-2019 08:25:47 3d 1h 36m 1s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.

in server log files it is showing this error
Warning: Check of host 'nameserver' timed out after 30.27 seconds
wproc: early_timeout=1; exited_ok=0; wait_status=0; error_code=62;
wproc: Core Worker 14203: job 295 (pid=15921): Dormant child reaped
service=(null);
wproc: CHECK job 294 from worker Core Worker 14204 timed out after 31.02s

for referred this below links still my problem is not resolved kindly please help regarding i am using ubuntu os (i used all steps to clear my error still i am unable get output)

https://support.nagios.com/forum/viewtopic.php?t=50423
https://support.nagios.com/kb/article/n ... s-617.html

can you please help me thank you
User avatar
cdienger
Support Tech
Posts: 5045
Joined: Tue Feb 07, 2017 11:26 am

Re: Host Status: DOWN (for 0d 1h 11m 6s)

Post by cdienger »

Are you able to ping the remote machine from the nagios machine's command line? Can port 5669 be reached? Are you sure that the NRPE agent is up on the remote machine? On the core machine, try running:

Code: Select all

ping <remote_machine_ip>
nmap <remote_machine_ip> -p 5669
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
prasanthaws1
Posts: 10
Joined: Mon May 13, 2019 1:16 am

Re: Host Status: DOWN (for 0d 1h 11m 6s)

Post by prasanthaws1 »

yes i can able get ping response from both the servers

56(84) bytes of data.
prasanthaws1
Posts: 10
Joined: Mon May 13, 2019 1:16 am

Re: Host Status: DOWN (for 0d 1h 11m 6s)

Post by prasanthaws1 »

Nmap 7.60 ( https://nmap.org ) at 2019-06-10 03:56 UTC
Note: Host seems down. If it is really up, but blocking our ping probes, try -Pn
Nmap done: 1 IP address (0 hosts up) scanned in 3.11 seconds
prasanthaws1
Posts: 10
Joined: Mon May 13, 2019 1:16 am

Re: Host Status: DOWN (for 0d 1h 11m 6s)

Post by prasanthaws1 »

Starting Nmap 7.60 ( https://nmap.org ) at 2019-06-10 03:58 UTC
setup_target: failed to determine route to 5666 (0.0.22.37)
Last edited by prasanthaws1 on Sun Jun 09, 2019 11:18 pm, edited 2 times in total.
prasanthaws1
Posts: 10
Joined: Mon May 13, 2019 1:16 am

Re: Host Status: DOWN (for 0d 1h 11m 6s)

Post by prasanthaws1 »

but here i am using my port number is 5666, i am keeping port number of default nrpe is 5666 only, plase help me regarding this


cd /etc/init.d
# default: on
# description: NRPE (Nagios Remote Plugin Executor)
service nrpe
{
flags = REUSE
socket_type = stream
port = 5666
wait = no
user = nagios
group = nagios
server = /usr/local/nagios/bin/nrpe
server_args = -c /usr/local/nagios/etc/nrpe.cfg --init.d
log_on_failure += USERID
disable = no
only_from = server ip nagios
}
Last edited by prasanthaws1 on Sun Jun 09, 2019 11:41 pm, edited 1 time in total.
prasanthaws1
Posts: 10
Joined: Mon May 13, 2019 1:16 am

Re: Host Status: DOWN (for 0d 1h 11m 6s)

Post by prasanthaws1 »

tcp 0 0 0.0.0.0:5666 0.0.0.0:* LISTEN
tcp 0 0 55666 ESTABLISHED
tcp6 0 0 :::5666 :::* LISTEN




Host Status: DOWN (for 6d 6h 24m 8s)
Status Information: (Host check timed out after 31.01 seconds)
how to restart host status time (for 6d 6h 24m 8s) from staring from 0 days 0 h 1min )
User avatar
cdienger
Support Tech
Posts: 5045
Joined: Tue Feb 07, 2017 11:26 am

Re: Host Status: DOWN (for 0d 1h 11m 6s)

Post by cdienger »

If the port is up and listening and the agent has been configured to allow connections from the nagios machine, then it is likely a firewall preventing the connection. Is there a firewall between the two? What is the output of "iptables -L" on the machine with the agent installed?
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
prasanthaws1
Posts: 10
Joined: Mon May 13, 2019 1:16 am

Re: Host Status: DOWN (for 0d 1h 11m 6s)

Post by prasanthaws1 »

this is tmy iptables -L output
Chain INPUT (policy ACCEPT)
target prot opt source destination
ACCEPT tcp -- anywhere anywhere tcp dpt:nrpe

Chain FORWARD (policy ACCEPT)
target prot opt source destination

Chain OUTPUT (policy ACCEPT)
target prot opt source destination

*filter
:INPUT ACCEPT [59374:26344691]
:FORWARD ACCEPT [0:0]
:OUTPUT ACCEPT [63199:14583606]


client nrpe syslog error
systemd-resolved[759]: Server returned error NXDOMAIN, mitigating potential DNS violation DVE-2018-0001, retrying transaction with reduced feature level UDP.

server side error

wproc: early_timeout=1; exited_ok=0; wait_status=0; error_code=62


-A INPUT -p tcp -m tcp --dport 5666 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 5666 -j ACCEPT
COMMIT
User avatar
cdienger
Support Tech
Posts: 5045
Joined: Tue Feb 07, 2017 11:26 am

Re: Host Status: DOWN (for 0d 1h 11m 6s)

Post by cdienger »

The nmap command doesn't look like it ran properly. Here is an example:

Code: Select all

nmap 192.168.5.5 -p 5666
Replace 192.168.5.5 with the IP address of the client machine.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Locked