Host Status: DOWN (for 0d 1h 11m 6s)
-
prasanthaws1
- Posts: 10
- Joined: Mon May 13, 2019 1:16 am
Host Status: DOWN (for 0d 1h 11m 6s)
split from https://support.nagios.com/forum/viewto ... =7&t=53997
i nagios core all systems setups are fine ,
but suddenly two systems are gone down ,it is showing error as
Host Status: DOWN (for 0d 1h 11m 6s)
Status Information: (Host check timed out after 30.01 seconds)
Current Load
CRITICAL 06-07-2019 08:26:17 3d 2h 55m 14s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
Current Users
CRITICAL 06-07-2019 08:25:54 3d 1h 35m 21s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
PING
CRITICAL 06-07-2019 08:21:52 3d 1h 38m 27s 20/20 CRITICAL - Plugin timed out
Root Partition
OK 06-07-2019 08:23:55 14d 18h 29m 11s 1/20 DISK OK - free space: / 3952 MB (40.14% inode=86%):
SSH
CRITICAL 06-07-2019 08:22:10 3d 1h 39m 8s 20/20 CRITICAL - Socket timeout
Total Processes
CRITICAL 06-07-2019 08:24:43 14d 18h 31m 7s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
disk
CRITICAL 06-07-2019 08:24:43 3d 1h 37m 46s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
mem
CRITICAL 06-07-2019 08:25:47 3d 1h 36m 1s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
in server log files it is showing this error
Warning: Check of host 'nameserver' timed out after 30.27 seconds
wproc: early_timeout=1; exited_ok=0; wait_status=0; error_code=62;
wproc: Core Worker 14203: job 295 (pid=15921): Dormant child reaped
service=(null);
wproc: CHECK job 294 from worker Core Worker 14204 timed out after 31.02s
for referred this below links still my problem is not resolved kindly please help regarding i am using ubuntu os (i used all steps to clear my error still i am unable get output)
https://support.nagios.com/forum/viewtopic.php?t=50423
https://support.nagios.com/kb/article/n ... s-617.html
can you please help me thank you
i nagios core all systems setups are fine ,
but suddenly two systems are gone down ,it is showing error as
Host Status: DOWN (for 0d 1h 11m 6s)
Status Information: (Host check timed out after 30.01 seconds)
Current Load
CRITICAL 06-07-2019 08:26:17 3d 2h 55m 14s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
Current Users
CRITICAL 06-07-2019 08:25:54 3d 1h 35m 21s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
PING
CRITICAL 06-07-2019 08:21:52 3d 1h 38m 27s 20/20 CRITICAL - Plugin timed out
Root Partition
OK 06-07-2019 08:23:55 14d 18h 29m 11s 1/20 DISK OK - free space: / 3952 MB (40.14% inode=86%):
SSH
CRITICAL 06-07-2019 08:22:10 3d 1h 39m 8s 20/20 CRITICAL - Socket timeout
Total Processes
CRITICAL 06-07-2019 08:24:43 14d 18h 31m 7s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
disk
CRITICAL 06-07-2019 08:24:43 3d 1h 37m 46s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
mem
CRITICAL 06-07-2019 08:25:47 3d 1h 36m 1s 20/20 CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
in server log files it is showing this error
Warning: Check of host 'nameserver' timed out after 30.27 seconds
wproc: early_timeout=1; exited_ok=0; wait_status=0; error_code=62;
wproc: Core Worker 14203: job 295 (pid=15921): Dormant child reaped
service=(null);
wproc: CHECK job 294 from worker Core Worker 14204 timed out after 31.02s
for referred this below links still my problem is not resolved kindly please help regarding i am using ubuntu os (i used all steps to clear my error still i am unable get output)
https://support.nagios.com/forum/viewtopic.php?t=50423
https://support.nagios.com/kb/article/n ... s-617.html
can you please help me thank you
Re: Host Status: DOWN (for 0d 1h 11m 6s)
Are you able to ping the remote machine from the nagios machine's command line? Can port 5669 be reached? Are you sure that the NRPE agent is up on the remote machine? On the core machine, try running:
Code: Select all
ping <remote_machine_ip>
nmap <remote_machine_ip> -p 5669As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
-
prasanthaws1
- Posts: 10
- Joined: Mon May 13, 2019 1:16 am
Re: Host Status: DOWN (for 0d 1h 11m 6s)
yes i can able get ping response from both the servers
56(84) bytes of data.
56(84) bytes of data.
-
prasanthaws1
- Posts: 10
- Joined: Mon May 13, 2019 1:16 am
Re: Host Status: DOWN (for 0d 1h 11m 6s)
Nmap 7.60 ( https://nmap.org ) at 2019-06-10 03:56 UTC
Note: Host seems down. If it is really up, but blocking our ping probes, try -Pn
Nmap done: 1 IP address (0 hosts up) scanned in 3.11 seconds
Note: Host seems down. If it is really up, but blocking our ping probes, try -Pn
Nmap done: 1 IP address (0 hosts up) scanned in 3.11 seconds
-
prasanthaws1
- Posts: 10
- Joined: Mon May 13, 2019 1:16 am
Re: Host Status: DOWN (for 0d 1h 11m 6s)
Starting Nmap 7.60 ( https://nmap.org ) at 2019-06-10 03:58 UTC
setup_target: failed to determine route to 5666 (0.0.22.37)
setup_target: failed to determine route to 5666 (0.0.22.37)
Last edited by prasanthaws1 on Sun Jun 09, 2019 11:18 pm, edited 2 times in total.
-
prasanthaws1
- Posts: 10
- Joined: Mon May 13, 2019 1:16 am
Re: Host Status: DOWN (for 0d 1h 11m 6s)
but here i am using my port number is 5666, i am keeping port number of default nrpe is 5666 only, plase help me regarding this
cd /etc/init.d
# default: on
# description: NRPE (Nagios Remote Plugin Executor)
service nrpe
{
flags = REUSE
socket_type = stream
port = 5666
wait = no
user = nagios
group = nagios
server = /usr/local/nagios/bin/nrpe
server_args = -c /usr/local/nagios/etc/nrpe.cfg --init.d
log_on_failure += USERID
disable = no
only_from = server ip nagios
}
cd /etc/init.d
# default: on
# description: NRPE (Nagios Remote Plugin Executor)
service nrpe
{
flags = REUSE
socket_type = stream
port = 5666
wait = no
user = nagios
group = nagios
server = /usr/local/nagios/bin/nrpe
server_args = -c /usr/local/nagios/etc/nrpe.cfg --init.d
log_on_failure += USERID
disable = no
only_from = server ip nagios
}
Last edited by prasanthaws1 on Sun Jun 09, 2019 11:41 pm, edited 1 time in total.
-
prasanthaws1
- Posts: 10
- Joined: Mon May 13, 2019 1:16 am
Re: Host Status: DOWN (for 0d 1h 11m 6s)
tcp 0 0 0.0.0.0:5666 0.0.0.0:* LISTEN
tcp 0 0 55666 ESTABLISHED
tcp6 0 0 :::5666 :::* LISTEN
Host Status: DOWN (for 6d 6h 24m 8s)
Status Information: (Host check timed out after 31.01 seconds)
how to restart host status time (for 6d 6h 24m 8s) from staring from 0 days 0 h 1min )
tcp 0 0 55666 ESTABLISHED
tcp6 0 0 :::5666 :::* LISTEN
Host Status: DOWN (for 6d 6h 24m 8s)
Status Information: (Host check timed out after 31.01 seconds)
how to restart host status time (for 6d 6h 24m 8s) from staring from 0 days 0 h 1min )
Re: Host Status: DOWN (for 0d 1h 11m 6s)
If the port is up and listening and the agent has been configured to allow connections from the nagios machine, then it is likely a firewall preventing the connection. Is there a firewall between the two? What is the output of "iptables -L" on the machine with the agent installed?
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
-
prasanthaws1
- Posts: 10
- Joined: Mon May 13, 2019 1:16 am
Re: Host Status: DOWN (for 0d 1h 11m 6s)
this is tmy iptables -L output
Chain INPUT (policy ACCEPT)
target prot opt source destination
ACCEPT tcp -- anywhere anywhere tcp dpt:nrpe
Chain FORWARD (policy ACCEPT)
target prot opt source destination
Chain OUTPUT (policy ACCEPT)
target prot opt source destination
*filter
:INPUT ACCEPT [59374:26344691]
:FORWARD ACCEPT [0:0]
:OUTPUT ACCEPT [63199:14583606]
client nrpe syslog error
systemd-resolved[759]: Server returned error NXDOMAIN, mitigating potential DNS violation DVE-2018-0001, retrying transaction with reduced feature level UDP.
server side error
wproc: early_timeout=1; exited_ok=0; wait_status=0; error_code=62
-A INPUT -p tcp -m tcp --dport 5666 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 5666 -j ACCEPT
COMMIT
Chain INPUT (policy ACCEPT)
target prot opt source destination
ACCEPT tcp -- anywhere anywhere tcp dpt:nrpe
Chain FORWARD (policy ACCEPT)
target prot opt source destination
Chain OUTPUT (policy ACCEPT)
target prot opt source destination
*filter
:INPUT ACCEPT [59374:26344691]
:FORWARD ACCEPT [0:0]
:OUTPUT ACCEPT [63199:14583606]
client nrpe syslog error
systemd-resolved[759]: Server returned error NXDOMAIN, mitigating potential DNS violation DVE-2018-0001, retrying transaction with reduced feature level UDP.
server side error
wproc: early_timeout=1; exited_ok=0; wait_status=0; error_code=62
-A INPUT -p tcp -m tcp --dport 5666 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 5666 -j ACCEPT
COMMIT
Re: Host Status: DOWN (for 0d 1h 11m 6s)
The nmap command doesn't look like it ran properly. Here is an example:
Replace 192.168.5.5 with the IP address of the client machine.
Code: Select all
nmap 192.168.5.5 -p 5666As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.