Understanding nagios logs

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
ericssonvietnam
Posts: 239
Joined: Mon Jun 27, 2016 11:05 pm

Understanding nagios logs

Post by ericssonvietnam »

I would like to know, how this packet loss is calculated by Nagios server in terms as how many packets were transmitted and how many dropped leading to % packet loss.
Jatin_Nagios.png

define service{
use generic-service
hostgroup_name nigeria
service_description Service ping
check_command check_ping!3000.0,20%!5000.0,30%
max_check_attempts 1
check_interval 2
retry_interval 2
check_period 24x7
notification_interval 2
notification_period 24x7
retain_nonstatus_information 1
notification_options w,c,r
contact_groups admins
}
You do not have the required permissions to view the files attached to this post.
User avatar
cdienger
Support Tech
Posts: 5045
Joined: Tue Feb 07, 2017 11:26 am

Re: Understanding nagios logs

Post by cdienger »

By default there are 5 icmp packets sent so 20% would mean that 1 of the 5 was dropped.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
ericssonvietnam
Posts: 239
Joined: Mon Jun 27, 2016 11:05 pm

Re: Understanding nagios logs

Post by ericssonvietnam »

cdienger wrote:By default there are 5 icmp packets sent so 20% would mean that 1 of the 5 was dropped.

what about the RTA


how we calculate getting this in the Screenshot shared. Please specify.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Understanding nagios logs

Post by scottwilkerson »

ericssonvietnam wrote: what about the RTA


how we calculate getting this in the Screenshot shared. Please specify.
RTA "round trip average" is the amount of time it took "ON AVERAGE" for those 5 pings to reach their destination and echo a return.
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
ericssonvietnam
Posts: 239
Joined: Mon Jun 27, 2016 11:05 pm

Re: Understanding nagios logs

Post by ericssonvietnam »

scottwilkerson wrote:
ericssonvietnam wrote: what about the RTA


how we calculate getting this in the Screenshot shared. Please specify.
RTA "round trip average" is the amount of time it took "ON AVERAGE" for those 5 pings to reach their destination and echo a return.


+++++++++++++++

First let’s understand how ping actually works.
 Ping works by sending an Internet Control Message Protocol (ICMP) Echo Request to a specified interface on the network and waits for a reply.

So it is very much possible that with a very slight latency in network, after Nagios sends the last (5th packet), it closes the ping execution, but the response is still awaited. In other words, the last packet response is received after ping execution is closed. Thus showing us packet loss of 20% (1 out of 5). {{Please point out if Nagios is using a different mechanism for ping test}}
The above scenario can also be observed when we manually use the ping command and do ctrl+C to stop the ping.

Thus 20% packet loss should not be raised as a network issue and requires more troubleshooting at automation end.

+++++++++++++++++
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Understanding nagios logs

Post by scottwilkerson »

You can set the percentage you want for warning and critical. you could require it to be 100%
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
ericssonvietnam
Posts: 239
Joined: Mon Jun 27, 2016 11:05 pm

Re: Understanding nagios logs

Post by ericssonvietnam »

scottwilkerson wrote:You can set the percentage you want for warning and critical. you could require it to be 100%
Please suggest how to set the new warning threshold in ping.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Understanding nagios logs

Post by scottwilkerson »

ericssonvietnam wrote: Please suggest how to set the new warning threshold in ping.
Can you show your current configuration for this service in Configure -> CCM -> Services -> Edit (this service)

Then I can make suggestions
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
ericssonvietnam
Posts: 239
Joined: Mon Jun 27, 2016 11:05 pm

Re: Understanding nagios logs

Post by ericssonvietnam »

scottwilkerson wrote:
ericssonvietnam wrote: Please suggest how to set the new warning threshold in ping.
Can you show your current configuration for this service in Configure -> CCM -> Services -> Edit (this service)

Then I can make suggestions
I have configured the same in the core.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Understanding nagios logs

Post by scottwilkerson »

I'm sorry I didn't see it in your first post, if you change dthe check_command to the following it would require 100% packet loss to be critical

Code: Select all

check_command check_ping!3000.0,100%!5000.0,100%
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
Locked