Email alerts coming even if a service is live?

soamz
Posts: 42
Joined: Sat May 07, 2016 8:29 am

Post by soamz »

My inbox has been bombarded with Nagios alert emails and, to my surprise, those devices never went offline.
What's the issue?

Why does Nagios think they are dead while the devices are still live?
Box293
Posts: 5126
Joined: Sun Feb 07, 2010 10:55 pm
Location: Deniliquin, Australia

Post by Box293 »

Can you post an example email alert so we can see what is going on?

In the commands below, replace xxxxx with the name of your host. What is the output of these commands?

Code:

grep -i xxxxx /usr/local/nagios/var/nagios.log
If this occurred on 7 May, let's look at the archived logs for that day:

Code:

grep -i xxxxx /usr/local/nagios/var/archives/nagios-05-07-2016-*.log
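
The lines that these grep commands return begin with a Unix epoch timestamp in square brackets, which is hard to read at a glance. As a rough sketch only, a small Python helper along these lines could make them readable. The name humanize is made up for illustration, and the times are rendered in UTC, not in the server's local time zone. The two sample lines are copied from the log output posted further down in this thread.

```python
import re
from datetime import datetime, timezone

# Nagios log lines start with a Unix epoch timestamp in square brackets.
STAMP = re.compile(r"^\[(\d+)\]")


def humanize(line: str) -> str:
    """Replace the leading [epoch] with a readable UTC timestamp.

    Hypothetical helper name; lines without a leading [epoch] are
    returned unchanged.
    """
    def repl(match: re.Match) -> str:
        when = datetime.fromtimestamp(int(match.group(1)), tz=timezone.utc)
        return when.strftime("[%Y-%m-%d %H:%M:%S UTC]")

    return STAMP.sub(repl, line, count=1)


# Two lines copied from the log output posted in this thread.
sample = [
    "[1462734899] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%",
    "[1462738173] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.33 ms",
]
for entry in sample:
    print(humanize(entry))
```

Readable timestamps make it easier to line up a DOWN or UP entry with the time an alert email arrived.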

Post by soamz »

Code:

root@jetnms:~# grep -i CSPur_E /usr/local/nagios/var/nagios.log
[1462732200] CURRENT HOST STATE: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462733759] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.27 ms
[1462734089] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462734179] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462734269] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462734359] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462734449] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462734539] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462734629] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462734719] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462734809] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462734899] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462738173] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.33 ms
[1462738807] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462738897] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462738987] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462739077] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462739167] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462739257] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462739347] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462739437] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462739527] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462739616] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462740936] wproc:   host=CSPur_E; service=(null);
[1462740936] Warning: Check of host 'CSPur_E' timed out after 30.00 seconds
[1462742586] HOST FLAPPING ALERT: CSPur_E;STOPPED; Host appears to have stopped flapping (3.9% change < 5.0% threshold)
[1462742916] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462743880] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.29 ms
[1462743880] HOST NOTIFICATION: nagiosadmin;CSPur_E;UP;notify-host-by-email;PING OK - Packet loss = 0%, RTA = 0.29 ms
[1462744210] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462744300] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462744390] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462744480] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462744570] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462744660] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462744750] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462744840] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462744930] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462745020] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462745020] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462745984] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.30 ms
[1462745984] HOST NOTIFICATION: nagiosadmin;CSPur_E;UP;notify-host-by-email;PING OK - Packet loss = 0%, RTA = 0.30 ms
[1462746618] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462746618] HOST FLAPPING ALERT: CSPur_E;STARTED; Host appears to have started flapping (20.9% change > 20.0% threshold)
[1462746708] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462746798] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462746888] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462746978] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462747068] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462747158] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462747248] wproc:   host=CSPur_E; service=(null);
[1462747248] Warning: Check of host 'CSPur_E' timed out after 30.01 seconds
[1462747248] HOST ALERT: CSPur_E;DOWN;SOFT;8;(Host check timed out after 30.01 seconds)
[1462747338] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462747428] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462750397] HOST FLAPPING ALERT: CSPur_E;STOPPED; Host appears to have stopped flapping (3.9% change < 5.0% threshold)
[1462750727] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462751361] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.57 ms
[1462751361] HOST NOTIFICATION: nagiosadmin;CSPur_E;UP;notify-host-by-email;PING OK - Packet loss = 0%, RTA = 0.57 ms
[1462752299] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462752389] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462752479] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462752569] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462752659] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462752749] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462752839] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462752929] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462753019] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462753109] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462753109] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462755089] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462757069] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462759049] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462761029] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462761662] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.26 ms
[1462761662] HOST NOTIFICATION: nagiosadmin;CSPur_E;UP;notify-host-by-email;PING OK - Packet loss = 0%, RTA = 0.26 ms
[1462762296] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462762386] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462762476] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462762566] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462762656] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462762746] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462762836] wproc:   host=CSPur_E; service=(null);
[1462762836] Warning: Check of host 'CSPur_E' timed out after 30.00 seconds
[1462762836] HOST ALERT: CSPur_E;DOWN;SOFT;7;(Host check timed out after 30.00 seconds)
[1462762926] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462763016] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462763106] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462763106] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462765086] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462765720] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.52 ms
[1462765720] HOST NOTIFICATION: nagiosadmin;CSPur_E;UP;notify-host-by-email;PING OK - Packet loss = 0%, RTA = 0.52 ms
[1462766962] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462767052] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462767142] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462767232] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462767322] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462767412] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462767502] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462767592] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462767682] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462767772] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462767772] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462769726] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.28 ms
[1462769726] HOST NOTIFICATION: nagiosadmin;CSPur_E;UP;notify-host-by-email;PING OK - Packet loss = 0%, RTA = 0.28 ms
[1462770056] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462770146] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462770236] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462770326] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462770416] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462770505] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462770595] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462770685] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462770775] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462770865] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462770865] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462771829] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.29 ms
[1462771829] HOST NOTIFICATION: nagiosadmin;CSPur_E;UP;notify-host-by-email;PING OK - Packet loss = 0%, RTA = 0.29 ms
[1462772463] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462772463] HOST FLAPPING ALERT: CSPur_E;STARTED; Host appears to have started flapping (20.9% change > 20.0% threshold)
[1462772553] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462772643] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462772733] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462772823] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462772913] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462773003] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462773093] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462773183] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462773273] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462774567] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.28 ms
[1462774897] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462774987] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462775077] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462775167] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462775257] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462775321] HOST ALERT: CSPur_E;UP;SOFT;6;PING OK - Packet loss = 0%, RTA = 0.30 ms
[1462776259] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462776349] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462776439] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462776529] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462776619] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462776709] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462776799] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462776889] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462776979] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462777069] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462777703] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.37 ms
[1462778033] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462778123] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462778212] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462778302] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462778366] HOST ALERT: CSPur_E;UP;SOFT;5;PING OK - Packet loss = 0%, RTA = 0.58 ms
[1462779000] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462779090] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462779180] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462779270] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462779360] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462779450] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462779540] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462779630] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462779720] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462779810] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462780135] HOST ALERT: CSPur_E;UP;HARD;10;PING WARNING - Packet loss = 80%, RTA = 0.27 ms
[1462781073] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462781163] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462781253] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462781343] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462781433] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462781523] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462781613] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462781703] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462781793] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462781883] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462782860] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 72%, RTA = 0.28 ms
[1462783190] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462783280] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462783370] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462783460] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462783550] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462783640] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462783730] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462783820] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462783910] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462784000] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462787300] HOST FLAPPING ALERT: CSPur_E;STOPPED; Host appears to have stopped flapping (3.8% change < 5.0% threshold)
[1462787630] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462787934] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.34 ms
[1462787934] HOST NOTIFICATION: nagiosadmin;CSPur_E;UP;notify-host-by-email;PING OK - Packet loss = 0%, RTA = 0.34 ms
[1462788263] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462788353] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462788443] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462788533] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462788623] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462788713] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462788803] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462788893] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462788983] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462789073] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462789073] HOST NOTIFICATION: nagiosadmin;CSPur_E;DOWN;notify-host-by-email;PING CRITICAL - Packet loss = 100%
[1462789377] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.28 ms
[1462789377] HOST NOTIFICATION: nagiosadmin;CSPur_E;UP;notify-host-by-email;PING OK - Packet loss = 0%, RTA = 0.28 ms
[1462789707] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462789707] HOST FLAPPING ALERT: CSPur_E;STARTED; Host appears to have started flapping (21.8% change > 20.0% threshold)
[1462789797] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462789887] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462789977] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462790067] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462790157] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462790247] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462790337] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462790427] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462790517] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462790821] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.26 ms
root@jetnms:~# 
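
Read together, the entries above suggest a pattern: a failed check is retried about every 90 seconds, and the host is only declared DOWN (HARD) on the tenth attempt. The Python sketch below is one way to pull that out of the log instead of reading it by eye. The function names are made up for illustration, it assumes all lines belong to a single host, and the sample is the first DOWN/UP cycle copied from the output above.

```python
import re
from typing import List, NamedTuple, Tuple

# Matches entries such as:
# [1462734089] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - ...
ALERT = re.compile(
    r"^\[(\d+)\] HOST ALERT: ([^;]+);(UP|DOWN|UNREACHABLE);(SOFT|HARD);(\d+);"
)


class Alert(NamedTuple):
    ts: int
    host: str
    state: str
    kind: str
    attempt: int


def parse_alerts(text: str) -> List[Alert]:
    """Keep only HOST ALERT lines; everything else is ignored."""
    alerts = []
    for line in text.splitlines():
        m = ALERT.match(line.strip())
        if m:
            alerts.append(Alert(int(m[1]), m[2], m[3], m[4], int(m[5])))
    return alerts


def retry_gaps(alerts: List[Alert]) -> List[int]:
    """Distinct gaps in seconds between failed attempt n and attempt n+1."""
    return sorted({
        b.ts - a.ts
        for a, b in zip(alerts, alerts[1:])
        if a.state == b.state == "DOWN" and b.attempt == a.attempt + 1
    })


def hard_outages(alerts: List[Alert]) -> List[Tuple[int, int]]:
    """Pair each HARD DOWN with the next HARD UP as (down_ts, up_ts)."""
    outages, down_at = [], None
    for a in alerts:
        if a.kind != "HARD":
            continue
        if a.state == "DOWN" and down_at is None:
            down_at = a.ts
        elif a.state == "UP" and down_at is not None:
            outages.append((down_at, a.ts))
            down_at = None
    return outages


SAMPLE = """
[1462734089] HOST ALERT: CSPur_E;DOWN;SOFT;1;PING CRITICAL - Packet loss = 100%
[1462734179] HOST ALERT: CSPur_E;DOWN;SOFT;2;PING CRITICAL - Packet loss = 100%
[1462734269] HOST ALERT: CSPur_E;DOWN;SOFT;3;PING CRITICAL - Packet loss = 100%
[1462734359] HOST ALERT: CSPur_E;DOWN;SOFT;4;PING CRITICAL - Packet loss = 100%
[1462734449] HOST ALERT: CSPur_E;DOWN;SOFT;5;PING CRITICAL - Packet loss = 100%
[1462734539] HOST ALERT: CSPur_E;DOWN;SOFT;6;PING CRITICAL - Packet loss = 100%
[1462734629] HOST ALERT: CSPur_E;DOWN;SOFT;7;PING CRITICAL - Packet loss = 100%
[1462734719] HOST ALERT: CSPur_E;DOWN;SOFT;8;PING CRITICAL - Packet loss = 100%
[1462734809] HOST ALERT: CSPur_E;DOWN;SOFT;9;PING CRITICAL - Packet loss = 100%
[1462734899] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%
[1462738173] HOST ALERT: CSPur_E;UP;HARD;10;PING OK - Packet loss = 0%, RTA = 0.33 ms
"""

alerts = parse_alerts(SAMPLE)
print("retry gaps (s):", retry_gaps(alerts))
for down_ts, up_ts in hard_outages(alerts):
    print("hard outage lasted", up_ts - down_ts, "seconds")
```

On this sample the host spent 810 seconds in SOFT retries before the HARD DOWN, so a short blip would not reach the HARD state; each recorded outage here lasted considerably longer than that.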

Post by soamz »

Code:

root@jetnms:~# grep -i CSPur-E /usr/local/nagios/var/archives/nagios-05-07-2016-*.log
root@jetnms:~# 
There is no output for this command.
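
One thing worth checking: the pattern in this command is CSPur-E (hyphen), while the earlier search that returned results used CSPur_E (underscore), so the empty result may come from the spelling and not from missing log entries. A minimal Python sketch of a search that tolerates either separator follows. The helper names host_regex and matching_lines are made up for illustration.

```python
import re
from typing import Iterable, List


def host_regex(name: str) -> re.Pattern:
    """Build a case-insensitive pattern that treats '-' and '_' as equal."""
    parts = re.split(r"[-_]", name)
    return re.compile("[-_]".join(re.escape(p) for p in parts), re.IGNORECASE)


def matching_lines(lines: Iterable[str], name: str) -> List[str]:
    """Return the lines that mention the host under either spelling."""
    pattern = host_regex(name)
    return [line for line in lines if pattern.search(line)]


# One line copied from the earlier log output, plus an unrelated line.
log_lines = [
    "[1462734899] HOST ALERT: CSPur_E;DOWN;HARD;10;PING CRITICAL - Packet loss = 100%",
    "[1462734899] HOST ALERT: OtherHost;UP;HARD;1;PING OK",  # made-up host name
]
print(matching_lines(log_lines, "CSPur-E"))
```

The same idea works directly in grep with a bracket expression such as CSPur[-_]E.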
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Post by rkennedy »

It looks pretty clear that there was indeed a problem communicating with the host.

You mentioned, though, that it never went down. Were you receiving notifications for other hosts or services as well, or just this one? It could be resource related. What is the output of top|head -n6?
Former Nagios Employee

Post by soamz »

Code:

root@jetnms:~# route -n
Kernel IP routing table
Destination     Gateway         Genmask         Flags Metric Ref    Use Iface
0.0.0.0         103.194.232.1   0.0.0.0         UG    0      0        0 eth0
7.7.7.0         0.0.0.0         255.255.255.0   U     0      0        0 eth1
10.10.10.0      0.0.0.0         255.255.255.0   U     0      0        0 eth1
10.10.11.0      0.0.0.0         255.255.255.0   U     0      0        0 eth1
103.194.232.0   0.0.0.0         255.255.255.240 U     0      0        0 eth0
root@jetnms:~# 
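
To see which interface the Nagios server would use to reach a given device, the routing table above can be checked with a longest-prefix match. The Python sketch below does that with the standard ipaddress module. The ROUTES list is transcribed from the output above (255.255.255.240 is a /28); the device addresses passed to lookup are hypothetical, since the thread does not list any.

```python
import ipaddress
from typing import List, Optional, Tuple

# (destination network, gateway or None for directly connected, interface),
# transcribed from the `route -n` output above.
ROUTES: List[Tuple[str, Optional[str], str]] = [
    ("0.0.0.0/0", "103.194.232.1", "eth0"),
    ("7.7.7.0/24", None, "eth1"),
    ("10.10.10.0/24", None, "eth1"),
    ("10.10.11.0/24", None, "eth1"),
    ("103.194.232.0/28", None, "eth0"),
]


def lookup(addr: str) -> Tuple[str, Optional[str], str]:
    """Return (network, gateway, interface) of the most specific route."""
    ip = ipaddress.ip_address(addr)
    candidates = []
    for net_text, gateway, iface in ROUTES:
        net = ipaddress.ip_network(net_text)
        if ip in net:
            candidates.append((net, gateway, iface))
    # The default route always matches, so candidates is never empty.
    net, gateway, iface = max(candidates, key=lambda c: c[0].prefixlen)
    return str(net), gateway, iface


# Hypothetical device addresses, for illustration only.
for device in ("10.10.11.25", "10.10.12.5"):
    print(device, "->", lookup(device))
```

A device in one of the eth1 subnets is reached directly, while any other private address would leave through the default gateway on eth0.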

Post by soamz »

Code:

root@jetnms:~# top|head -n6

top - 00:29:06 up 2 days,  8:54,  1 user,  load average: 0.15, 0.72, 0.87
Tasks: 198 total,   1 running, 197 sleeping,   0 stopped,   0 zombie
%Cpu(s):  7.3 us,  1.8 sy,  0.0 ni, 90.6 id,  0.2 wa,  0.0 hi,  0.0 si,  0.0 st
KiB Mem:  16428244 total,  6298504 used, 10129740 free,   210016 buffers
KiB Swap: 16770044 total,        0 used, 16770044 free.  4990512 cached Mem

root@jetnms:~# 

Post by soamz »

As you can see from my cfg file, I have 100+ hosts, and this issue affects about 20-25 of them.

The others have no issues.

Post by rkennedy »

https://support.nagios.com/forum/viewtopic.php?t=38318 - reference to closed thread.

Can you provide the IPs of some of the 20-25 devices that aren't working? It doesn't look to be a resource issue.
Quote: "But if I try to ping any random device, it doesn't ping. Then it starts pinging again after 5 minutes, stops again, and pings again a day later. It's all random, like a ghost!"
When you are pinging, is it from the Nagios machine's CLI? This shouldn't be something that is caused by Nagios. Do you have any equipment in play that protects against ICMP flooding?
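
One way to compare manual pings with what Nagios records is to ping the device from the Nagios server on a schedule and log the packet loss with a timestamp. The Python sketch below assumes the Linux iputils ping command and its usual summary line ("... 100% packet loss ..."). The function names are made up for illustration, and the device address in the usage comment is hypothetical.

```python
import re
import subprocess
import time
from typing import Optional

# Summary line from Linux ping looks like:
# "5 packets transmitted, 0 received, 100% packet loss, time 4005ms"
LOSS = re.compile(r"(\d+(?:\.\d+)?)% packet loss")


def parse_loss(ping_output: str) -> Optional[float]:
    """Extract the packet loss percentage, or None if no summary is found."""
    m = LOSS.search(ping_output)
    return float(m.group(1)) if m else None


def probe(host: str, count: int = 5) -> Optional[float]:
    """Run `ping -c COUNT HOST` once and return the reported loss."""
    result = subprocess.run(
        ["ping", "-c", str(count), host],
        capture_output=True,
        text=True,
    )
    return parse_loss(result.stdout)


def watch(host: str, interval: int = 60, rounds: int = 10) -> None:
    """Print a timestamped loss figure every `interval` seconds."""
    for _ in range(rounds):
        stamp = time.strftime("%Y-%m-%d %H:%M:%S")
        print(stamp, host, "loss %:", probe(host))
        time.sleep(interval)


# Usage (hypothetical device address):
# watch("10.10.10.5", interval=60, rounds=30)
```

If the loss reported here lines up with the DOWN entries in nagios.log, the drops are happening on the network path and not inside Nagios.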

Post by soamz »

1. Do you want me to paste the private IPs here?

2. I'm logged into the server from a terminal and always ping from there to test.

3. I have 1200+ devices (routers, switches) in the network. How do I find out whether any of them is blocking or limiting ICMP?