After resolving this issue this morning by removing the perfdata from the ramdisc partition, I have been getting email notifications all day from Nagios for outages that occurred during the Nagios outage.
Specific example, I got this email 3 minutes ago:
This device is online and all mod_gearman workers are online now.Notification Type: PROBLEM
Host: Pelican Grand Anevia - CDS
State: DOWN
Address: 10.255.24.170
Info: (host check orphaned, is the mod-gearman worker on queue hostgroup_monitoring running?)
Date/Time: 2015-03-05 15:58:27
It seems that Nagios has somehow stored up these notifications and it slowly (all day now) sending them to me.
Looking that this device in the Nagios XI interface shows:
As you can see from the above, "Last State Change" was during the outage at 4:53am, however I just got the email notification 3 minutes ago.Host State: Up
Duration: 11h 10m 30s
State Type: Hard
Current Check: 1 of 3
Last Check: 2015-03-05 16:03:41
Next Check: 2015-03-05 16:05:15
Last State Change: 2015-03-05 04:53:19
Last Notification: Never
Check Type: Active
Check Latency: 0.01639 seconds
Execution Time: 0.31749 seconds
State Change: 0%
Performance Data: time=0.176439s;;;0.000000 size=500B;;;0
I can provide hundreds more examples if needed.
I've checked my mail queue (using postqueue -p ) and it says "Mail queue is empty".
I'm using nagios XI 2014R1.4
How do I clear out these old notifications so I stop getting emails on them?
Thanks for your help.
Best,
Rafael