Cron <root@nagios> LANG=C LC_ALL=C /usr/bin/mrtg /etc/mrtg/mrtg.cfg --lock-file /var/lib/mrtg/mrtg.lock --confcache-file /var/lib/mrtg/mrtg.ok --user=nagios --group=nagios
The email is quite long, and is basically a whole bunch of SNMP failures (example below):
Most, if not all, of the IPs in this email are for network switches that were previously configured to be monitored by Nagios but have since been removed (some removed months ago).SNMPGET Problem for ifInOctets.10102 ifOutOctets.10102 on readonly@IP-address-snipped::::1:v4only: No response from remote host "IP-address-snipped" at /usr/bin/../lib/mrtg2/Net_SNMP_util.pm line 594.
Net_SNMP_util::snmpget('readonly@IP-address-snipped:161::::1:v4only', 'HASH(0x235dfb8)', 'ifInOctets.10102', 'ifOutOctets.10102') called at /usr/bin/mrtg line 2331
main::getsnmparg('HASH(0x1546de8)', 'HASH(0x2361b38)', 'HASH(0x1434fa8)', 'HASH(0x18ea380)') called at /usr/bin/mrtg line 2511
main::readtargets('HASH(0x1546de8)', 'ARRAY(0x15b12a8)', 'HASH(0x1434fa8)') called at /usr/bin/mrtg line 404
main::main called at /usr/bin/mrtg line 144
This brings two questions to mind:
1) Why is the underlying operating system sending emails every 5 minutes for these failures? These aren't actual Nagios application-level notifications. Historically, the VM has been sending emails typically once a day, related to the "automysqlbackup" cron (these don't seem to be errors, but informational with the output of the backup job). We've just ignored those since they are low-volume, but now these mrtg ones are flooding.
2) Why are these older addresses still being queried? Looking on the file system, the /etc/mrtg/conf.d/ folder contains cfg files for some 55 IP addresses. But Nagios itself only has 4 switches configured in it at this time. It seems like Nagios XI is not deleting these files when the corresponding hosts/services are deleted?
As for the timing of this starting to happen this weekend (while the switches were deleted a long time ago), two changes took place:
1) Nagios was updated from 5.8.6 to 5.8.7.
2) The underlying OS packages were updated (yum check-update / yum update).