1. When we ACK an alert, it stays ACK'd for a short period of time before it comes back.
2. We haven't been collecting any information in our event logs (Home --> Monitoring Process --> Event Log).
When I look at the logs, I see a bunch of entries like this:
Code: Select all
Dec 4 08:07:22 fe1 ndo2db: Error: mysql_query() failed for 'INSERT INTO nagios_logentries SET instance_id='1', logentry_time=FROM_UNIXTIME(1449245242), entry_time=FROM_UNIXTIME(1449245242), entry_time_usec='597398', logentry_type='65536', logentry_data='SERVICE ALERT: our_router\.ourcompany\.com;Ping;CRITICAL;SOFT;3;CRITICAL - 10\.75\.24\.211: rta nan, lost 100%', realtime_data='1', inferred_data_extracted='1''
Dec 4 08:07:22 fe1 ndo2db: mysql_error: 'Table './nagios/nagios_logentries' is marked as crashed and last (automatic?) repair failed'I copied the repairmysql.sh script over to the MySQL box and then on the MySQL box and did the following:
Code: Select all
service mysqld stop
repairmysql.sh nagios
service mysqld start
Went back to the Nagios box, restarted ndo2db and Nagios but am still seeing those entries in the logs on the Nagios box.