NAGIOS MONITORING ENGINE WON'T START! HELP!
Posted: Tue Jun 05, 2012 2:01 pm
Hello,
I'm running XI ver 2011R2.4.
So it's a long story but I was working on snmp monitoring for a Cisco switch. I was renaming one of the switch interfaces to something like Batch1 (Private). When I was trying to save the new service I was getting errors that it couldn't save. I went into the configuration snapshot section and I saw in the error logs that Nagios was having a problem with the ( ) in my naming convention.
Next thing I know I'm getting a red explanation point on the top right side of the page saying the monitoring engine isn't running. When I would try to start the service manually in system status, I get an error occurred processing your request. I also restarted the server and I still have the same problem with the service not starting.
When I highlight the red explanation point next to monitoring engine, I get "No lock file found in /usr/local/nagios/var/nagios.lock."
Please let me know what I need to do ASAP. If its command line related please understand that I'm a super noob when it comes to Linux, so I'll need detailed instructions on what needs to be done.
Also, if needed I have taken several config snapshots before this problem, but I'm unsure if that will help, make matters worse, or if I should do the import.
Also -- I just ran the command below based on a previous post and below is what I got. I'm not sure what to do next.
[root@nagios ~]# tail -f /usr/local/nagios/var/nagios.log
[1338912147] Auto-save of retention data completed successfully.
[1338912397] SERVICE ALERT: localhost;Current Load;OK;HARD;4;OK - load average: 1.60, 2.35, 2.80
[1338912397] SERVICE FLAPPING ALERT: localhost;Current Load;STARTED; Service app ears to have started flapping (20.9% change >= 20.0% threshold)
[1338913037] SERVICE NOTIFICATION: nagiosadmin;Blog.dev.vlmc;Memory Usage;WARNIN G;xi_service_notification_handler;WARNING - 100 / 1002 MB (9%) Free Memory, Used : 902 MB, Shared: 0 MB, Buffers: 155 MB, Cached: 266 MB
[1338913385] ndomod: Error writing to data sink! Some output may get lost...
[1338913385] ndomod: Please check remote ndo2db log, database connection or SSL Parameters
[1338913396] Caught SIGTERM, shutting down...
[1338913396] Successfully shutdown... (PID=20368)
[1338913396] ndomod: Shutdown complete.
[1338913396] Event broker module '/usr/local/nagios/bin/ndomod.o' deinitialized successfully.
Thank you!
I'm running XI ver 2011R2.4.
So it's a long story but I was working on snmp monitoring for a Cisco switch. I was renaming one of the switch interfaces to something like Batch1 (Private). When I was trying to save the new service I was getting errors that it couldn't save. I went into the configuration snapshot section and I saw in the error logs that Nagios was having a problem with the ( ) in my naming convention.
Next thing I know I'm getting a red explanation point on the top right side of the page saying the monitoring engine isn't running. When I would try to start the service manually in system status, I get an error occurred processing your request. I also restarted the server and I still have the same problem with the service not starting.
When I highlight the red explanation point next to monitoring engine, I get "No lock file found in /usr/local/nagios/var/nagios.lock."
Please let me know what I need to do ASAP. If its command line related please understand that I'm a super noob when it comes to Linux, so I'll need detailed instructions on what needs to be done.
Also, if needed I have taken several config snapshots before this problem, but I'm unsure if that will help, make matters worse, or if I should do the import.
Also -- I just ran the command below based on a previous post and below is what I got. I'm not sure what to do next.
[root@nagios ~]# tail -f /usr/local/nagios/var/nagios.log
[1338912147] Auto-save of retention data completed successfully.
[1338912397] SERVICE ALERT: localhost;Current Load;OK;HARD;4;OK - load average: 1.60, 2.35, 2.80
[1338912397] SERVICE FLAPPING ALERT: localhost;Current Load;STARTED; Service app ears to have started flapping (20.9% change >= 20.0% threshold)
[1338913037] SERVICE NOTIFICATION: nagiosadmin;Blog.dev.vlmc;Memory Usage;WARNIN G;xi_service_notification_handler;WARNING - 100 / 1002 MB (9%) Free Memory, Used : 902 MB, Shared: 0 MB, Buffers: 155 MB, Cached: 266 MB
[1338913385] ndomod: Error writing to data sink! Some output may get lost...
[1338913385] ndomod: Please check remote ndo2db log, database connection or SSL Parameters
[1338913396] Caught SIGTERM, shutting down...
[1338913396] Successfully shutdown... (PID=20368)
[1338913396] ndomod: Shutdown complete.
[1338913396] Event broker module '/usr/local/nagios/bin/ndomod.o' deinitialized successfully.
Thank you!