help! monitoring engine stopped can't restart
Posted: Tue Jul 29, 2014 3:34 pm
We are having a critical problem with our production server and are looking for your help.
The Monitoring engine status bogged down gradually got worse until this a.m. when it stopped scheduling host and service checks altogether. It looks to be a NagiosXI problem when we login to core the checks are running. At this time, the top command show that the ndo2db was taking up 98%-100% of the cpu.
We found several support cases in the NagiosXi support forum that reported a similar problem and tried restarting httpd, nagios, mysqld, postgresql with no success.
As a disclaimer, we upgraded yesterday to the Interface Table v 0.05-1 and pnp4nagios 0.6.21. (unsupported we know, but we have installed and run successfully on 2 test instances)
thanks,
Penny Karr | IT Infrastructure Monitoring
Harvard Vanguard Medical Associates, an Affiliate of Atrius Health
254 Second Avenue | Needham, MA 02494
P (781) 292-1853 | F (781 292-1980 | http://www.harvardvanguard.org
Email: [email protected]
The Monitoring engine status bogged down gradually got worse until this a.m. when it stopped scheduling host and service checks altogether. It looks to be a NagiosXI problem when we login to core the checks are running. At this time, the top command show that the ndo2db was taking up 98%-100% of the cpu.
We found several support cases in the NagiosXi support forum that reported a similar problem and tried restarting httpd, nagios, mysqld, postgresql with no success.
As a disclaimer, we upgraded yesterday to the Interface Table v 0.05-1 and pnp4nagios 0.6.21. (unsupported we know, but we have installed and run successfully on 2 test instances)
thanks,
Penny Karr | IT Infrastructure Monitoring
Harvard Vanguard Medical Associates, an Affiliate of Atrius Health
254 Second Avenue | Needham, MA 02494
P (781) 292-1853 | F (781 292-1980 | http://www.harvardvanguard.org
Email: [email protected]