I am working around to find the solution, but still i unable to find the solution of my problem. i am using centos7 and installed nagiosXi in my system but on the nagios home page i can see a alert on the top right side which saying that ndo2db is not running, i tried to restart it several time but still the problem is persist.
OUTPUT
[root@localhost subsys]# service ndo2db status
ndo2db is not running but subsystem locked
## TABLE TRIMMING OPTIONS
# Several database tables containing Nagios event data can become quite large
# over time. Most admins will want to trim these tables and keep only a
# certain amount of data in them. The options below are used to specify the
# age (in MINUTES) that data should be allowd to remain in various tables
# before it is deleted. Using a value of zero (0) for any value means that
# that particular table should NOT be automatically trimmed.
# Keep timed events for 24 hours
max_timedevents_age=1440
# Keep system commands for 1 week
max_systemcommands_age=10080
# Keep service checks for 1 week
max_servicechecks_age=10080
# Keep host checks for 1 week
max_hostchecks_age=10080
# Keep event handlers for 31 days
max_eventhandlers_age=44640
# DEBUG LEVEL
# This option determines how much (if any) debugging information will
# be written to the debug file. OR values together to log multiple
# types of information.
# Values: -1 = Everything
# 0 = Nothing
# 1 = Process info
# 2 = SQL queries
debug_level=0
# DEBUG VERBOSITY
# This option determines how verbose the debug log out will be.
# Values: 0 = Brief output
# 1 = More detailed
# 2 = Very detailed
debug_verbosity=1
# DEBUG FILE
# This option determines where the daemon should write debugging information.
debug_file=/usr/local/nagios/var/ndo2db.debug
# MAX DEBUG FILE SIZE
# This option determines the maximum size (in bytes) of the debug file. If
# the file grows larger than this size, it will be renamed with a .old
# extension. If a file already exists with a .old extension it will
# automatically be deleted. This helps ensure your disk space usage doesn't
# get out of control when debugging.
max_debug_file_size=1000000
Let me know what other information i can provide you to help you to know the root cause of the problem.
[root@localhost ~]# rm /usr/local/nagios/var/ndo2db.lock
rm: remove regular empty file ‘/usr/local/nagios/var/ndo2db.lock’? y
[root@localhost ~]# service ndo2db restart
Restarting ndo2db (via systemctl): [ OK ]
[root@localhost ~]# service ndo2db status
ndo2db is not running but subsystem locked
[root@localhost ~]#
Once you have the debug output, do not forget to decrease the debug level in ndo2db.cfg back to defaults.
Was this system restored from a backup to centos/rhel 7 by chance? I ask because you may have issues with the old mysql libs not existing as in cent/rhel 7 they moved to mariadb. If that is the case, lets us know and I will get you the steps to rebuild ndo from the nagiosxi tarball.
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
No such file is there at the specified location tail -25 /usr/local/nagios/var/ndo2db.debug
tail -25 /var/log/messages
[root@localhost ~]# tail -25 /var/log/messages
May 28 05:55:02 localhost systemd: Started Session 4053 of user nagios.
May 28 05:55:02 localhost systemd: Starting Session 4052 of user nagios.
May 28 05:55:02 localhost systemd: Started Session 4052 of user nagios.
May 28 05:55:02 localhost systemd: Starting Session 4049 of user nagios.
May 28 05:55:02 localhost systemd: Started Session 4049 of user nagios.
May 28 05:55:02 localhost systemd: Starting Session 4047 of user nagios.
May 28 05:55:02 localhost systemd: Started Session 4047 of user nagios.
May 28 05:55:02 localhost systemd: Starting Session 4048 of user nagios.
May 28 05:55:02 localhost systemd: Started Session 4048 of user nagios.
May 28 05:56:01 localhost systemd: Starting Session 4063 of user nagios.
May 28 05:56:01 localhost systemd: Started Session 4063 of user nagios.
May 28 05:56:01 localhost systemd: Starting Session 4062 of user nagios.
May 28 05:56:01 localhost systemd: Started Session 4062 of user nagios.
May 28 05:56:01 localhost systemd: Starting Session 4064 of user nagios.
May 28 05:56:01 localhost systemd: Started Session 4064 of user nagios.
May 28 05:56:01 localhost systemd: Starting Session 4061 of user nagios.
May 28 05:56:01 localhost systemd: Started Session 4061 of user nagios.
May 28 05:56:01 localhost systemd: Starting Session 4058 of user nagios.
May 28 05:56:01 localhost systemd: Started Session 4058 of user nagios.
May 28 05:56:01 localhost systemd: Starting Session 4060 of user nagios.
May 28 05:56:01 localhost systemd: Started Session 4060 of user nagios.
May 28 05:56:01 localhost systemd: Starting Session 4057 of user nagios.
May 28 05:56:01 localhost systemd: Started Session 4057 of user nagios.
May 28 05:56:01 localhost systemd: Starting Session 4059 of user nagios.
May 28 05:56:01 localhost systemd: Started Session 4059 of user nagios.
tail -25 /usr/local/nagios/var/nagios.log
[root@localhost ~]# tail -25 /usr/local/nagios/var/nagios.log
[1432765507] HOST ALERT: 160.110.246.130;DOWN;HARD;5;CRITICAL - 160.110.246.130: Host unreachable @ 160.110.246.81. rta nan, lost 100%
[1432765507] HOST NOTIFICATION: nagiosadmin;160.110.246.130;DOWN;xi_host_notification_handler;CRITICAL - 160.110.246.130: Host unreachabl e @ 160.110.246.81. rta nan, lost 100%
[1432765539] SERVICE ALERT: 160.110.246.130;Ping;CRITICAL;SOFT;3;CRITICAL - 160.110.246.130: Host unreachable @ 160.110.246.81. rta nan, lost 100%
[1432765600] SERVICE ALERT: 160.110.246.130;Ping;CRITICAL;SOFT;4;CRITICAL - 160.110.246.130: Host unreachable @ 160.110.246.81. rta nan, lost 100%
[1432765659] SERVICE ALERT: 160.110.246.130;Ping;CRITICAL;HARD;5;CRITICAL - 160.110.246.130: Host unreachable @ 160.110.246.81. rta nan, lost 100%
[1432765741] SERVICE NOTIFICATION: nagiosadmin;192.168.10.2;CPU Usage;CRITICAL;xi_service_notification_handler;connect to address 192.168 .10.2 and port 12489: Connection refused
[1432765845] Auto-save of retention data completed successfully.
[1432766104] ndomod: Still unable to connect to data sink. 16445 items lost, 5000 queued items to flush.
[1432767022] ndomod: Still unable to connect to data sink. 17221 items lost, 5000 queued items to flush.
[1432767924] ndomod: Still unable to connect to data sink. 17980 items lost, 5000 queued items to flush.
[1432768052] SERVICE NOTIFICATION: nagiosadmin;192.168.10.2;Memory Usage;CRITICAL;xi_service_notification_handler;connect to address 192. 168.10.2 and port 12489: Connection refused
[1432768314] SERVICE NOTIFICATION: nagiosadmin;192.168.10.2;Drive C: Disk Usage;CRITICAL;xi_service_notification_handler;connect to addre ss 192.168.10.2 and port 12489: Connection refused
[1432768840] ndomod: Still unable to connect to data sink. 18768 items lost, 5000 queued items to flush.
[1432768998] SERVICE NOTIFICATION: nagiosadmin;192.168.10.2;Uptime;CRITICAL;xi_service_notification_handler;connect to address 192.168.10 .2 and port 12489: Connection refused
[1432769191] HOST NOTIFICATION: nagiosadmin;160.110.246.130;DOWN;xi_host_notification_handler;CRITICAL - 160.110.246.130: Host unreachabl e @ 160.110.246.81. rta nan, lost 100%
[1432769445] Auto-save of retention data completed successfully.
[1432769639] SERVICE NOTIFICATION: nagiosadmin;192.168.10.2;CPU Usage;CRITICAL;xi_service_notification_handler;connect to address 192.168 .10.2 and port 12489: Connection refused
[1432769756] ndomod: Still unable to connect to data sink. 19564 items lost, 5000 queued items to flush.
[1432770658] ndomod: Still unable to connect to data sink. 20324 items lost, 5000 queued items to flush.
[1432771575] ndomod: Still unable to connect to data sink. 21091 items lost, 5000 queued items to flush.
[1432771914] SERVICE NOTIFICATION: nagiosadmin;192.168.10.2;Drive C: Disk Usage;CRITICAL;xi_service_notification_handler;connect to addre ss 192.168.10.2 and port 12489: Connection refused
[1432771951] SERVICE NOTIFICATION: nagiosadmin;192.168.10.2;Memory Usage;CRITICAL;xi_service_notification_handler;connect to address 192. 168.10.2 and port 12489: Connection refused
[1432772488] ndomod: Still unable to connect to data sink. 21876 items lost, 5000 queued items to flush.
[1432772598] SERVICE NOTIFICATION: nagiosadmin;192.168.10.2;Uptime;CRITICAL;xi_service_notification_handler;connect to address 192.168.10 .2 and port 12489: Connection refused
[1432772791] HOST NOTIFICATION: nagiosadmin;160.110.246.130;DOWN;xi_host_notification_handler;CRITICAL - 160.110.246.130: Host unreachabl e @ 160.110.246.81. rta nan, lost 100%