Monitoring Engine Not Working on DR Server (ramdisk errors)
Posted: Wed Oct 16, 2024 11:50 am
Hello all,
On our Nagios disaster recovery (DR) server, we normally keep the monitoring engine turned off (Admin | System Status, click the icon to start or stop). This morning I updated the server to Nagios 2024R1.3 and thought it would be a good idea to confirm that Nagios was working properly. On the host and service status pages, I found that the 'Last Check' time was not updating.
I can run immediate checks, and 'Last Check' is updated accordingly. I also updated our Nagios development server, which is working fine and does not get the errors shown below. The servers were recently migrated from CentOS to RHEL 9 (finished last July).
Checking status with systemctl gives this:
[root@C155MNAG02 ~]# systemctl status nagios
● nagios.service - Nagios Core 4.5.3
Loaded: loaded (/usr/lib/systemd/system/nagios.service; disabled; preset: disabled)
Active: active (running) since Wed 2024-10-16 12:26:48 EDT; 10min ago
Docs: https://www.nagios.org/documentation
Process: 6820 ExecStartPre=/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg (code=exited, status=0/SUCCESS)
Process: 6821 ExecStart=/usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg (code=exited, status=0/SUCCESS)
Main PID: 6822 (nagios)
Tasks: 6 (limit: 48799)
Memory: 24.4M
CPU: 1.609s
CGroup: /system.slice/nagios.service
├─6822 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
├─6823 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─6824 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─6825 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─6826 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
└─6842 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
Oct 16 12:36:08 C155MNAG02.vornadort.com nagios[6822]: Error: Unable to rename file '/usr/local/nagios/var/nagios.tmpOEn8Gf' to '/ramdisk/status.dat': Permission denied
Oct 16 12:36:08 C155MNAG02.vornadort.com nagios[6822]: Error: Unable to update status data file '/ramdisk/status.dat': Permission denied
Oct 16 12:36:18 C155MNAG02.vornadort.com nagios[6822]: Error: Unable to rename file '/usr/local/nagios/var/nagios.tmpsXf9v9' to '/ramdisk/status.dat': Permission denied
Oct 16 12:36:18 C155MNAG02.vornadort.com nagios[6822]: Error: Unable to update status data file '/ramdisk/status.dat': Permission denied
Oct 16 12:36:28 C155MNAG02.vornadort.com nagios[6822]: Error: Unable to rename file '/usr/local/nagios/var/nagios.tmpA4ar8A' to '/ramdisk/status.dat': Permission denied
Oct 16 12:36:28 C155MNAG02.vornadort.com nagios[6822]: Error: Unable to update status data file '/ramdisk/status.dat': Permission denied
Oct 16 12:36:38 C155MNAG02.vornadort.com nagios[6822]: Error: Unable to rename file '/usr/local/nagios/var/nagios.tmphYGgpE' to '/ramdisk/status.dat': Permission denied
Oct 16 12:36:38 C155MNAG02.vornadort.com nagios[6822]: Error: Unable to update status data file '/ramdisk/status.dat': Permission denied
Oct 16 12:36:48 C155MNAG02.vornadort.com nagios[6822]: Error: Unable to rename file '/usr/local/nagios/var/nagios.tmpqFHViX' to '/ramdisk/status.dat': No such file or directory
Oct 16 12:36:48 C155MNAG02.vornadort.com nagios[6822]: Error: Unable to update status data file '/ramdisk/status.dat': No such file or directory
Checking ramdisk.service status, I get this:
[root@C155MNAG02 ~]# systemctl status ramdisk
● ramdisk.service - Ramdisk
Loaded: loaded (/usr/lib/systemd/system/ramdisk.service; enabled; preset: disabled)
Active: active (exited) since Wed 2024-10-16 12:06:15 EDT; 43min ago
Process: 870 ExecStartPre=/usr/bin/mkdir -p -m 775 /var/nagiosramdisk /var/nagiosramdisk/tmp /var/nagiosramdisk/spool /var/nagiosramdisk/spool/checkresults /var/nagiosramdisk/spool/xidp>
Process: 875 ExecStartPre=/usr/bin/mount -t tmpfs -o size=500m tmpfs /var/nagiosramdisk (code=exited, status=0/SUCCESS)
Process: 876 ExecStartPre=/usr/bin/mkdir -p -m 775 /var/nagiosramdisk /var/nagiosramdisk/tmp /var/nagiosramdisk/spool /var/nagiosramdisk/spool/checkresults /var/nagiosramdisk/spool/xidp>
Process: 879 ExecStart=/usr/bin/chown -R nagios:nagios /var/nagiosramdisk (code=exited, status=0/SUCCESS)
Main PID: 879 (code=exited, status=0/SUCCESS)
CPU: 8ms
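One thing that stands out when comparing the two outputs: Nagios is trying to write '/ramdisk/status.dat', while ramdisk.service creates and mounts '/var/nagiosramdisk'. A minimal sketch of a check for that mismatch follows. It assumes the status file location comes from the status_file directive in nagios.cfg in the usual key=value form; the helper names status_file_path and check_status_dir are hypothetical, made up for illustration.

```shell
#!/bin/sh
# Print the status_file value from a Nagios main config file.
# Assumes the usual "key=value" layout; if the key appears twice, the last one is printed.
status_file_path() {
    sed -n 's/^[[:space:]]*status_file[[:space:]]*=[[:space:]]*//p' "$1" | tail -n 1
}

# Report whether the directory that should hold the status file
# exists and is writable by the user running this script.
check_status_dir() {
    dir=$(dirname "$1")
    if [ ! -d "$dir" ]; then
        echo "missing: $dir"
    elif [ ! -w "$dir" ]; then
        echo "not writable: $dir"
    else
        echo "ok: $dir"
    fi
}

# Config path taken from the systemctl output above.
cfg=/usr/local/nagios/etc/nagios.cfg
if [ -r "$cfg" ]; then
    check_status_dir "$(status_file_path "$cfg")"
fi
```

Because root passes the writability test on almost any directory, the result is only meaningful when the script is run as the nagios user rather than as root.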
Any assistance resolving this issue would truly be appreciated. Please let me know if you need additional information.
Thank you and best regards,
Steve