Restarting the graphing system
Posted: Tue Nov 04, 2025 4:57 pm
About twice a year, the graphing system in my Ubuntu 22.04-based Nagios XI locks up. The server still runs fine, serves pages, does checks and all the other good stuff Nagios is supposed to do, but the graphing system just stops. In the past, I've rebooted the entire server and graphing begins again, leaving a black hole in the graphs where the outage was. When troubleshooting, I have a (bad?) reputation for putting graphs and charts in my emails to prove my findings and help higher-ups understand what did and did not happen, so missing graph data can be pretty annoying. When this happened again earlier this week, I decided to figure out how to write a cron job to restart the graphing system every night at midnight. To start with, I'm targeting rrdcached and npcd, which seem to be closely involved in the graphing system.
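A minimal sketch of what such a restart job could look like, assuming rrdcached and npcd are managed by systemd under those unit names (an assumption, worth confirming with systemctl on the server). The script name and the --apply flag are made up for illustration. Without --apply it only prints the commands it would run.

```python
#!/usr/bin/env python3
"""Restart the Nagios XI graphing services (sketch; unit names are assumptions)."""
import subprocess
import sys

# Assumed systemd unit names; confirm them on your own server first.
GRAPH_SERVICES = ["rrdcached", "npcd"]


def build_restart_commands(services):
    """Return one `systemctl restart <unit>` argument list per service."""
    return [["systemctl", "restart", name] for name in services]


def restart_services(services, runner=subprocess.run):
    """Run each restart command and return the names of units that failed."""
    failed = []
    for name, cmd in zip(services, build_restart_commands(services)):
        result = runner(cmd)
        if result.returncode != 0:
            failed.append(name)
    return failed


if __name__ == "__main__":
    if "--apply" in sys.argv[1:]:
        # Real run: needs root, for example from root's crontab.
        failed_units = restart_services(GRAPH_SERVICES)
        sys.exit(1 if failed_units else 0)
    # Default is a dry run that only prints what would be executed.
    for command in build_restart_commands(GRAPH_SERVICES):
        print(" ".join(command))
```

Saved to a hypothetical path such as /usr/local/bin/restart_graphing.py, it could be scheduled from root's crontab with a line like "0 0 * * * /usr/bin/python3 /usr/local/bin/restart_graphing.py --apply". The non-zero exit code on failure is there so cron's mail or a wrapper check can flag a restart that did not go through.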
Does this happen to anyone else? If so, how do you deal with it? Also, would it be more effective to just reboot the entire server via cron job? I did just upgrade to 2026R1 today, which I'm hopeful might fix that graphing issue as well.