command subsystem (job scheduling) broken in 2024R1.3.2 upgrade

CBoekhuis · Post by **CBoekhuis** » Thu Apr 10, 2025 8:20 am

Hi,

I just upgrade our test cluster from 2024R1.3.1 to 2024R1.3.2. Unfortunately the job scheduling from the command subsystem has gone "crazy".
The 5 jobs keeps resetten the 'next run time" to a fixed time/date in the past, which causes the jobs to run again immediately.

Nagios 2024R1.3.2
2 server (VM) cluster running Red Hat 9.5

I've tried the obvious stop elasticsearch on both nodes and then reboot both nodes. No change.
I've tried the Reset All jobs, but as soon as a job runs, it will set the next run time to the value of approximately the actual start time. Or so it seems.
I can edit a job to run much later which will stop it to run, but as soon as it runs it again will set the next run time to the value of approximately the actual start time causing it to go in a run loop.

First question, How can I disable the job scheduling manually all together for now? This to stop waisting resources for the moment.

What I've noticed for now:
- the 3 jobs cleanup_cmdsubsys, run_all_alerts and run_index_usage wil restart +/- every second.
- the 2 jobs backups and snapshots_maintenance will start a job every minute regardless if there's allready a process running
- the run_update_check job I can't figure out yet. At first it ran a few times per minute, but now it seems stuck in a running state.

Here's a screenshot of my situation, the backups and snapshots_maintenance jobs are rescheduled for in the future, they will eat up my system if they run every minute. The cleanup_cmdsubsys job I rescheduled to 04/10/2025 14:43:25, since then it runs every second. As you can see the run next time is stuck in the past and won't be updated correctly.

command_schedule.png

Hope you have a resolution, obviously I won't update the production cluster for now

.

Kind regards....Hans

jmichaelson · Post by **jmichaelson** » Thu Apr 10, 2025 3:45 pm

Hi @CBoekhuis, thanks for reaching out.

This is related to #17 in the changelog for R1.3.2. We're working on a fix for it.

CBoekhuis · Post by **CBoekhuis** » Fri Apr 11, 2025 3:17 am

Thanks for the update, I'm looking forward for the fix

.

jmichaelson · Post by **jmichaelson** » Mon Apr 14, 2025 1:30 pm

Hi @CBoekhuis

we have a fix for this, and we'd like to request that you open a support case via the customer support portal at https://support.nagios.com/ so we can officially get the fix to you.

CBoekhuis · Post by **CBoekhuis** » Mon Apr 14, 2025 4:58 pm

Thanks Jason, I just created the case.

jmichaelson · Post by **jmichaelson** » Thu Apr 17, 2025 2:08 pm

Hi @ CBoekhuis, just checking in to see if you got the hot fix for this from support?

CBoekhuis · Post by **CBoekhuis** » Fri Apr 18, 2025 1:47 am

Hi, yes I did get the fix and can confirm it fixed the issue.
This topic can be closed

jmichaelson · Post by **jmichaelson** » Fri Apr 18, 2025 12:37 pm

Good to know. Thank you much!

Nagios Support Forum

command subsystem (job scheduling) broken in 2024R1.3.2 upgrade

command subsystem (job scheduling) broken in 2024R1.3.2 upgrade

Re: command subsystem (job scheduling) broken in 2024R1.3.2 upgrade

Re: command subsystem (job scheduling) broken in 2024R1.3.2 upgrade

Re: command subsystem (job scheduling) broken in 2024R1.3.2 upgrade

Re: command subsystem (job scheduling) broken in 2024R1.3.2 upgrade

Re: command subsystem (job scheduling) broken in 2024R1.3.2 upgrade

Re: command subsystem (job scheduling) broken in 2024R1.3.2 upgrade

Re: command subsystem (job scheduling) broken in 2024R1.3.2 upgrade