Strange recheck timing issue

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
askewdread
Posts: 69
Joined: Wed Nov 16, 2016 4:54 pm

Re: Strange recheck timing issue

Post by askewdread »

kyang wrote: Thanks, could you also tell me the time on XI?

Timezone settings.

http://yourXIipAddress/nagiosxi/admin/globalconfig.php
timezone.PNG
kyang

Re: Strange recheck timing issue

Post by kyang »

Let's try checking your Core GUI.
http://ipaddress/nagios

Does it also show the wrong date format on your checks here, or is it correct? (Screenshots would be nice.)
You mentioned that "if you force a check it will then go and run it properly and schedule the next one correctly", and your check_interval is 3 minutes.
If you wait 3 minutes, does the check actually run or not?

Could you send a screenshot of this page?

Admin > System Settings > Defaults tab

Thanks.
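
As a rough sanity check on the numbers in question: with a check_interval of 3 and Nagios Core's default interval_length of 60 seconds, the next check should land about 180 seconds after the last one. Here is a minimal shell sketch of that arithmetic. The function names and the epoch timestamps are made up for illustration, and real scheduling can differ by some seconds because of latency and retry intervals.

```shell
#!/bin/sh
# Illustrative helpers only; these names are not part of Nagios.

# expected_next_check LAST_CHECK_EPOCH CHECK_INTERVAL [INTERVAL_LENGTH]
# Prints the epoch time at which the next check would roughly be expected.
# INTERVAL_LENGTH defaults to 60 seconds (the Nagios Core default).
expected_next_check() {
    last_check=$1
    check_interval=$2
    interval_length=${3:-60}
    echo $(( last_check + check_interval * interval_length ))
}

# schedule_drift LAST_CHECK_EPOCH NEXT_CHECK_EPOCH CHECK_INTERVAL
# Prints how many seconds next_check lies beyond the expected time.
schedule_drift() {
    expected=$(expected_next_check "$1" "$3")
    echo $(( $2 - expected ))
}

# Example with hypothetical timestamps:
#   expected_next_check 1517900000 3             -> 1517900180
#   schedule_drift 1517900000 1525000000 3       -> 7099820 (about 82 days)
```

A drift of a few seconds is normal; a drift of weeks or months matches the symptom described in this thread.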
askewdread

Re: Strange recheck timing issue

Post by askewdread »

Core shows the same.
nagios-old.PNG
The 3-minute interval is correct for these, but no, they never recheck. As you can see in the screenshots, the last check was days ago and the next check isn't scheduled for months.
usersettings.PNG
kyang

Re: Strange recheck timing issue

Post by kyang »

Could you PM or post your status.dat file?

Code:

/usr/local/nagios/var/status.dat
After you retrieve the status.dat file, you can then run the repair_databases.sh script on your DB server.

Code:

cd /usr/local/nagiosxi/scripts
./repair_databases.sh
Let us know if that fixes the issue or not.
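
The affected checks can also be listed straight from status.dat. The sketch below is an illustration, not an official Nagios tool. It assumes the usual status.dat layout (hoststatus and servicestatus blocks containing host_name, service_description, last_check and next_check as epoch seconds). The function name and the one-day threshold are arbitrary choices; pass your own path if status.dat lives somewhere else.

```shell
#!/bin/sh
# find_stalled_checks [STATUS_FILE] [MAX_GAP_SECONDS]
# Prints "host;service;last_check;next_check" for every host or service whose
# next_check lies more than MAX_GAP_SECONDS after its last_check.
# Host entries have an empty service field. Objects that have never been
# checked (last_check=0) are skipped.
find_stalled_checks() {
    status_file=${1:-/usr/local/nagios/var/status.dat}
    max_gap=${2:-86400}
    awk -v max_gap="$max_gap" '
        # Start of a hoststatus or servicestatus block: reset collected fields.
        ($1 == "hoststatus" || $1 == "servicestatus") && $2 == "{" {
            in_block = 1; host = ""; svc = ""; last = ""; next_ts = ""
            next
        }
        # End of block: report it if the schedule gap is too large.
        in_block && $1 == "}" {
            if ((last + 0) > 0 && (next_ts + 0) - (last + 0) > (max_gap + 0))
                printf "%s;%s;%s;%s\n", host, svc, last, next_ts
            in_block = 0
            next
        }
        # Inside a block: split "key=value" at the first "=".
        in_block {
            line = $0
            sub(/^[ \t]+/, "", line)
            eq = index(line, "=")
            if (eq == 0) next
            key = substr(line, 1, eq - 1)
            val = substr(line, eq + 1)
            if (key == "host_name") host = val
            else if (key == "service_description") svc = val
            else if (key == "last_check") last = val
            else if (key == "next_check") next_ts = val
        }
    ' "$status_file"
}
```

For example, `find_stalled_checks /path/to/status.dat 3600` would list anything scheduled more than an hour past its last check, which should be empty on a healthy system with 3-minute intervals.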
askewdread

Re: Strange recheck timing issue

Post by askewdread »

I have PM'd the status.dat file. We keep it on a ramdisk, so the path was slightly different :)

That has fixed those ones... that's odd.
kyang

Re: Strange recheck timing issue

Post by kyang »

Ah ok! Thanks.

So there are no check timing issues anymore?
Last edited by kyang on Wed Feb 07, 2018 11:53 am, edited 1 time in total.
Reason: dat file shared with techs!
askewdread

Re: Strange recheck timing issue

Post by askewdread »

Not currently, no. But from what I can see, all the DB repair did was restart the engine, as none were corrupt. Restarting can't be an acceptable solution: 16 checks hadn't run in 4 days, and those checks are what we rely on to know our business is working.
askewdread

Re: Strange recheck timing issue

Post by askewdread »

I have just found 3 new services and 1 host doing the same as yesterday.

This one hasn't been checked in nearly 12 hours, and the screenshot clearly shows that the last check and next check are months apart.
newerrror.PNG
kyang

Re: Strange recheck timing issue

Post by kyang »

Let's truncate some tables in your db.

Run these 3 commands.

Code:

mysql -u ndoutils -pn@gweb nagiosxi -e 'TRUNCATE TABLE xi_meta'
mysql -u ndoutils -pn@gweb nagiosxi -e 'TRUNCATE TABLE xi_events'
mysql -u ndoutils -pn@gweb nagiosxi -e 'TRUNCATE TABLE xi_eventqueue'
Then re-run the repair_databases.sh script.

Code:

cd /usr/local/nagiosxi/scripts
./repair_databases.sh
Let us know how the recheck timing is after that.
askewdread

Re: Strange recheck timing issue

Post by askewdread »

It fixed all the affected ones at the time, but the issue has returned again this morning.
3rdtime.PNG