check_http is stuck in pending.
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: check_http is stuck in pending.
I've been trying to replicate this (unsuccessfully). Can you tell me how you add a new check_http service: are you using a wizard, doing it through the CCM, or something else?
Re: check_http is stuck in pending.
scottwilkerson wrote: I've been trying to replicate this (unsuccessfully). Can you tell me how you add a new check_http service: are you using a wizard, doing it through the CCM, or something else?
I have been using the CCM.
-
scottwilkerson
Re: check_http is stuck in pending.
Can you try running the standard website wizard for one service to see if it works properly?
Thanks
Re: check_http is stuck in pending.
scottwilkerson wrote: Can you try running the standard website wizard for one service to see if it works properly? Thanks
That's kind of a problem because I removed the services associated with the Nagios XI wizard... Is there a way to get them back without reinstalling?
Or when I reinstall, is there a way to import my current settings without removing existing things?
-
scottwilkerson
Re: check_http is stuck in pending.
You can restore the default commands and templates as follows: unzip the attached file and copy xiwzardtemplates.cfg to
/usr/local/nagios/etc/import
Then run the following from the command line:
Code: Select all
chown nagios:nagios /usr/local/nagios/etc/import/xiwzardtemplates.cfg
cd /usr/local/nagiosxi/scripts
./reconfigure_nagios.sh
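For anyone repeating these steps, here is a minimal sketch that wraps them in a shell function with a couple of sanity checks. The function name restore_wizard_templates and its arguments are hypothetical, not part of Nagios XI; the default paths and the nagios:nagios owner are the ones given in the post above, and the copy step assumes you have already unzipped the attachment.

```shell
#!/bin/sh
# Sketch only: restore_wizard_templates is a hypothetical helper, not a Nagios XI tool.
# Arguments (all but the first are optional):
#   $1  path to the unzipped xiwzardtemplates.cfg
#   $2  import directory   (default taken from the post above)
#   $3  scripts directory  (default taken from the post above)
#   $4  owner for the copied file (default nagios:nagios, as in the post)
restore_wizard_templates() {
    src="$1"
    import_dir="${2:-/usr/local/nagios/etc/import}"
    scripts_dir="${3:-/usr/local/nagiosxi/scripts}"
    owner="${4:-nagios:nagios}"

    # Stop early if the inputs are not where we expect them.
    [ -f "$src" ] || { echo "missing file: $src" >&2; return 1; }
    [ -d "$import_dir" ] || { echo "missing directory: $import_dir" >&2; return 1; }

    # Copy the config into the import directory and fix its ownership.
    cp "$src" "$import_dir/" || return 1
    chown "$owner" "$import_dir/$(basename "$src")" || return 1

    # Run the reconfigure script from its own directory, in a subshell so the
    # caller's working directory is left unchanged.
    (cd "$scripts_dir" && ./reconfigure_nagios.sh)
}

# Example call (run as root on the Nagios XI server):
# restore_wizard_templates ./xiwzardtemplates.cfg
```

The function returns non-zero as soon as any step fails, so the reconfigure script is never run against a half-copied file.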
Re: check_http is stuck in pending.
We had similar issues on several occasions, and they were all related to a "corrupt" NDO database.
Some hosts and services were shown in PENDING state, services would have notifications disabled at random, etc.
Applying a new configuration from the CCM would not help, since the historic data in the NDO database is still present.
This issue was very hard to debug and I haven't really figured out what the cause was.
However, I did find some correlations: it occurred more often when upgrading to a new version of Nagios XI, when multiple Nagios servers were writing to a single NDO database (an extremely uncommon setup, done for highly specific purposes), or when the MySQL database was replicated between multiple nodes.
Regarding the solution, I first tried to manually search through the database (http://nagios.sourceforge.net/docs/ndou ... _Model.pdf is a necessary resource here), identifying and deleting "suspicious" entries. You should search both historic and current tables.
It's difficult to tell which entries are suspicious in general, though :(
After some time I found it too tiresome, and I usually fixed this by restoring a recent database backup...
I know this is not a direct solution, but I hope you find at least some information here useful :)
-
scottwilkerson
Re: check_http is stuck in pending.
lmilkovic wrote: We had similar issues on several occasions, and they were all related to a "corrupt" NDO database.
This is what I was thinking too, and why I suggested repairing the DB back here:
http://support.nagios.com/forum/viewtop ... 211#p28092
Re: check_http is stuck in pending.
Sorry Scott, I hadn't seen that post :(
However, in my experience, the database was not corrupted in the "MySQL-style corruption" sense, i.e. all tables were functional, no indexes were corrupt, etc. The database structure and internals were working fine, but some "ghost" entries and other suspicious entries were poisoning the database and causing issues similar to the one described here. I had to manually delete those entries (for example: entries for the same check and same check time, but containing different results/textual descriptions).
It is of course recommended to run the repairmysql script as well, as you advised.
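As a rough illustration of the "same check, same check time, different results" pattern, here is a small sketch that flags such entries in an exported list of check results. It does not touch the database: it assumes you have already exported rows as tab-separated text with three columns (a check/object identifier, the check time, and the output text). That column layout and the function name find_conflicting_checks are assumptions made for this example, not something taken from the NDO schema.

```shell
#!/bin/sh
# Sketch only: find_conflicting_checks is a hypothetical helper.
# Input (stdin or files): tab-separated rows of
#   object_id <TAB> check_time <TAB> output
# Output: each object_id/check_time pair that occurs with more than one
# distinct output, one pair per line, sorted.
find_conflicting_checks() {
    awk -F '\t' '
        {
            key = $1 FS $2
            # Count each distinct output only once per object/time pair.
            if (!((key, $3) in seen)) {
                seen[key, $3] = 1
                distinct[key]++
            }
        }
        END {
            for (key in distinct)
                if (distinct[key] > 1)
                    print key
        }
    ' "$@" | sort
}

# Example call with a hypothetical export file:
# find_conflicting_checks servicechecks.tsv
```

Identical repeated rows are not reported, only pairs whose outputs disagree, so the result is a list of candidates to inspect by hand rather than a list of rows to delete.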
[SOLVED] check_http is stuck in pending.
scottwilkerson wrote: You can restore the default commands and templates as follows: unzip the attached file and copy xiwzardtemplates.cfg to
/usr/local/nagios/etc/import
Then run the following from the command line:
Code: Select all
chown nagios:nagios /usr/local/nagios/etc/import/xiwzardtemplates.cfg
cd /usr/local/nagiosxi/scripts
./reconfigure_nagios.sh
Turns out that using the Nagios XI wizard to create these services fixes the problem. No explanation why... But it works!
-
scottwilkerson
Re: check_http is stuck in pending.
Excellent! 