Added host but no data. Tests pass.

Posted: Sat Oct 14, 2017 7:42 am
by dfmco
I have a Nagios server on the same subnet as the Fusion server but I am unable to get any data from it.

I can telnet to port 80 and port 443, and no firewall rules are blocking the traffic.
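The manual telnet test can be scripted so it is easy to repeat for each fused server. The following is a minimal sketch, assuming `bash` and the coreutils `timeout` command are installed; `check_port` is a hypothetical helper name and `nagios.example.com` is a placeholder hostname. A successful connection only shows that the port is reachable. It says nothing about whether Fusion can authenticate or pull data.

```shell
# Return 0 if a TCP connection to HOST:PORT succeeds within TIMEOUT seconds.
# Usage: check_port HOST PORT [TIMEOUT]   (hypothetical helper, not part of Fusion)
check_port() {
    [ "$#" -ge 2 ] || return 2
    # bash's /dev/tcp pseudo-device opens a TCP socket; timeout bounds the wait.
    timeout "${3:-3}" bash -c 'exec 3<>"/dev/tcp/$1/$2"' _ "$1" "$2" 2>/dev/null
}

# Example (nagios.example.com is a placeholder hostname):
# for port in 80 443; do
#     check_port nagios.example.com "$port" && echo "port $port open" || echo "port $port closed"
# done
```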

My auth_subsys.log contains this same line thousands of times:
PHP Notice: Undefined variable: auth_interval in /usr/local/nagiosfusion/cron/auth_subsys.php on line 25
That file is pretty much blank (just a header at the top and @ signs on each line below).
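When a log repeats the same line thousands of times, a frequency summary can show quickly whether anything else is hiding in it. This is a rough sketch using standard tools; `top_repeated_lines` is a hypothetical helper name, and the log path in the usage comment is taken from the crontab fragment posted later in this thread.

```shell
# Print the N most frequent lines of a log file as "COUNT LINE", most frequent first.
# Usage: top_repeated_lines FILE [N]   (hypothetical helper, not part of Fusion)
top_repeated_lines() {
    sort "$1" | uniq -c | sort -rn | head -n "${2:-5}" |
        awk '{ sub(/^[ \t]+/, ""); print }'   # strip the padding uniq -c adds
}

# Example:
# top_repeated_lines /usr/local/nagiosfusion/var/log/auth_subsys.log 10
```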

I have several other servers that work fine. I have removed and re-added the problem server with no luck.

Re: Added host but no data. Tests pass.

Posted: Mon Oct 16, 2017 9:00 am
by eloyd
I have no suggestions but I'm replying to this because I'm interested in the outcome and I track dozens of forum postings. So it's easier for me to see the ones I've replied to rather than just the ones I'm tracking. :-)

Re: Added host but no data. Tests pass.

Posted: Mon Oct 16, 2017 10:06 am
by kyang
Are the other servers that work fine also on the same subnet?

What version of Fusion are you on? 4.0.1?

Can you post the output of the command below?

cat /etc/cron.d/nagiosfusion

Re: Added host but no data. Tests pass.

Posted: Mon Oct 16, 2017 10:11 am
by dfmco
So I updated to 4.0.1 and now all of my servers are offline! Only one was not reporting beforehand. I get "no servers have checked in with data yet" for all of them now.

#
# /etc/cron.d/nagiosfusion: crontab fragment for nagiosfusion
#

*/5 * * * * nagios /usr/bin/php -q /usr/local/nagiosfusion/cron/dbmaint_subsys.php >>/usr/local/nagiosfusion/var/log/dbmaint_subsys.log 2>&1

* * * * * nagios /usr/bin/php -q /usr/local/nagiosfusion/cron/auth_subsys.php --max-time=60 >>/usr/local/nagiosfusion/var/log/auth_subsys.log 2>&1
* * * * * nagios /usr/bin/php -q /usr/local/nagiosfusion/cron/cmd_subsys.php --max-time=60 >>/usr/local/nagiosfusion/var/log/cmd_subsys.log 2>&1
* * * * * nagios /usr/bin/php -q /usr/local/nagiosfusion/cron/log_subsys.php --max-time=60 >>/usr/local/nagiosfusion/var/log/log_subsys.log 2>&1
* * * * * nagios /usr/bin/php -q /usr/local/nagiosfusion/cron/poll_subsys.php --max-time=60 --master-poll >>/usr/local/nagiosfusion/var/log/poll_subsys.log 2>&1
* * * * * nagios /usr/bin/php -q /usr/local/nagiosfusion/cron/sysstat_subsys.php --max-time=60 >>/usr/local/nagiosfusion/var/log/sysstat_subsys.log 2>&1

Before the upgrade, I was also noticing that none of the data was current for any of the monitored servers. The data was hours or weeks old.
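Every subsystem job in the crontab fragment above appends to its own log, so one rough way to see whether cron is still running them is to check how recently each log was modified. The sketch below assumes a `find` that supports `-maxdepth` and `-mmin` (GNU and BSD both do); `is_fresh` is a hypothetical helper name. A job that runs but prints nothing will not touch its log, so a stale log is a hint, not proof.

```shell
# Succeed if FILE was modified within the last MINUTES minutes.
# Usage: is_fresh FILE MINUTES   (hypothetical helper, not part of Fusion)
is_fresh() {
    [ -n "$(find "$1" -maxdepth 0 -mmin "-$2" 2>/dev/null)" ]
}

# Example: flag subsystem logs not written to in the last 10 minutes.
# for f in /usr/local/nagiosfusion/var/log/*_subsys.log; do
#     is_fresh "$f" 10 || echo "stale: $f"
# done
```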

Re: Added host but no data. Tests pass.

Posted: Mon Oct 16, 2017 10:21 am
by kyang
Can you also run these commands and post the resulting /tmp/info.txt?

tail -n 100 /usr/local/nagiosfusion/var/log/poll* >/tmp/info.txt
ps -ef --cols=300 >>/tmp/info.txt

Is your Fusion server in the correct timezone? You can check under Home --> nagiosadmin --> Account Preferences --> Timezone.

Re: Added host but no data. Tests pass.

Posted: Mon Oct 16, 2017 10:33 am
by dfmco
File attached. We are in the correct time zone and the server time is correct.

Re: Added host but no data. Tests pass.

Posted: Mon Oct 16, 2017 2:44 pm
by bheden
dfmco wrote:My auth_subsys.log contains this same line thousands of times:
PHP Notice: Undefined variable: auth_interval in /usr/local/nagiosfusion/cron/auth_subsys.php on line 25
That file is pretty much blank (just a header at the top and @ signs on each line below).
Which file is pretty much blank? What @ signs?

Your poll subsys logs look a bit wonky to me. First, let's remove the original server that was having the problem.

Then, run the following command:

/usr/local/nagiosfusion/scripts/truncate_polled.php

Did you happen to tweak your auth interval settings for any of the servers? I am the Fusion4 developer and haven't seen this yet.

Anyway, once you have deleted the server and truncated the polled tables (in that order), do you see data?

Re: Added host but no data. Tests pass.

Posted: Mon Oct 16, 2017 3:19 pm
by dfmco
That fixed the ones that were broken before. Is there anything I should try before adding the problem child back into the mix? This was an out-of-the-box install (OVA), and I added all of the servers at the same time. The problem child always showed all zeros in the tactical overview. Some of the reports suggested it could see that things were down, but nothing was ever reported on the tactical overview (not even host and service counts). I removed and re-added it, but the problem did not change.

Re: Added host but no data. Tests pass.

Posted: Mon Oct 16, 2017 4:16 pm
by dwhitfield
dfmco wrote:So I updated to 4.0.1 and now all of my servers are offline!
You say later in the thread that this was an OVA. Did you actually upgrade the same system or did you download a new OVA?

I don't think there's anything to do before adding the problematic one back. Did you double-check the username and password for that server?

Re: Added host but no data. Tests pass.

Posted: Mon Oct 16, 2017 4:27 pm
by dfmco
I upgraded from 4.0.0 to 4.0.1 using the CLI.

I added it back in and it works now. Since removing and re-adding it on 4.0.0 did not fix the issue, was there a patch in 4.0.1 that made it work this time? I just want to know for future reference.

Thanks for your help!