Page 1 of 3
Eval extended but data collection stopped - why?
Posted: Tue Oct 08, 2013 12:01 am
by joe.ward
Nagios XI evaluation was to expired 10/6. Contacted sales and they sent 30-day extension. Installed it last week and immediately the 30-day countdown was displayed in Nagios XI. Enterprise extension has 11 days left on original eval.
Rebooting the server did not straighten things out.
Nagios stopped saving data and sending alerts on Sunday 10/6 around 20:00 EDT. Why?
Re: Eval extended but data collection stopped - why?
Posted: Tue Oct 08, 2013 9:33 am
by slansing
Installed it last week and immediately the 30-day countdown was displayed in Nagios XI. Enterprise extension has 11 days left on original eval.
It sounds like what you had asked for is working properly, are you trying to extend enterprise edition? If so you will need to contact sales again and ask their help in this. It looks like you just extended XI Base.
Re: Eval extended but data collection stopped - why?
Posted: Tue Oct 08, 2013 9:56 am
by joe.ward
The hosts do not have any performance data since 20:00 EDT on 10/6. The performance charts are empty.
The Enterprise evaluation has 11 days left according to the license information page. Even if Enterprise license expires, the performance data should still be captured shouldn't it? Data was available before I activated the Enterprise evaluation.
In CCM, I pick hosts or services and "apply" to write the configuration files again, then it never completes. It says it is waiting for the verification (2nd of the three steps). I can abort, go to Tools, Write Configuration, Verify and it passes without issue.
System Status shows
Green for: Monitoring Engine, Performance Grapher, Database Backend
Red for : Database Maintenance, Command Subsustem, Event Manager, Feed Processor, Report Engine, Cleaner, Nonstop Operations Manager, System Statistics.
When the Enterprise evaluation expires, I will contact Sales about extending.
Thank you
Re: Eval extended but data collection stopped - why?
Posted: Tue Oct 08, 2013 4:35 pm
by sreinhardt
You are correct, there is no reason that I know of, that adding an extension key would stop perfdata or your configurations. It seems like some other things may be going on though, than just the entry of that key. Let's manually check some service statuses and such to see what might be going on.
Code: Select all
service npcd status
service ndo2db status
ll /usr/local/nagios/var/spool/perfdata/
ll /usr/local/nagios/var/spool/checkresults/
ll /usr/local/nagios/var/spool/xidpe/
ll -R /usr/local/nagios/share/perfdata/
Re: Eval extended but data collection stopped - why?
Posted: Wed Oct 09, 2013 9:53 am
by joe.ward
The results are attached:
Re: Eval extended but data collection stopped - why?
Posted: Wed Oct 09, 2013 4:24 pm
by abrist
You have a large number of checkresults that are not getting reaped. Remove them if at all possible:
Code: Select all
rm -f /usr/local/nagios/var/spool/checkresults/*
Restart npcd:
These probably spooled up when the eval ran out.
Let us know if you cannot delete due to stat() issues.
Additionally, lets check your php.ini limits:
Code: Select all
grep "memory_limit\|max_execution\|max_input" /etc/php.ini
Re: Eval extended but data collection stopped - why?
Posted: Thu Oct 10, 2013 8:51 am
by joe.ward
Removed the spool files
Restarted NCPD
NO relief
When I go to a host and pick a service, schedule an immediate check, it comes back that the server is busy (see attachment)
What values should php.ini have? The server has 8 GB RAM.
From php.ini
; max_input_time
max_execution_time = 30
max_input_time = 60
;max_input_nesting_level = 64
memory_limit = 128M
Perhaps the attached screenshots will help.
Re: Eval extended but data collection stopped - why?
Posted: Thu Oct 10, 2013 4:18 pm
by slansing
Is cron running?
If not, or even if it is, run:
and try to schedule an immediate check again, then send us the output of this command:
Re: Eval extended but data collection stopped - why?
Posted: Fri Oct 11, 2013 9:57 am
by joe.ward
The cron log file had many lines about PAM Authentication failure. Turns out the login for the 'nagios' user had expired! I set it to non-expiring and received a lot of queued alerts.
Alerts still trigger email but the performance graphs are not showing information now, several hours since fixing the nagios login password.
What else can I check to restore the performance graphs?
Re: Eval extended but data collection stopped - why?
Posted: Fri Oct 11, 2013 10:32 am
by slansing
Can you check the following again please?:
Code: Select all
service npcd status
service ndo2db status
ll /usr/local/nagios/var/spool/perfdata/
ll /usr/local/nagios/var/spool/checkresults/
ll /usr/local/nagios/var/spool/xidpe/
ll -R /usr/local/nagios/share/perfdata/
And also the output of:
Code: Select all
tail -30 /usr/local/nagios/var/perfdata.log