Eval extended but data collection stopped - why?

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
joe.ward
Posts: 44
Joined: Tue Aug 06, 2013 10:44 am

Eval extended but data collection stopped - why?

Post by joe.ward »

Nagios XI evaluation was to expired 10/6. Contacted sales and they sent 30-day extension. Installed it last week and immediately the 30-day countdown was displayed in Nagios XI. Enterprise extension has 11 days left on original eval.

Rebooting the server did not straighten things out.

Nagios stopped saving data and sending alerts on Sunday 10/6 around 20:00 EDT. Why?
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Eval extended but data collection stopped - why?

Post by slansing »

Installed it last week and immediately the 30-day countdown was displayed in Nagios XI. Enterprise extension has 11 days left on original eval.
It sounds like what you had asked for is working properly, are you trying to extend enterprise edition? If so you will need to contact sales again and ask their help in this. It looks like you just extended XI Base.
joe.ward
Posts: 44
Joined: Tue Aug 06, 2013 10:44 am

Re: Eval extended but data collection stopped - why?

Post by joe.ward »

The hosts do not have any performance data since 20:00 EDT on 10/6. The performance charts are empty.

The Enterprise evaluation has 11 days left according to the license information page. Even if Enterprise license expires, the performance data should still be captured shouldn't it? Data was available before I activated the Enterprise evaluation.

In CCM, I pick hosts or services and "apply" to write the configuration files again, then it never completes. It says it is waiting for the verification (2nd of the three steps). I can abort, go to Tools, Write Configuration, Verify and it passes without issue.

System Status shows
Green for: Monitoring Engine, Performance Grapher, Database Backend
Red for : Database Maintenance, Command Subsustem, Event Manager, Feed Processor, Report Engine, Cleaner, Nonstop Operations Manager, System Statistics.

When the Enterprise evaluation expires, I will contact Sales about extending.

Thank you
sreinhardt
-fno-stack-protector
Posts: 4366
Joined: Mon Nov 19, 2012 12:10 pm

Re: Eval extended but data collection stopped - why?

Post by sreinhardt »

You are correct, there is no reason that I know of, that adding an extension key would stop perfdata or your configurations. It seems like some other things may be going on though, than just the entry of that key. Let's manually check some service statuses and such to see what might be going on.

Code: Select all

service npcd status
service ndo2db status
ll /usr/local/nagios/var/spool/perfdata/
ll /usr/local/nagios/var/spool/checkresults/
ll /usr/local/nagios/var/spool/xidpe/
ll -R /usr/local/nagios/share/perfdata/
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
joe.ward
Posts: 44
Joined: Tue Aug 06, 2013 10:44 am

Re: Eval extended but data collection stopped - why?

Post by joe.ward »

The results are attached:
You do not have the required permissions to view the files attached to this post.
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Eval extended but data collection stopped - why?

Post by abrist »

You have a large number of checkresults that are not getting reaped. Remove them if at all possible:

Code: Select all

rm -f /usr/local/nagios/var/spool/checkresults/*
Restart npcd:

Code: Select all

service npcd start
These probably spooled up when the eval ran out.
Let us know if you cannot delete due to stat() issues.
Additionally, lets check your php.ini limits:

Code: Select all

grep "memory_limit\|max_execution\|max_input" /etc/php.ini
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
joe.ward
Posts: 44
Joined: Tue Aug 06, 2013 10:44 am

Re: Eval extended but data collection stopped - why?

Post by joe.ward »

Removed the spool files

Restarted NCPD

NO relief :-(

When I go to a host and pick a service, schedule an immediate check, it comes back that the server is busy (see attachment)

What values should php.ini have? The server has 8 GB RAM.
From php.ini
; max_input_time
max_execution_time = 30
max_input_time = 60
;max_input_nesting_level = 64
memory_limit = 128M

Perhaps the attached screenshots will help.
You do not have the required permissions to view the files attached to this post.
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Eval extended but data collection stopped - why?

Post by slansing »

Is cron running?

Code: Select all

service crond status
If not, or even if it is, run:

Code: Select all

service crond restart
and try to schedule an immediate check again, then send us the output of this command:

Code: Select all

tail -50 /var/log/cron
joe.ward
Posts: 44
Joined: Tue Aug 06, 2013 10:44 am

Re: Eval extended but data collection stopped - why?

Post by joe.ward »

The cron log file had many lines about PAM Authentication failure. Turns out the login for the 'nagios' user had expired! I set it to non-expiring and received a lot of queued alerts.

Alerts still trigger email but the performance graphs are not showing information now, several hours since fixing the nagios login password.

What else can I check to restore the performance graphs?
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Eval extended but data collection stopped - why?

Post by slansing »

Can you check the following again please?:

Code: Select all

service npcd status
service ndo2db status
ll /usr/local/nagios/var/spool/perfdata/
ll /usr/local/nagios/var/spool/checkresults/
ll /usr/local/nagios/var/spool/xidpe/
ll -R /usr/local/nagios/share/perfdata/
And also the output of:

Code: Select all

tail -30 /usr/local/nagios/var/perfdata.log
Locked