Blank space in graphs

cwscribner
Posts: 316
Joined: Thu Mar 31, 2011 9:54 am
Location: Patten, ME

Re: Blank space in graphs

Post by cwscribner »

Got it. So basically I should leave everything to the default...

Any thoughts on whether or not this would be considered solved?
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises

Re: Blank space in graphs

Post by scottwilkerson »

I would suggest reducing the sleep time to 10.

As for whether this is resolved, I think it will only really be resolved once you get your new hardware ;)
cwscribner

Re: Blank space in graphs

Post by cwscribner »

Just wanted to follow up on this.

We added a second 8-core CPU for a total of 16 cores. The load averages now hover at about half of what they were previously, around 7-10. Unfortunately, I'm still seeing a lot of timeout errors in npcd.log and many graphs with blank spots.

The most recent 500 lines look like this...

Code:

[02-23-2012 09:40:49] NPCD: ERROR: Executed command exits with return code '7'
[02-23-2012 09:40:49] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//service-perfdata.1330008018'
scottwilkerson

Re: Blank space in graphs

Post by scottwilkerson »

Return code 7 is a timeout. What is TIMEOUT set to in
/usr/local/nagios/etc/pnp/process_perfdata.cfg?
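
One way to read that value without opening the file is sketched below. The helper name get_perfdata_timeout is made up for illustration, and the sketch assumes the setting is written as "TIMEOUT = <n>" on a line of its own.

```shell
# Print the TIMEOUT value (seconds) from a PNP process_perfdata.cfg.
# Assumption: the setting looks like "TIMEOUT = 5" on its own line.
# get_perfdata_timeout is a hypothetical helper, not part of PNP.
get_perfdata_timeout() {
    cfg="${1:-/usr/local/nagios/etc/pnp/process_perfdata.cfg}"
    # Commented-out lines start with '#', so the anchored pattern skips them.
    sed -n 's/^[[:space:]]*TIMEOUT[[:space:]]*=[[:space:]]*\([0-9][0-9]*\).*/\1/p' "$cfg"
}
```

Calling get_perfdata_timeout with no argument reads the path mentioned above; pass another path as the first argument to check a different file.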
cwscribner

Re: Blank space in graphs

Post by cwscribner »

5
scottwilkerson

Re: Blank space in graphs

Post by scottwilkerson »

That should be long enough...

How many files do we have in the folder?

Code:

ls /usr/local/nagios/var/spool/perfdata | wc -l
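
If the directory holds a very large number of files, ls can be slow because it sorts its output before printing. A variant using find is sketched below; it skips the sort and counts only regular files. The function name count_spool_files is invented for this example, and -maxdepth assumes GNU or BSD find.

```shell
# Count regular files directly inside the perfdata spool directory.
# count_spool_files is a hypothetical helper name.
count_spool_files() {
    spool_dir="${1:-/usr/local/nagios/var/spool/perfdata}"
    # find does not sort, and -type f leaves out subdirectories.
    # tr strips the padding some wc implementations add.
    find "$spool_dir" -maxdepth 1 -type f | wc -l | tr -d ' '
}
```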
cwscribner

Re: Blank space in graphs

Post by cwscribner »

497089
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Blank space in graphs

Post by mguthrie »

Hmm, those files are supposed to be reaped and cleaned up every few seconds. From here you have two options. You can increase the timeout quite a bit, giving each thread more time to scan through the directory, and see whether it slowly brings the file count back down to normal. Or you can clear that directory and watch to verify that the files are being cleaned up on a regular basis. There shouldn't be more than a handful of files in there at a time, and they should be deleted every 15 seconds once they've been processed.

[EDIT]
I just checked the PNP changelog, and it looks like the latest version raised the default process_perfdata timeout from 5 to 15 seconds. I'd suggest trying that on your system, since it's a larger install.
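
Both options can be sketched as small shell helpers. The function names are made up for this example, the config line is assumed to look like "TIMEOUT = <n>", and -maxdepth and -delete assume GNU or BSD find. With roughly half a million files, a plain rm * can fail with "Argument list too long", which is why the sketch deletes through find instead.

```shell
# Option 1: set TIMEOUT in process_perfdata.cfg, keeping a .bak copy.
# Assumption: the setting is written as "TIMEOUT = <n>" on its own line.
set_perfdata_timeout() {
    new_timeout="$1"
    cfg="${2:-/usr/local/nagios/etc/pnp/process_perfdata.cfg}"
    # Accept digits only, so nothing unexpected reaches sed.
    case "$new_timeout" in
        ''|*[!0-9]*) echo "timeout must be a whole number" >&2; return 1 ;;
    esac
    cp -p "$cfg" "$cfg.bak" &&
        sed "s/^[[:space:]]*TIMEOUT[[:space:]]*=.*/TIMEOUT = $new_timeout/" \
            "$cfg.bak" > "$cfg"
}

# Option 2: delete the backlog of spool files, leaving the directory
# itself and any subdirectories in place.
clear_perfdata_spool() {
    spool_dir="${1:-/usr/local/nagios/var/spool/perfdata}"
    find "$spool_dir" -maxdepth 1 -type f -delete
}
```

For example, set_perfdata_timeout 15 followed by clear_perfdata_spool applies both suggestions against the default paths. Both will likely need to run as root or the nagios user.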
cwscribner

Re: Blank space in graphs

Post by cwscribner »

I deleted all of the files, and now when I check the file count it fluctuates between 0 and 4 files, so it is indeed reaping files. I also increased the timeout to 15. I'll update in a few days, after it's had a chance to collect data.
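
To keep an eye on the count over time rather than checking by hand, a small sampling loop like the following could be used. This is only a sketch: watch_spool_count and its arguments are invented for illustration, and the 15-second default interval is simply borrowed from the cleanup interval mentioned earlier in the thread.

```shell
# Print a timestamped file count for the spool directory several times.
# Arguments (all optional): directory, number of samples, seconds between samples.
# watch_spool_count is a hypothetical helper name.
watch_spool_count() {
    spool_dir="${1:-/usr/local/nagios/var/spool/perfdata}"
    samples="${2:-5}"
    interval="${3:-15}"
    i=0
    while [ "$i" -lt "$samples" ]; do
        count=$(find "$spool_dir" -maxdepth 1 -type f | wc -l | tr -d ' ')
        printf '%s %s\n' "$(date '+%Y-%m-%d %H:%M:%S')" "$count"
        i=$((i + 1))
        # Sleep between samples, but not after the last one.
        if [ "$i" -lt "$samples" ]; then
            sleep "$interval"
        fi
    done
}
```

A count that stays in the single digits across samples matches the behavior described above; a count that climbs steadily would suggest the files are piling up again.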
mguthrie

Re: Blank space in graphs

Post by mguthrie »

Thanks for the update. Let us know how it goes. We've had this happen on a few larger installs, so it would be good to know how we can prevent it for users in the future.