Several graphs not working after manual build/install

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Several graphs not working after manual build/install

Post by scottwilkerson »

lynrob wrote:Hi Scott,
Sorry for the late reply I didn't have email notifications turned on like I thought I did... Those fields are all blank in the modify hosts screen. That is because the icon is inherited from a template. I tried setting the fields directly in the modify host config but it didn't help:
icon image: polycom.com
icon image alt text: polycom
status image: polycom.com
We need to update the documentation, the images need to also be copied to the following directory

Code: Select all

/usr/local/nagiosxi/html/includes/components/nagioscore/ui/images/logos
lynrob wrote: Back to a previous topic, any thoughts as to why for host graphs and graph explorer the only graphs displayed are for those nodes added after I restored NagiosXI on the new server? All nodes that were covered from the back up display no graphs.

Thanks again for all of your help.
Off the top of my head I don't have an answer for this, but for clarity, what method did you use to make and restore the backup?
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
lynrob
Posts: 30
Joined: Wed Sep 21, 2011 12:29 pm

Re: Several graphs not working after manual build/install

Post by lynrob »

Ok, copying the icon file to the new directory worked!! Another problem solved.

The method I used to backup and restore is the one listed in the NagiosXI documentation using the backup_xi and restore_xi scripts. I checked the /usr/local/nagios/share/perfdata directory to see if the older graphs were being generated and they are... so the graphs exist but they are not displayed for some reason. I compared two similar nodes one added before the backup/restore and one added after. Both have the exact same perf data files updated periodically as one would expect. The only difference is graphs for node that was restored are not displayed. Hopefully that gives you something to go on. Thanks again for all of your help.
lynrob
Posts: 30
Joined: Wed Sep 21, 2011 12:29 pm

Re: Several graphs not working after manual build/install

Post by lynrob »

Hi Scott, FWIW, I just completely deleted one of the old nodes and then added it back in. Unfortunately, this node still does not have its graphs displayed on the "host graphs" page. So newly added nodes work fine but deleted and re-added nodes fails to display.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Several graphs not working after manual build/install

Post by scottwilkerson »

I'm going to have a senior developer take a look to see if he may know what could be causing this.
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Several graphs not working after manual build/install

Post by mguthrie »

For the Graph Explorer, there was a bug in the initial version that prevented some graphs from showing up. Check the Admin->Manage Components page and see if you have the 1.1 version of the Graph Explorer, if not go ahead and download the newest version and install it.

For the "All Host Graphs" page, can you check the permissions on the graphs that you're not seeing. Just for the sake of debugging, can you run the following to make sure that XI can access the graphs ok.

Code: Select all

chown -R nagios.nagios /usr/local/nagios/share/perfdata
chmod -R 777 /usr/local/nagios/share/perfdata
If that doesn't work, take a look at the end of the following logs, and see if there are any errors in them:

Code: Select all

/usr/local/nagios/var/perfdata.log
/usr/local/nagios/var/npcd.log
lynrob
Posts: 30
Joined: Wed Sep 21, 2011 12:29 pm

Re: Several graphs not working after manual build/install

Post by lynrob »

I checked graph explorer and we are on version 1.0. The updated version of graph explorer appears to be in a "subscriber's only" area of your website. I am not yet a subscriber since our legal dept is still working through the licensing agreement with your sales team. Hopefully the issues will be worked out and we will purchase a license soon.

regarding "all host graphs", I checked the permissions and they all appeared ok but I ran the commands you recommended anyway. No change. I looked at the logs and there are no errors. Here is the last entry in the /usr/local/nagios/var/perfdata.log file:
2011-11-20 16:36:08 [13450] [0] *** TIMEOUT: Timeout after 5 secs. ***
2011-11-20 16:36:08 [13450] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2011-11-20 16:36:08 [13450] [0] *** TIMEOUT: Please check your npcd.cfg
2011-11-20 16:36:08 [13450] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//host-perfdata.1321835758-PID-13450 deleted
2011-11-20 16:36:08 [13450] [0] *** Timeout while processing Host: "cv-hp4001-pwra" Service: "_HOST_"
2011-11-20 16:36:08 [13450] [0] *** process_perfdata.pl terminated on signal ALRM

That was last Friday so I don't think that is the issue. I can see activity in the spool directory as files are added, processed and removed. The npcd log also shows no sign of problems. Here is an excerpt of the end of the log:
[01-23-2012 16:04:55] NPCD: Found 4 files in /usr/local/nagios/var/spool/perfdata/
[01-23-2012 16:04:55] NPCD: DEBUG: load 0.500000/10.000000
[01-23-2012 16:04:55] NPCD: ThreadCounter 0/5 File is .
[01-23-2012 16:04:55] NPCD: DEBUG: load 0.500000/10.000000
[01-23-2012 16:04:55] NPCD: ThreadCounter 0/5 File is ..
[01-23-2012 16:04:55] NPCD: DEBUG: load 0.500000/10.000000
[01-23-2012 16:04:55] NPCD: ThreadCounter 0/5 File is host-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: Regular File: host-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: A thread was started on thread_counter = 0
[01-23-2012 16:04:55] NPCD: DEBUG: load 0.500000/10.000000
[01-23-2012 16:04:55] NPCD: ThreadCounter 1/5 File is service-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: Regular File: service-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: A thread was started on thread_counter = 1
[01-23-2012 16:04:55] NPCD: Processing file host-perfdata.1327363478 with ID 139998720673536 - going to exec /usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//host-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: Have to wait: Filecounter = 2 - thread_counter = 2
[01-23-2012 16:04:55] NPCD: Processing file 'host-perfdata.1327363478'
[01-23-2012 16:04:55] NPCD: Processing file service-perfdata.1327363478 with ID 139998710183680 - going to exec /usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//service-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: Processing file 'service-perfdata.1327363478'
[01-23-2012 16:04:55] NPCD: No more files to process... waiting for 15 seconds

The perfdata files are there, they just don't get displayed in XI. Following is an example of 2 nearly identical switches one displays one does not. Here are the perfdata directory listings:
ll /usr/local/nagios/share/perfdata/cor2-b10*
cor2-b10ilo-sw1:
total 1144
drwxrwxrwx 2 nagios nagios 4096 Jan 23 16:29 ./
drwxrwxrwx 114 nagios nagios 4096 Jan 20 11:14 ../
-rwxrwxrwx 1 nagios nagios 384736 Jan 23 16:29 SSH_Server.rrd*
-rw-rw-rw- 1 nagios nagios 2189 Jan 23 16:29 SSH_Server.xml
-rwxrwxrwx 1 nagios nagios 768008 Jan 23 16:26 _HOST_.rrd*
-rw-rw-rw- 1 nagios nagios 2651 Jan 23 16:26 _HOST_.xml

cor2-b10lync-sw1:
total 1144
drwxrwxrwx 2 nagios nagios 4096 Jan 23 16:28 ./
drwxrwxrwx 114 nagios nagios 4096 Jan 20 11:14 ../
-rwxrwxrwx 1 nagios nagios 384952 Jan 23 16:28 SSH_Server.rrd*
-rw-rw-rw- 1 nagios nagios 2169 Jan 23 16:28 SSH_Server.xml
-rwxrwxrwx 1 nagios nagios 768224 Jan 23 16:26 _HOST_.rrd*
-rw-rw-rw- 1 nagios nagios 2632 Jan 23 16:26 _HOST_.xml

They seem virtually identical but here are how they are displayed in the browser:
The first URL is: http://cor2-nagios.hhven.net/nagiosxi/p ... te=&mode=1
b10ilo-sw1.JPG
The second URL is: http://cor2-nagios.hhven.net/nagiosxi/p ... te=&mode=1
b10lync-sw1.JPG
I hope this helps. Thanks for your continued help!!

Lyn
You do not have the required permissions to view the files attached to this post.
Locked