Page 3 of 3

Re: Several graphs not working after manual build/install

Posted: Tue Jan 17, 2012 11:46 am
by scottwilkerson
lynrob wrote:Hi Scott,
Sorry for the late reply I didn't have email notifications turned on like I thought I did... Those fields are all blank in the modify hosts screen. That is because the icon is inherited from a template. I tried setting the fields directly in the modify host config but it didn't help:
icon image: polycom.com
icon image alt text: polycom
status image: polycom.com
We need to update the documentation, the images need to also be copied to the following directory

Code: Select all

/usr/local/nagiosxi/html/includes/components/nagioscore/ui/images/logos
lynrob wrote: Back to a previous topic, any thoughts as to why for host graphs and graph explorer the only graphs displayed are for those nodes added after I restored NagiosXI on the new server? All nodes that were covered from the back up display no graphs.

Thanks again for all of your help.
Off the top of my head I don't have an answer for this, but for clarity, what method did you use to make and restore the backup?

Re: Several graphs not working after manual build/install

Posted: Tue Jan 17, 2012 5:25 pm
by lynrob
Ok, copying the icon file to the new directory worked!! Another problem solved.

The method I used to backup and restore is the one listed in the NagiosXI documentation using the backup_xi and restore_xi scripts. I checked the /usr/local/nagios/share/perfdata directory to see if the older graphs were being generated and they are... so the graphs exist but they are not displayed for some reason. I compared two similar nodes one added before the backup/restore and one added after. Both have the exact same perf data files updated periodically as one would expect. The only difference is graphs for node that was restored are not displayed. Hopefully that gives you something to go on. Thanks again for all of your help.

Re: Several graphs not working after manual build/install

Posted: Thu Jan 19, 2012 7:03 pm
by lynrob
Hi Scott, FWIW, I just completely deleted one of the old nodes and then added it back in. Unfortunately, this node still does not have its graphs displayed on the "host graphs" page. So newly added nodes work fine but deleted and re-added nodes fails to display.

Re: Several graphs not working after manual build/install

Posted: Fri Jan 20, 2012 5:19 pm
by scottwilkerson
I'm going to have a senior developer take a look to see if he may know what could be causing this.

Re: Several graphs not working after manual build/install

Posted: Mon Jan 23, 2012 10:20 am
by mguthrie
For the Graph Explorer, there was a bug in the initial version that prevented some graphs from showing up. Check the Admin->Manage Components page and see if you have the 1.1 version of the Graph Explorer, if not go ahead and download the newest version and install it.

For the "All Host Graphs" page, can you check the permissions on the graphs that you're not seeing. Just for the sake of debugging, can you run the following to make sure that XI can access the graphs ok.

Code: Select all

chown -R nagios.nagios /usr/local/nagios/share/perfdata
chmod -R 777 /usr/local/nagios/share/perfdata
If that doesn't work, take a look at the end of the following logs, and see if there are any errors in them:

Code: Select all

/usr/local/nagios/var/perfdata.log
/usr/local/nagios/var/npcd.log

Re: Several graphs not working after manual build/install

Posted: Mon Jan 23, 2012 7:42 pm
by lynrob
I checked graph explorer and we are on version 1.0. The updated version of graph explorer appears to be in a "subscriber's only" area of your website. I am not yet a subscriber since our legal dept is still working through the licensing agreement with your sales team. Hopefully the issues will be worked out and we will purchase a license soon.

regarding "all host graphs", I checked the permissions and they all appeared ok but I ran the commands you recommended anyway. No change. I looked at the logs and there are no errors. Here is the last entry in the /usr/local/nagios/var/perfdata.log file:
2011-11-20 16:36:08 [13450] [0] *** TIMEOUT: Timeout after 5 secs. ***
2011-11-20 16:36:08 [13450] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2011-11-20 16:36:08 [13450] [0] *** TIMEOUT: Please check your npcd.cfg
2011-11-20 16:36:08 [13450] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//host-perfdata.1321835758-PID-13450 deleted
2011-11-20 16:36:08 [13450] [0] *** Timeout while processing Host: "cv-hp4001-pwra" Service: "_HOST_"
2011-11-20 16:36:08 [13450] [0] *** process_perfdata.pl terminated on signal ALRM

That was last Friday so I don't think that is the issue. I can see activity in the spool directory as files are added, processed and removed. The npcd log also shows no sign of problems. Here is an excerpt of the end of the log:
[01-23-2012 16:04:55] NPCD: Found 4 files in /usr/local/nagios/var/spool/perfdata/
[01-23-2012 16:04:55] NPCD: DEBUG: load 0.500000/10.000000
[01-23-2012 16:04:55] NPCD: ThreadCounter 0/5 File is .
[01-23-2012 16:04:55] NPCD: DEBUG: load 0.500000/10.000000
[01-23-2012 16:04:55] NPCD: ThreadCounter 0/5 File is ..
[01-23-2012 16:04:55] NPCD: DEBUG: load 0.500000/10.000000
[01-23-2012 16:04:55] NPCD: ThreadCounter 0/5 File is host-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: Regular File: host-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: A thread was started on thread_counter = 0
[01-23-2012 16:04:55] NPCD: DEBUG: load 0.500000/10.000000
[01-23-2012 16:04:55] NPCD: ThreadCounter 1/5 File is service-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: Regular File: service-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: A thread was started on thread_counter = 1
[01-23-2012 16:04:55] NPCD: Processing file host-perfdata.1327363478 with ID 139998720673536 - going to exec /usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//host-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: Have to wait: Filecounter = 2 - thread_counter = 2
[01-23-2012 16:04:55] NPCD: Processing file 'host-perfdata.1327363478'
[01-23-2012 16:04:55] NPCD: Processing file service-perfdata.1327363478 with ID 139998710183680 - going to exec /usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//service-perfdata.1327363478
[01-23-2012 16:04:55] NPCD: Processing file 'service-perfdata.1327363478'
[01-23-2012 16:04:55] NPCD: No more files to process... waiting for 15 seconds

The perfdata files are there, they just don't get displayed in XI. Following is an example of 2 nearly identical switches one displays one does not. Here are the perfdata directory listings:
ll /usr/local/nagios/share/perfdata/cor2-b10*
cor2-b10ilo-sw1:
total 1144
drwxrwxrwx 2 nagios nagios 4096 Jan 23 16:29 ./
drwxrwxrwx 114 nagios nagios 4096 Jan 20 11:14 ../
-rwxrwxrwx 1 nagios nagios 384736 Jan 23 16:29 SSH_Server.rrd*
-rw-rw-rw- 1 nagios nagios 2189 Jan 23 16:29 SSH_Server.xml
-rwxrwxrwx 1 nagios nagios 768008 Jan 23 16:26 _HOST_.rrd*
-rw-rw-rw- 1 nagios nagios 2651 Jan 23 16:26 _HOST_.xml

cor2-b10lync-sw1:
total 1144
drwxrwxrwx 2 nagios nagios 4096 Jan 23 16:28 ./
drwxrwxrwx 114 nagios nagios 4096 Jan 20 11:14 ../
-rwxrwxrwx 1 nagios nagios 384952 Jan 23 16:28 SSH_Server.rrd*
-rw-rw-rw- 1 nagios nagios 2169 Jan 23 16:28 SSH_Server.xml
-rwxrwxrwx 1 nagios nagios 768224 Jan 23 16:26 _HOST_.rrd*
-rw-rw-rw- 1 nagios nagios 2632 Jan 23 16:26 _HOST_.xml

They seem virtually identical but here are how they are displayed in the browser:
The first URL is: http://cor2-nagios.hhven.net/nagiosxi/p ... te=&mode=1
b10ilo-sw1.JPG
The second URL is: http://cor2-nagios.hhven.net/nagiosxi/p ... te=&mode=1
b10lync-sw1.JPG
I hope this helps. Thanks for your continued help!!

Lyn