Page 2 of 4

Re: Capacity Planning

Posted: Tue Feb 10, 2015 10:19 am
by mkosmoski
Nagios XI 2014R2.4

Re: Capacity Planning

Posted: Tue Feb 10, 2015 11:41 am
by jkinning
Nagios XI 2014R2.5

Re: Capacity Planning

Posted: Tue Feb 10, 2015 5:16 pm
by lmiltchev
jkinning, can you click on the Advanced tab and show us a screenshot of the page?

Run the following commands and show us the output:

Code: Select all

ll -d /usr/local/nagios/share/perfdata/
ll -d /usr/local/nagios/share/perfdata/ecmccs01q
ll /usr/local/nagios/share/perfdata/ecmccs01q
service npcd status
ls /usr/local/nagios/var/spool/xidpe | wc -l
ls /usr/local/nagios/var/spool/perfdata | wc -l
ls /usr/local/nagios/var/spool/checkresults | wc -l

Re: Capacity Planning

Posted: Wed Feb 11, 2015 9:16 am
by jkinning
ll -d /usr/local/nagios/share/perfdata/
drwxrwxr-x 81 nagios nagios 4096 Feb 9 13:38 /usr/local/nagios/share/perfdata/

ll -d /usr/local/nagios/share/perfdata/ecmccs01q
drwxrwxr-x 2 nagios nagios 4096 Feb 11 09:12 /usr/local/nagios/share/perfdata/ecmccs01q

ll /usr/local/nagios/share/perfdata/ecmccs01q
total 8708
-rwxrwxr-x 1 nagios nagios 384952 Jan 19 15:14 CPU_Usage.rrd
-rwxrwxr-x 1 nagios nagios 1762 Jan 21 14:01 CPU_Usage.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 19 15:15 Drive_C__Disk_Usage.rrd
-rwxrwxr-x 1 nagios nagios 1912 Jan 21 14:02 Drive_C__Disk_Usage.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 19 15:15 Drive_D__Disk_Usage.rrd
-rwxrwxr-x 1 nagios nagios 1908 Jan 21 14:02 Drive_D__Disk_Usage.xml
-rwxrwxr-x 1 nagios nagios 1534768 Jan 19 15:14 _HOST_.rrd
-rw-rw-r-- 1 nagios nagios 2104 Feb 11 09:09 _HOST_.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 19 15:15 Memory_Usage.rrd
-rwxrwxr-x 1 nagios nagios 1892 Jan 21 14:03 Memory_Usage.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 19 15:16 Page_File_Usage.rrd
-rwxrwxr-x 1 nagios nagios 2013 Jan 21 14:04 Page_File_Usage.xml
-rwxrwxr-x 1 nagios nagios 1534768 Jan 19 15:17 Ping.rrd
-rwxrwxr-x 1 nagios nagios 3162 Jan 21 14:05 Ping.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 19 15:17 Server_Work_Queues.rrd
-rwxrwxr-x 1 nagios nagios 2225 Jan 21 14:05 Server_Work_Queues.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 19 15:15 Uptime.rrd
-rwxrwxr-x 1 nagios nagios 1691 Jan 21 14:02 Uptime.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 21 15:51 Windows_NonProd_C__Drive.rrd
-rw-rw-r-- 1 nagios nagios 1942 Feb 11 09:11 Windows_NonProd_C__Drive.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 21 15:52 Windows_NonProd_CPU.rrd
-rwxrwxr-x 1 nagios nagios 1817 Feb 10 12:55 Windows_NonProd_CPU.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 21 15:52 Windows_NonProd_D__Drive.rrd
-rw-rw-r-- 1 nagios nagios 1942 Feb 11 09:13 Windows_NonProd_D__Drive.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 21 15:53 Windows_NonProd_Memory_Usage.rrd
-rw-rw-r-- 1 nagios nagios 1968 Feb 11 09:12 Windows_NonProd_Memory_Usage.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 21 15:54 Windows_NonProd_Page_File_Usage.rrd
-rw-rw-r-- 1 nagios nagios 2089 Feb 11 09:09 Windows_NonProd_Page_File_Usage.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 21 15:54 Windows_NonProd_RDP.rrd
-rw-rw-r-- 1 nagios nagios 1720 Feb 11 09:11 Windows_NonProd_RDP.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 21 15:55 Windows_NonProd_Server_Work_Queue.rrd
-rw-rw-r-- 1 nagios nagios 2295 Feb 11 09:09 Windows_NonProd_Server_Work_Queue.xml
-rwxrwxr-x 1 nagios nagios 384952 Jan 21 15:53 Windows_NonProd_Uptime.rrd
-rwxrwxr-x 1 nagios nagios 1771 Feb 9 14:19 Windows_NonProd_Uptime.xml

service npcd status
NPCD running (pid 2019).

ls /usr/local/nagios/var/spool/xidpe | wc -l
0

ls /usr/local/nagios/var/spool/perfdata | wc -l
2

ls /usr/local/nagios/var/spool/checkresults | wc -l
0

Re: Capacity Planning

Posted: Wed Feb 11, 2015 10:14 am
by scottwilkerson
Can you also post the output of the following commands

Code: Select all

tail -50 /usr/local/nagios/var/npcd.log
tail -50 /usr/local/nagios/var/perfdata.log
It appears that your rrd file are not getting updated, which should be giving an error in one of these logs

Re: Capacity Planning

Posted: Wed Feb 11, 2015 11:15 am
by jkinning

Code: Select all

tail -50 /usr/local/nagios/var/npcd.log 
[02-04-2015 12:10:48] NPCD: ERROR: Executed command exits with return code '1'
[02-04-2015 12:10:48] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423069827.perfdata.service'
[02-04-2015 12:10:49] NPCD: ERROR: Executed command exits with return code '1'
[02-04-2015 12:10:49] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423069827.perfdata.host'
[02-06-2015 04:58:20] NPCD: ERROR: Executed command exits with return code '1'
[02-06-2015 04:58:20] NPCD: ERROR: Executed command exits with return code '1'
[02-06-2015 04:58:20] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423216685.perfdata.host'
[02-06-2015 04:58:20] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423216685.perfdata.service'
[02-06-2015 13:30:00] NPCD: ERROR: Executed command exits with return code '1'
[02-06-2015 13:30:00] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423247375.perfdata.service'
[02-06-2015 13:30:00] NPCD: ERROR: Executed command exits with return code '1'
[02-06-2015 13:30:00] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423247375.perfdata.host'
[02-06-2015 13:30:00] NPCD: ERROR: Executed command exits with return code '1'
[02-06-2015 13:30:00] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423247390.perfdata.host'
[02-06-2015 13:30:00] NPCD: ERROR: Executed command exits with return code '1'
[02-06-2015 13:30:00] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423247391.perfdata.service'
[02-08-2015 16:57:09] NPCD: ERROR: Executed command exits with return code '1'
[02-08-2015 16:57:09] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423432610.perfdata.service'
[02-09-2015 16:02:46] NPCD: ERROR: Executed command exits with return code '1'
[02-09-2015 16:02:46] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423515746.perfdata.service'
[02-09-2015 16:02:46] NPCD: ERROR: Executed command exits with return code '1'
[02-09-2015 16:02:46] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423515746.perfdata.host'
[02-09-2015 21:53:10] NPCD: ERROR: Executed command exits with return code '1'
[02-09-2015 21:53:10] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423536761.perfdata.host'
[02-09-2015 21:53:10] NPCD: ERROR: Executed command exits with return code '1'
[02-09-2015 21:53:10] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423536761.perfdata.service'
[02-09-2015 21:53:10] NPCD: ERROR: Executed command exits with return code '1'
[02-09-2015 21:53:10] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423536776.perfdata.host'
[02-09-2015 21:53:10] NPCD: ERROR: Executed command exits with return code '1'
[02-09-2015 21:53:10] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423536776.perfdata.service'
[02-10-2015 07:32:17] NPCD: ERROR: Executed command exits with return code '1'
[02-10-2015 07:32:17] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423571501.perfdata.service'
[02-10-2015 07:32:17] NPCD: ERROR: Executed command exits with return code '1'
[02-10-2015 07:32:17] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423571501.perfdata.host'
[02-10-2015 07:32:17] NPCD: ERROR: Executed command exits with return code '1'
[02-10-2015 07:32:17] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423571516.perfdata.host'
[02-10-2015 07:32:17] NPCD: ERROR: Executed command exits with return code '1'
[02-10-2015 07:32:17] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423571516.perfdata.service'
[02-10-2015 08:44:15] NPCD: ERROR: Executed command exits with return code '1'
[02-10-2015 08:44:15] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423575836.perfdata.service'
[02-10-2015 08:44:15] NPCD: ERROR: Executed command exits with return code '1'
[02-10-2015 08:44:15] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423575836.perfdata.host'
[02-10-2015 11:41:41] NPCD: ERROR: Executed command exits with return code '1'
[02-10-2015 11:41:41] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423586486.perfdata.host'
[02-10-2015 11:41:41] NPCD: ERROR: Executed command exits with return code '1'
[02-10-2015 11:41:41] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423586471.perfdata.service'
[02-10-2015 14:03:14] NPCD: ERROR: Executed command exits with return code '1'
[02-10-2015 14:03:14] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423594968.perfdata.service'
[02-10-2015 14:03:14] NPCD: ERROR: Executed command exits with return code '1'
[02-10-2015 14:03:14] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1423594968.perfdata.host'

Code: Select all

tail -50 /usr/local/nagios/var/perfdata.log 
2015-02-10 07:32:17 [46066] [0] *** TIMEOUT: Timeout after 5 Sec. ****
2015-02-10 07:32:17 [46064] [0] *** TIMEOUT: Timeout after 5 Sec. ****
2015-02-10 07:32:17 [46062] [0] *** TIMEOUT: Timeout after 5 Sec. ****
2015-02-10 07:32:17 [46066] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2015-02-10 07:32:17 [46064] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2015-02-10 07:32:17 [46066] [0] *** TIMEOUT: Please check your process_perfdata.cfg
2015-02-10 07:32:17 [46064] [0] *** TIMEOUT: Please check your process_perfdata.cfg
2015-02-10 07:32:17 [46062] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2015-02-10 07:32:17 [46062] [0] *** TIMEOUT: Please check your process_perfdata.cfg
2015-02-10 07:32:17 [46063] [0] *** TIMEOUT: Timeout after 5 Sec. ****
2015-02-10 07:32:17 [46063] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2015-02-10 07:32:17 [46063] [0] *** TIMEOUT: Please check your process_perfdata.cfg
2015-02-10 07:32:17 [46062] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1423571501.perfdata.host-PID-46062 deleted
2015-02-10 07:32:17 [46063] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1423571501.perfdata.service-PID-46063 deleted
2015-02-10 07:32:17 [46062] [0] *** process_perfdata.pl terminated on signal ALRM
2015-02-10 07:32:17 [46063] [0] *** process_perfdata.pl terminated on signal ALRM
2015-02-10 07:32:17 [46064] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1423571516.perfdata.host-PID-46064 deleted
2015-02-10 07:32:17 [46064] [0] *** process_perfdata.pl terminated on signal ALRM
2015-02-10 07:32:17 [46066] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1423571516.perfdata.service-PID-46066 deleted
2015-02-10 07:32:17 [46066] [0] *** process_perfdata.pl terminated on signal ALRM
2015-02-10 08:44:15 [13869] [0] *** TIMEOUT: Timeout after 5 Sec. ****
2015-02-10 08:44:15 [13868] [0] *** TIMEOUT: Timeout after 5 Sec. ****
2015-02-10 08:44:15 [13869] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2015-02-10 08:44:15 [13868] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2015-02-10 08:44:15 [13869] [0] *** TIMEOUT: Please check your process_perfdata.cfg
2015-02-10 08:44:15 [13868] [0] *** TIMEOUT: Please check your process_perfdata.cfg
2015-02-10 08:44:15 [13869] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1423575836.perfdata.service-PID-13869 deleted
2015-02-10 08:44:15 [13868] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1423575836.perfdata.host-PID-13868 deleted
2015-02-10 08:44:15 [13868] [0] *** process_perfdata.pl terminated on signal ALRM
2015-02-10 08:44:15 [13869] [0] *** process_perfdata.pl terminated on signal ALRM
2015-02-10 11:41:41 [29564] [0] *** TIMEOUT: Timeout after 5 Sec. ****
2015-02-10 11:41:41 [29564] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2015-02-10 11:41:41 [29564] [0] *** TIMEOUT: Please check your process_perfdata.cfg
2015-02-10 11:41:41 [29562] [0] *** TIMEOUT: Timeout after 5 Sec. ****
2015-02-10 11:41:41 [29564] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1423586486.perfdata.host-PID-29564 deleted
2015-02-10 11:41:41 [29562] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2015-02-10 11:41:41 [29564] [0] *** process_perfdata.pl terminated on signal ALRM
2015-02-10 11:41:41 [29562] [0] *** TIMEOUT: Please check your process_perfdata.cfg
2015-02-10 11:41:41 [29562] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1423586471.perfdata.service-PID-29562 deleted
2015-02-10 11:41:41 [29562] [0] *** process_perfdata.pl terminated on signal ALRM
2015-02-10 14:03:14 [31988] [0] *** TIMEOUT: Timeout after 5 Sec. ****
2015-02-10 14:03:14 [31989] [0] *** TIMEOUT: Timeout after 5 Sec. ****
2015-02-10 14:03:14 [31988] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2015-02-10 14:03:14 [31989] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2015-02-10 14:03:14 [31988] [0] *** TIMEOUT: Please check your process_perfdata.cfg
2015-02-10 14:03:14 [31989] [0] *** TIMEOUT: Please check your process_perfdata.cfg
2015-02-10 14:03:14 [31988] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1423594968.perfdata.host-PID-31988 deleted
2015-02-10 14:03:14 [31989] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1423594968.perfdata.service-PID-31989 deleted
2015-02-10 14:03:14 [31988] [0] *** process_perfdata.pl terminated on signal ALRM
2015-02-10 14:03:14 [31989] [0] *** process_perfdata.pl terminated on signal ALRM

Re: Capacity Planning

Posted: Wed Feb 11, 2015 11:46 am
by abrist
You are hitting the perfdata timeout. Please follow the FAQ below to increase the timeout value:
http://support.nagios.com/wiki/index.ph ... ta_Timeout

Re: Capacity Planning

Posted: Wed Feb 11, 2015 1:18 pm
by mkosmoski
How about my issue? :)

Re: Capacity Planning

Posted: Wed Feb 11, 2015 1:42 pm
by tmcdonald
mkosmoski wrote:How about my issue? :)
Can you run the same commands we asked of jkinning and post the results here?

Re: Capacity Planning

Posted: Wed Feb 11, 2015 3:28 pm
by jkinning
I"ve increased /usr/local/nagios/etc/pnp/process_perfdata.cfg TIMEOUT = 60, is there a max? and I still don't have any performance data. The Capacity Planning tab still just says NO data to display: Out of range float values are not JSON compliant: nan