availability reports slow

perric
Posts: 161
Joined: Fri Mar 28, 2014 10:37 am

availability reports slow

Post by perric »

Moderator Edit: This thread has been split from another - https://support.nagios.com/forum/viewto ... =6&t=35886
In the future, please create a new thread and link to the old one instead of adding on.


I am having a similar issue in my environment: the availability report takes about 5 minutes to generate, which is especially frustrating because I need to run it with non-default parameters. I figured I would post here since I don't see what the eventual fix was for the other users experiencing the issue.

Nagios version: 2014 R2.7 ... in process of being upgraded to 5.3.0 :)
Nagios db: not offloaded
Specs: 4 vcpu 8 GB RAM 200GB HD
Availability report contains approx 20-50 hosts for a time period of prior month

Code: Select all

Nagios Stats 4.0.8
Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org)
Last Modified: 08-12-2014
License: GPL

CURRENT STATUS DATA
------------------------------------------------------
Status File:                            /usr/local/nagios/var/status.dat
Status File Age:                        0d 0h 0m 3s
Status File Version:                    4.0.8

Program Running Time:                   0d 0h 15m 23s
Nagios PID:                             579

Total Services:                         4531
Services Checked:                       4531
Services Scheduled:                     4531
Services Actively Checked:              4531
Services Passively Checked:             0
Total Service State Change:             0.000 / 17.500 / 0.054 %
Active Service Latency:                 0.000 / 0.141 / 0.001 sec
Active Service Execution Time:          0.001 / 70.007 / 0.805 sec
Active Service State Change:            0.000 / 17.500 / 0.054 %
Active Services Last 1/5/15/60 min:     265 / 3986 / 4171 / 4530
Passive Service Latency:                0.000 / 0.000 / 0.000 sec
Passive Service State Change:           0.000 / 0.000 / 0.000 %
Passive Services Last 1/5/15/60 min:    0 / 0 / 0 / 0
Services Ok/Warn/Unk/Crit:              4404 / 34 / 26 / 67
Services Flapping:                      2
Services In Downtime:                   58

Total Hosts:                            1076
Hosts Checked:                          1075
Hosts Scheduled:                        1076
Hosts Actively Checked:                 1076
Host Passively Checked:                 0
Total Host State Change:                0.000 / 8.820 / 0.017 %
Active Host Latency:                    0.000 / 0.087 / 0.000 sec
Active Host Execution Time:             0.000 / 10.032 / 0.364 sec
Active Host State Change:               0.000 / 8.820 / 0.017 %
Active Hosts Last 1/5/15/60 min:        641 / 1073 / 1074 / 1074
Passive Host Latency:                   0.000 / 0.000 / 0.000 sec
Passive Host State Change:              0.000 / 0.000 / 0.000 %
Passive Hosts Last 1/5/15/60 min:       0 / 0 / 0 / 0
Hosts Up/Down/Unreach:                  1061 / 15 / 0
Hosts Flapping:                         0
Hosts In Downtime:                      5

Active Host Checks Last 1/5/15 min:     713 / 3436 / 10176
   Scheduled:                           712 / 3410 / 10100
   On-demand:                           1 / 26 / 76
   Parallel:                            712 / 3410 / 10100
   Serial:                              0 / 0 / 0
   Cached:                              1 / 26 / 76
Passive Host Checks Last 1/5/15 min:    0 / 0 / 0
Active Service Checks Last 1/5/15 min:  298 / 4054 / 12478
   Scheduled:                           298 / 4054 / 12478
   On-demand:                           0 / 0 / 0
   Cached:                              0 / 0 / 0
Passive Service Checks Last 1/5/15 min: 0 / 0 / 0

External Commands Last 1/5/15 min:      0 / 0 / 2

Code: Select all

[coadminsj@cawlkl21 ~]$ ls -l /usr/local/nagios/var/nagios.log
-rw-r--r-- 1 nagios nagios 2742481 Nov  1 13:33 /usr/local/nagios/var/nagios.log
[coadminsj@cawlkl21 ~]$ du -h /usr/local/nagios/var/archives/
135M    /usr/local/nagios/var/archives/
[coadminsj@cawlkl21 ~]$ ls /usr/local/nagios/var/archives/|wc -l
6

Code: Select all

top | head -17
top - 13:40:35 up 79 days,  4:50,  1 user,  load average: 2.04, 2.53, 2.29
Tasks: 274 total,   1 running, 273 sleeping,   0 stopped,   0 zombie
Cpu(s):  7.8%us,  2.8%sy,  0.0%ni, 88.2%id,  1.0%wa,  0.0%hi,  0.2%si,  0.0%st
Mem:   8177336k total,  7208852k used,   968484k free,   405612k buffers
Swap: 16777212k total,   361748k used, 16415464k free,  3613960k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
  583 nagios    20   0 10072  996  664 S  1.9  0.0   0:01.31 nagios
  585 nagios    20   0 10072  992  668 S  1.9  0.0   0:01.22 nagios
    1 root      20   0 21500 1204  976 S  0.0  0.0   0:16.88 init
    2 root      20   0     0    0    0 S  0.0  0.0   0:01.52 kthreadd
    3 root      20   0     0    0    0 S  0.0  0.0   1:48.81 ksoftirqd/0
    5 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 kworker/0:0H
    7 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 kworker/u:0H
    8 root      RT   0     0    0    0 S  0.0  0.0   5:49.32 migration/0
    9 root      20   0     0    0    0 S  0.0  0.0   0:00.00 rcu_bh
   10 root      20   0     0    0    0 S  0.0  0.0 165:17.04 
avandemore
Posts: 1597
Joined: Tue Sep 27, 2016 4:57 pm

Re: availability reports slow

Post by avandemore »

Can you monitor top while an availability report is being generated? Please use this command and paste the output:

Code: Select all

# SECONDS=0; while [ ${SECONDS} -lt '301' ]; do top -bcn1|head -n20; sleep 30; done
Also, what is the output of this while an availability report is being generated:

Code: Select all

for file in /proc/*/status ; do awk '/VmSwap|Name/{printf $2 " " $3}END{ print ""}' $file; done | sort -nrk 2
and the output of this:

Code: Select all

df -h
Previous Nagios employee
perric
Posts: 161
Joined: Fri Mar 28, 2014 10:37 am

Re: availability reports slow

Post by perric »

Here you go - didn't quite run 5 minutes so I stopped the top once the report finished.

Code: Select all

[coadminsj@cawlkl21 ~]$ SECONDS=0; while [ ${SECONDS} -lt '301' ]; do top -bcn1|head -n20; sleep 30; done
top - 16:15:54 up 79 days,  7:26,  2 users,  load average: 1.95, 2.11, 2.07
Tasks: 284 total,   2 running, 282 sleeping,   0 stopped,   0 zombie
Cpu(s):  7.8%us,  2.8%sy,  0.0%ni, 88.2%id,  1.0%wa,  0.0%hi,  0.2%si,  0.0%st
Mem:   8177336k total,  6319656k used,  1857680k free,   108200k buffers
Swap: 16777212k total,   369360k used, 16407852k free,  4242312k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
28139 apache    20   0  239m  85m  17m R 95.5  1.1   0:04.47 /usr/local/nagios/sbin/avail.cgi
  579 nagios    20   0 43536  14m 1560 S 19.5  0.2   2:28.52 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
 3135 mysql     20   0 2202m  49m 3856 S  7.8  0.6   5486:36 /usr/libexec/mysqld --basedir=/usr --datadir=/var/lib/mysql --user=mysql --log-error=/var/log/mysqld.log --
  593 nagios    20   0 54844 3988  928 S  3.9  0.0   1:19.50 /usr/local/nagios/bin/ndo2db -c /usr/local/nagios/etc/ndo2db.cfg
28200 root      20   0  201m 3212 2304 S  1.9  0.0   0:00.03 winbindd
28224 nagios    20   0  131m 2684 2188 S  1.9  0.0   0:00.01 /usr/local/nagios/libexec/check_http -H 10.100.103.147 -u /retailPriceCatalogue/help.jsp -S -a WEBAPP01:new
    1 root      20   0 21500 1204  976 S  0.0  0.0   0:16.88 /sbin/init
    2 root      20   0     0    0    0 S  0.0  0.0   0:01.52 [kthreadd]
    3 root      20   0     0    0    0 S  0.0  0.0   1:49.04 [ksoftirqd/0]
    5 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 [kworker/0:0H]
    7 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 [kworker/u:0H]
    8 root      RT   0     0    0    0 S  0.0  0.0   5:49.74 [migration/0]
    9 root      20   0     0    0    0 S  0.0  0.0   0:00.00 [rcu_bh]
top - 16:16:24 up 79 days,  7:26,  2 users,  load average: 3.04, 2.35, 2.16
Tasks: 287 total,   2 running, 285 sleeping,   0 stopped,   0 zombie
Cpu(s):  7.8%us,  2.8%sy,  0.0%ni, 88.2%id,  1.0%wa,  0.0%hi,  0.2%si,  0.0%st
Mem:   8177336k total,  6440764k used,  1736572k free,   108336k buffers
Swap: 16777212k total,   369360k used, 16407852k free,  4242624k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
28139 apache    20   0  343m 213m  40m R 99.8  2.7   0:31.14 /usr/local/nagios/sbin/avail.cgi
29258 coadmins  20   0 19476 1376  908 R  2.0  0.0   0:00.01 top -bcn1
    1 root      20   0 21500 1204  976 S  0.0  0.0   0:16.88 /sbin/init
    2 root      20   0     0    0    0 S  0.0  0.0   0:01.52 [kthreadd]
    3 root      20   0     0    0    0 S  0.0  0.0   1:49.04 [ksoftirqd/0]
    5 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 [kworker/0:0H]
    7 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 [kworker/u:0H]
    8 root      RT   0     0    0    0 S  0.0  0.0   5:49.75 [migration/0]
    9 root      20   0     0    0    0 S  0.0  0.0   0:00.00 [rcu_bh]
   10 root      20   0     0    0    0 S  0.0  0.0 165:36.02 [rcu_sched]
   11 root      RT   0     0    0    0 S  0.0  0.0   0:44.45 [watchdog/0]
   12 root      RT   0     0    0    0 S  0.0  0.0   4:10.81 [watchdog/1]
   13 root      20   0     0    0    0 S  0.0  0.0   9:01.82 [ksoftirqd/1]
top - 16:16:55 up 79 days,  7:27,  2 users,  load average: 2.14, 2.21, 2.11
Tasks: 285 total,   2 running, 283 sleeping,   0 stopped,   0 zombie
Cpu(s):  7.8%us,  2.8%sy,  0.0%ni, 88.2%id,  1.0%wa,  0.0%hi,  0.2%si,  0.0%st
Mem:   8177336k total,  6545564k used,  1631772k free,   108516k buffers
Swap: 16777212k total,   369360k used, 16407852k free,  4242984k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
28139 apache    20   0  436m 308m  19m R 97.7  3.9   1:01.50 /usr/local/nagios/sbin/avail.cgi
 2174 root      20   0  175m 2344 1856 S  2.0  0.0  75:17.60 /usr/sbin/vmtoolsd
 3135 mysql     20   0 2202m  50m 3856 S  2.0  0.6   5486:40 /usr/libexec/mysqld --basedir=/usr --datadir=/var/lib/mysql --user=mysql --log-error=/var/log/mysqld.log --
30497 coadmins  20   0 19476 1376  908 R  2.0  0.0   0:00.02 top -bcn1
    1 root      20   0 21500 1204  976 S  0.0  0.0   0:16.88 /sbin/init
    2 root      20   0     0    0    0 S  0.0  0.0   0:01.52 [kthreadd]
    3 root      20   0     0    0    0 S  0.0  0.0   1:49.04 [ksoftirqd/0]
    5 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 [kworker/0:0H]
    7 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 [kworker/u:0H]
    8 root      RT   0     0    0    0 S  0.0  0.0   5:49.75 [migration/0]
    9 root      20   0     0    0    0 S  0.0  0.0   0:00.00 [rcu_bh]
   10 root      20   0     0    0    0 S  0.0  0.0 165:36.07 [rcu_sched]
   11 root      RT   0     0    0    0 S  0.0  0.0   0:44.45 [watchdog/0]
top - 16:17:25 up 79 days,  7:27,  2 users,  load average: 4.76, 2.83, 2.32
Tasks: 282 total,   2 running, 280 sleeping,   0 stopped,   0 zombie
Cpu(s):  7.8%us,  2.8%sy,  0.0%ni, 88.2%id,  1.0%wa,  0.0%hi,  0.2%si,  0.0%st
Mem:   8177336k total,  6669284k used,  1508052k free,   108672k buffers
Swap: 16777212k total,   369360k used, 16407852k free,  4243864k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
28139 apache    20   0  520m 414m  41m R 100.0  5.2   1:25.27 /usr/local/nagios/sbin/avail.cgi
  593 nagios    20   0 54844 3988  928 S  2.0  0.0   1:20.30 /usr/local/nagios/bin/ndo2db -c /usr/local/nagios/etc/ndo2db.cfg
 2133 root      20   0  199m 3484 2580 S  2.0  0.0   0:00.02 winbindd
 3135 mysql     20   0 2202m  50m 3856 S  2.0  0.6   5486:44 /usr/libexec/mysqld --basedir=/usr --datadir=/var/lib/mysql --user=mysql --log-error=/var/log/mysqld.log --
    1 root      20   0 21500 1204  976 S  0.0  0.0   0:16.88 /sbin/init
    2 root      20   0     0    0    0 S  0.0  0.0   0:01.52 [kthreadd]
    3 root      20   0     0    0    0 S  0.0  0.0   1:49.04 [ksoftirqd/0]
    5 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 [kworker/0:0H]
    7 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 [kworker/u:0H]
    8 root      RT   0     0    0    0 S  0.0  0.0   5:49.75 [migration/0]
    9 root      20   0     0    0    0 S  0.0  0.0   0:00.00 [rcu_bh]
   10 root      20   0     0    0    0 S  0.0  0.0 165:36.13 [rcu_sched]
   11 root      RT   0     0    0    0 S  0.0  0.0   0:44.45 [watchdog/0]
top - 16:17:56 up 79 days,  7:28,  2 users,  load average: 3.53, 2.72, 2.30
Tasks: 280 total,   1 running, 279 sleeping,   0 stopped,   0 zombie
Cpu(s):  7.8%us,  2.8%sy,  0.0%ni, 88.2%id,  1.0%wa,  0.0%hi,  0.2%si,  0.0%st
Mem:   8177336k total,  6227204k used,  1950132k free,   108792k buffers
Swap: 16777212k total,   369360k used, 16407852k free,  4243856k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
27853 apache    20   0  493m  48m 6016 S 25.2  0.6   1:55.75 /usr/sbin/httpd
15735 apache    20   0  549m 104m 6388 S 23.3  1.3  11:12.65 /usr/sbin/httpd
  586 nagios    20   0 10072  988  668 S  1.9  0.0   0:14.52 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
  590 nagios    20   0 51984 1084  768 S  1.9  0.0   0:11.37 /usr/local/nagios/bin/ndo2db -c /usr/local/nagios/etc/ndo2db.cfg
 3076 nagios    20   0  106m  644  528 S  1.9  0.0   0:00.01 /usr/local/nagios/libexec/check_icmp -H 10.69.64.101 -w 3000.0 80  -c 5000.0 100  -p 5
 3135 mysql     20   0 2202m  49m 3856 S  1.9  0.6   5486:45 /usr/libexec/mysqld --basedir=/usr --datadir=/var/lib/mysql --user=mysql --log-error=/var/log/mysqld.log --
 3202 postgres  20   0  211m 1236  820 S  1.9  0.0   5:19.96 postgres: autovacuum launcher process
 3374 coadmins  20   0 19476 1364  908 R  1.9  0.0   0:00.01 top -bcn1
30713 postgres  20   0  212m 6064 4440 S  1.9  0.1   0:00.03 postgres: nagiosxi nagiosxi ::1(32957) idle
    1 root      20   0 21500 1204  976 S  0.0  0.0   0:16.88 /sbin/init
    2 root      20   0     0    0    0 S  0.0  0.0   0:01.52 [kthreadd]
    3 root      20   0     0    0    0 S  0.0  0.0   1:49.04 [ksoftirqd/0]
    5 root       0 -20     0    0    0 S  0.0  0.0   0:00.00 [kworker/0:0H]

Code: Select all

[root@cawlkl21 coadminsj]# for file in /proc/*/status ; do awk '/VmSwap|Name/{printf $2 " " $3}END{ print ""}' $file; done | sort -nrk 2
snmptt 204356 kB
NetworkManager 34508 kB
httpd 24468 kB
httpd 24460 kB
httpd 24452 kB
httpd 24448 kB
httpd 24448 kB
httpd 24448 kB
httpd 24448 kB
httpd 24448 kB
httpd 24448 kB
httpd 24448 kB
httpd 24444 kB
httpd 24444 kB
httpd 24436 kB
httpd 24424 kB
httpd 24364 kB
httpd 24364 kB
httpd 24364 kB
httpd 24364 kB
httpd 24352 kB
httpd 24300 kB
httpd 24300 kB
httpd 24288 kB
httpd 24288 kB
snmptt 19140 kB
mysqld 11396 kB
gnome-settings- 9604 kB
Xorg 8740 kB
python 5556 kB
gdm-simple-gree 3108 kB
plymouth-log-vi 2992 kB
gconfd-2 2836 kB
polkitd 2764 kB
metacity 2280 kB
puppet 2176 kB
python 2108 kB
python 2108 kB
python 2068 kB
gnome-power-man 1840 kB
nsrexecd 1676 kB
sendmail 1668 kB
polkit-gnome-au 1648 kB
gnome-session 1436 kB
sendmail 1352 kB
snmptrapd 1300 kB
console-kit-dae 1088 kB
winbindd 916 kB
winbindd 912 kB
cupsd 888 kB
winbindd 856 kB
winbindd 840 kB
winbindd 780 kB
winbindd 780 kB
winbindd 780 kB
bonobo-activati 772 kB
winbindd 760 kB
winbindd 760 kB
winbindd 752 kB
winbindd 752 kB
winbindd 728 kB
udevd 696 kB
winbindd 680 kB
postmaster 648 kB
pulseaudio 644 kB
gdm-simple-slav 636 kB
postmaster 632 kB
postmaster 620 kB
postmaster 616 kB
at-spi-registry 612 kB
winbindd 592 kB
vsftpd 592 kB
sshd 592 kB
winbindd 588 kB
winbindd 588 kB
postmaster 584 kB
rpc.mountd 576 kB
winbindd 568 kB
postmaster 556 kB
udevd 552 kB
udevd 552 kB
certmonger 540 kB
postmaster 512 kB
postmaster 512 kB
postmaster 512 kB
postmaster 512 kB
abrtd 512 kB
winbindd 508 kB
postmaster 500 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
postmaster 488 kB
crond 484 kB
abrt-dump-oops 468 kB
winbindd 464 kB
rpc.statd 456 kB
ntpd 448 kB
rsyslogd 420 kB
dbus-daemon 400 kB
crond 396 kB
crond 396 kB
crond 396 kB
crond 396 kB
crond 396 kB
vmtoolsd 376 kB
hald 376 kB
automount 372 kB
gdm-session-wor 360 kB
devkit-power-da 316 kB
ndo2db 292 kB
gvfsd 284 kB
wpa_supplicant 276 kB
dbus-launch 268 kB
modem-manager 264 kB
mysqld_safe 232 kB
rpc.idmapd 212 kB
hald-runner 208 kB
irqbalance 200 kB
oddjobd 188 kB
ndo2db 180 kB
ndo2db 176 kB
atd 168 kB
rtkit-daemon 164 kB
dbus-daemon 164 kB
hald-addon-inpu 160 kB
xinetd 144 kB
rpc.rquotad 144 kB
hald-addon-acpi 144 kB
auditd 128 kB
gdm-binary 124 kB
mcelog 108 kB
acpid 108 kB
init 104 kB
rpcbind 100 kB
mingetty 72 kB
mingetty 72 kB
mingetty 72 kB
mingetty 68 kB
mingetty 68 kB
rhnsd 32 kB
npcd 20 kB
watchdog/3
watchdog/2
watchdog/1
watchdog/0
ttm_swap
su 0 kB
sshd 0 kB
sshd 0 kB
sshd 0 kB
sshd 0 kB
sort 0 kB
snmpget 0 kB
snmpget 0 kB
snmpget 0 kB
snmpget 0 kB
sleep 0 kB
sh 0 kB
sh 0 kB
sh 0 kB
sh 0 kB
sh 0 kB
scsi_eh_2
scsi_eh_1
scsi_eh_0
rpciod
rcu_sched
rcu_bh
python 0 kB
php 0 kB
php 0 kB
php 0 kB
php 0 kB
php 0 kB
nfsd4_callbacks
nfsd4
nfsd
nfsd
nfsd
nfsd
nfsd
nfsd
nfsd
nfsd
netns
nagios 0 kB
nagios 0 kB
nagios 0 kB
nagios 0 kB
nagios 0 kB
nagios 0 kB
nagios 0 kB
nagios 0 kB
mpt_poll_0
mpt/0
migration/3
migration/2
migration/1
migration/0
md
lockd
kworker/u:1
kworker/u:0H
kworker/u:0
kworker/3:2
kworker/3:1H
kworker/3:0H
kworker/3:0
kworker/2:1H
kworker/2:1
kworker/2:0H
kworker/2:0
kworker/1:2
kworker/1:1H
kworker/1:1
kworker/1:0H
kworker/1:0
kworker/0:1H
kworker/0:1
kworker/0:0H
kworker/0:0
kthrotld
kthreadd
kswapd0
ksoftirqd/3
ksoftirqd/2
ksoftirqd/1
ksoftirqd/0
ksmd
kpsmoused
kintegrityd
khungtaskd
khubd
khelper
kdmflush
kdmflush
kdevtmpfs
kblockd
kauditd
jbd2/sda1-8
jbd2/dm-0-8
fsnotify_mark
flush-252:0
ext4-dio-unwrit
ext4-dio-unwrit
deferwq
crypto
cpuset
check_snmp 0 kB
check_snmp 0 kB
check_snmp 0 kB
check_snmp 0 kB
check_icmp 0 kB
check_icmp 0 kB
check_icmp 0 kB
check_icmp 0 kB
check_icmp 0 kB
check_icmp 0 kB
check_icmp 0 kB
check_icmp 0 kB
check_cisco_bw_ 0 kB
check_cisco_bw_ 0 kB
check_cisco_bw_ 0 kB
check_cisco_bw_ 0 kB
bdi-default
bash 0 kB
bash 0 kB
bash 0 kB
bash 0 kB
awk 0 kB
avail.cgi 0 kB
ata_sff

Code: Select all

[coadminsj@cawlkl21 ~]$ df -h
Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/vg_cawlkl21-lv_root
                      181G  107G   66G  63% /
tmpfs                 3.9G   68K  3.9G   1% /dev/shm
/dev/sda1             477M  189M  259M  43% /boot
avandemore
Posts: 1597
Joined: Tue Sep 27, 2016 4:57 pm

Re: availability reports slow

Post by avandemore »

You have an awful lot running on that system. Nagios XI is designed to be used on a clean server install, e.g. no GUI. The system swap is also in use, which will affect system speed in multiple ways. You can try to free up some memory by disabling unneeded components, or just increase the memory in the system.
Maximizing Performance In Nagios XI

Several of those steps may greatly improve your performance, although they can hurt it as well if the system is too low on memory.
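
To see which components hold the most swap before disabling anything, the per-process listing requested earlier can be totalled by process name. This is only a rough sketch: the function name `swap_by_name` and its optional directory argument are made up for illustration, and it assumes a Linux `/proc` layout where each `status` file carries `Name:` and `VmSwap:` lines.

```shell
# Sum VmSwap (kB) per process name, largest first.
# swap_by_name is a hypothetical helper, not part of Nagios XI.
swap_by_name() {
    # $1: proc root to scan (defaults to /proc)
    root="${1:-/proc}"
    for f in "$root"/[0-9]*/status; do
        [ -r "$f" ] || continue
        # Kernel threads have no VmSwap line, so they print nothing.
        awk '/^Name:/ {name=$2} /^VmSwap:/ {print name, $2}' "$f" 2>/dev/null
    done | awk '{kb[$1] += $2} END {for (n in kb) print kb[n], n}' | sort -nr
}
```

Running `swap_by_name | head` should list the heaviest swap users first, which may help decide what is worth turning off.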
perric
Posts: 161
Joined: Fri Mar 28, 2014 10:37 am

Re: availability reports slow

Post by perric »

I increased the memory but it didn't seem to make any difference in the speed of the report. There was still 6GB free memory so I don't think it's the bottleneck.

Maybe my expectations are just out of line for this? Is the availability report expected to run for a few minutes on a system of my size?

We are planning an upgrade and then migration to the 5.3.0 OVA to remove the overhead of the unneeded services on the current build and to get code current. I can test again afterwards to see if the performance is any better.

Thanks
avandemore
Posts: 1597
Joined: Tue Sep 27, 2016 4:57 pm

Re: availability reports slow

Post by avandemore »

Is the availability report expected to run for a few minutes on a system of my size?
No, it shouldn't take that long for a system of your size. As noted in the referenced thread, the time to generate the report is almost entirely driven by disk I/O. Is that abnormally slow for some reason, e.g. slow disk, network storage, etc.?
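
One rough way to check raw read speed is to time a full read of the archived log directory. This is a sketch only: `time_read` is a hypothetical helper, the directory passed to it is whatever holds your archives (the thread shows `/usr/local/nagios/var/archives/`), and a second run will likely be served from the page cache and look faster than the disk really is.

```shell
# Read every file in a directory once and report bytes read and elapsed seconds.
# time_read is a hypothetical helper for illustration.
time_read() {
    # $1: directory to read
    dir="$1"
    start=$(date +%s)
    bytes=$(cat "$dir"/* 2>/dev/null | wc -c | tr -d ' ')
    end=$(date +%s)
    echo "bytes=$bytes seconds=$((end - start))"
}
```

For example, `time_read /usr/local/nagios/var/archives` prints one line; if the read finishes in a few seconds, the storage itself is probably not what is slowing the report down.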

Can you download the CSV of the report and let us know how long it takes?

XI > Admin > System Profile > Download Profile

Please include the zip file in your response. You can PM me or other support personnel if you have privacy concerns.

Also, please send a screenshot of XI > Admin > Monitoring Engine Status

FYI, 5.3.2 is the current release.
perric
Posts: 161
Joined: Fri Mar 28, 2014 10:37 am

Re: availability reports slow

Post by perric »

For the default availability report, CSV: hosts = 28 sec, services = 1:49. Web view is 2:16.
Sys profile and monitoring screenshot attached

edit: also, it is running on our tier 1 SSD backed SAN, so there should be no issue with disk access speeds.

Thanks
avandemore
Posts: 1597
Joined: Tue Sep 27, 2016 4:57 pm

Re: availability reports slow

Post by avandemore »

Apache is reporting a strange periodic error like this: incomplete redirection target of '/nagiosxi/' for URI '/' modified to 'http://10.100.224.121/nagiosxi/'

At this point, since you're planning an upgrade anyway, let's do that first, as a significant number of things have changed under the hood.

Remember to do the upgrade first, using this process: https://assets.nagios.com/downloads/nag ... nstall.pdf

For the migration, use this: https://assets.nagios.com/downloads/nag ... ios-XI.pdf

For best results, restore onto a clean install of CentOS/RHEL 6/7 minimal. The only additional RPM needed to bootstrap is wget.

Let us know if that helped or not.
perric
Posts: 161
Joined: Fri Mar 28, 2014 10:37 am

Re: availability reports slow

Post by perric »

Ok, will do.
The Apache error is probably caused by a customization I did that redirects / to /nagiosxi/ for convenience.
avandemore
Posts: 1597
Joined: Tue Sep 27, 2016 4:57 pm

Re: availability reports slow

Post by avandemore »

That line likely needs http://10.100.224.121/ in front of the target URL then, plus an apache restart.
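
To locate the line in question, something like the following may help. It is a sketch under assumptions: `find_relative_redirects` is a made-up helper, and it only looks for mod_alias `Redirect`/`RedirectMatch` directives whose target is not an absolute http(s) URL. If the customization was done with mod_rewrite instead, a different search would be needed.

```shell
# List Redirect/RedirectMatch directives whose target is not an absolute URL.
# find_relative_redirects is a hypothetical helper for illustration.
find_relative_redirects() {
    # $1: directory holding Apache config files (e.g. /etc/httpd/conf.d on CentOS/RHEL)
    grep -rnE '^[[:space:]]*Redirect(Match)?[[:space:]]' "$1" \
        | grep -vE '[[:space:]]"?https?://[^[:space:]]+"?[[:space:]]*$'
}
```

Each line printed is a file, line number, and directive to review. After editing, `apachectl configtest` checks the syntax before the restart.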

As a general rule, anything in the error_log shouldn't be there.