I am trying to schedule monthly reports for some of our ecosystems (as ERP, domain controlers, messaging servers...) to track availability.
When directly generating ("last month" period) reports with the Web GUI, reports display well after computing data a long time (around 6/8min).
During this time, browser is stuck, and server is quite long to answer to another requests... But it works.
When sending the reports by email, same behavior (browser stuck
Then I schedule the same reports to get them in the night each month in the 1st days.
I add separated hosts & services CSV files, but no PDF file.
The cron entries are created as expected and launched.
I also found the related calls to availability.php in Apache logs (HTTP code 200) at the scheduled time.
But some reports are incomplete.
The incomplete ones contain CSV files with this kind of lines :
Code: Select all
host,service,ok %,warning %,unknown %,critical %
<p><pre>SQL Error [nagiosxi] : MySQL server has gone away</pre></p>
<p><pre>SQL Error [nagiosxi] : MySQL server has gone away</pre></p>
...MariaDB:
- wait_timeout = 300
- max_allowed_packet = 1048576
Code: Select all
-------- Performance Metrics -----------------------------------------------------------------------
[--] Up for: 58d 0h 42m 14s (20B q [4K qps], 11M conn, TX: 12399G, RX: 9365G)
[--] Reads / Writes: 4% / 96%
[--] Binary logging is disabled
[--] Physical Memory : 62.3G
[--] Max MySQL memory : 3.2G
[--] Other process memory: 660.6M
[--] Total buffers: 912.0M global + 4.7M per thread (500 max threads)
[--] P_S Max memory usage: 0B
[--] Galera GCache Max memory usage: 0B
[OK] Maximum reached memory usage: 2.5G (4.00% of installed RAM)
[OK] Maximum possible memory usage: 3.2G (5.08% of installed RAM)
[OK] Overall possible memory usage with other process is compatible with memory available
[OK] Slow queries: 0% (353/20B)
[OK] Highest usage of available connections: 70% (352/500)
[OK] Aborted connections: 0.19% (21331/11344995)
[OK] Query cache is disabled by default due to mutex contention on multiprocessor machines.
[!!] Sorts requiring temporary tables: 23% (5M temp sorts / 24M sorts)
[!!] Joins performed without indexes: 676053
[OK] Temporary tables created on disk: 19% (3M on disk / 17M total)
[OK] Thread cache hit rate: 73% (3M created / 11M connections)
[!!] Table cache hit rate: 0% (309 open / 141K opened)
[OK] Open file limit used: 17% (436/2K)
[OK] Table locks acquired immediately: 99% (20B immediate / 20B locks)
Do you have any ideas to debug what happens ?
Thank you
Nagios XI 5.4.13 - 2251 Hosts, 13440 related services
Dedicated physical server / 2x20 Intel(R) Xeon(R) Silver 4114 CPU @ 2.20GHz / 64GB RAM
RHEL 7.4 / MariaDB 5.5.56-2 /Apache HTTPd 2.4.6-67