I see this consuming a lot of CPU:
6654 root 20 0 9768056 361804 40084 S 305.6 1.1 8087:20 ds_am
Please try disabling the Deep Security agent (the ds_am process above) and see whether it is what is slowing down your checks and causing them to hit their timeouts. My assumption is that everything Nagios does is being slowed down by the agent scanning for threats; that would be my first guess based on what you've described. If disabling it resolves the problem, you would need to either contact the agent vendor and ask what can be done, or increase the timeouts on your checks.
I see these as well (will cause gaps in your graphs):
[01-05-2022 18:05:30] NPCD: WARN: MAX load reached: load 48.040000/10.000000 at i=1
[01-05-2022 18:05:45] NPCD: WARN: MAX load reached: load 49.040000/10.000000 at i=1
[01-05-2022 18:06:00] NPCD: WARN: MAX load reached: load 47.240000/10.000000 at i=1
[01-05-2022 18:06:15] NPCD: WARN: MAX load reached: load 50.650000/10.000000 at i=1
2022-01-01 19:59:13 [10780] [0] *** TIMEOUT: Timeout after 20 secs. ***
2022-01-01 19:59:13 [10780] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2022-01-01 19:59:13 [10780] [0] *** TIMEOUT: Please check your npcd.cfg
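If you want a quick picture of how often NPCD is hitting its load limit, the warnings above can be summarized with a small helper. This is only a rough sketch: the function name summarize_npcd_load is made up for illustration, and it assumes the log lines have exactly the format quoted above.

```shell
# Summarize NPCD "MAX load reached" warnings read from stdin.
# Prints three values: <number of warnings> <peak load> <configured threshold>
# summarize_npcd_load is a hypothetical helper name, not part of Nagios XI.
summarize_npcd_load() {
  awk '
    /NPCD: WARN: MAX load reached/ {
      # Look for the "load" token that is followed by a pair such as
      # 48.040000/10.000000 (current load / load_threshold).
      for (i = 1; i < NF; i++) {
        if ($i == "load" && split($(i + 1), parts, "/") == 2) {
          count++
          if (parts[1] + 0 > peak) peak = parts[1] + 0
          threshold = parts[2] + 0
        }
      }
    }
    END { printf "%d %.2f %.2f\n", count, peak, threshold }
  '
}
```

You would feed it your NPCD log, for example `summarize_npcd_load < /usr/local/nagios/var/npcd.log` (that path is an assumption, adjust it to wherever your npcd.log lives). A peak far above the threshold means perfdata processing is being skipped regularly.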
Please follow this guide to set your load_threshold to 80.0 and your TIMEOUT to 40:
https://support.nagios.com/kb/article.php?id=9
Please send the output of these commands:
ulimit -a
su -s /bin/bash -c 'ulimit -a' nagios
su -s /bin/bash -c 'ulimit -a' mysql
su -s /bin/bash -c 'ulimit -a' apache
netstat -s
ethtool -S eth0
Additionally, please send the output of this command:
- NOTE: You may need to adjust the -uroot and -pnagiosxi in the command if you've changed the root MySQL password.
echo "SELECT table_name AS 'Table', round(((data_length + index_length) / 1024 / 1024), 2) 'Size in MB' FROM information_schema.TABLES WHERE table_schema IN ('nagios', 'nagiosql', 'nagiosxi');" | mysql -uroot -pnagiosxi --table