System Profile: Linux Centos 6.6:
Code: Select all
root@nagios:~ $ uname -ra
Linux nagios 2.6.32-504.12.2.el6.i686 #1 SMP Wed Mar 11 19:05:53 UTC 2015 i686 i686 i386 GNU/Linux
I posted initally under a different account: https://support.nagios.com/forum/viewto ... =6&t=32785
I've freed up more space so MYSQL sin't throwing errors anymore. But I'm sill getting the CPU spike but less frequently.
I hacked together a script to log the processes that are on the CPU every 2 secs when a spike starts:
Code: Select all
#!/bin/bash
load_threshold=2
end=$((SECONDS+86400))
while [ $SECONDS -lt $end ]; do
loadavg=$(cat /proc/loadavg | awk '{print $1}' | cut -c1)
if [ $loadavg -gt $load_threshold ]
then
date >> cpu.log
ps aruxw | awk 'NR>1'| awk '{print $1, $3, $5, $9, $10, $11, $12, $13, $14, $15, $16, $17, $18, $19}' >> cpu.log
fi
sleep 2s
:
done
Load graph from the Nagios GUI:
As the 15min load jump is almost instantaneous at the start (and exactly the same values for 15 mins) I think the check isn't getting done at all until the end of the spike and 'fills in' the values from the current load.
Thanks,
Ira.