Page 2 of 3
Re: Event Manager (eventman) stale
Posted: Fri Sep 23, 2016 2:03 pm
by rkennedy
I am still not seeing a profile in my inbox, could you PM it over once again? We'll need this to look at your logs and dig further into it.
Re: Event Manager (eventman) stale
Posted: Sun Sep 25, 2016 8:13 am
by WVUhealth
Just sent you a PM of the profile If that does not show up again I will post my gmail address and you can PM me and I will reply.
Re: Event Manager (eventman) stale
Posted: Mon Sep 26, 2016 12:42 pm
by rkennedy
Received the output from SQL, but not seeing your profile still. Please send that over and post back once you have.
EDIT: profile received.
Re: Event Manager (eventman) stale
Posted: Tue Sep 27, 2016 9:37 am
by rkennedy
Looking through your profile, it's evident there are load spikes -
Code: Select all
[09-26-2016 16:40:55] NPCD: WARN: MAX load reached: load 10.410000/10.000000 at i=1
[09-26-2016 16:50:14] NPCD: WARN: MAX load reached: load 10.830000/10.000000 at i=0
[09-26-2016 17:15:15] NPCD: WARN: MAX load reached: load 11.160000/10.000000 at i=0
How many CPU's do you have allocated to this machine?
Re: Event Manager (eventman) stale
Posted: Tue Sep 27, 2016 10:12 am
by WVUhealth
Yeah its bursty utilization. cpu two Xeon E5620 @ 2.40GHz cores = 16 cpus
As you can see it normally is idle
ATOP - 2016/09/27 11:09:57 ------ 3d22h24s elapsed
PRC | sys 229m37s | user 668m18s | | #proc 451 | | #zombie 0 | clones 943e5 | | #exit 123 |
CPU | sys 40% | user 258% | irq 1% | | idle 1226% | wait 75% | | steal 0% | guest 0% |
cpu | sys 3% | user 22% | irq 0% | | idle 72% | cpu010 w 2% | | steal 0% | guest 0% |
cpu | sys 3% | user 22% | irq 0% | | idle 73% | cpu014 w 2% | | steal 0% | guest 0% |
cpu | sys 3% | user 21% | irq 0% | | idle 71% | cpu015 w 5% | | steal 0% | guest 0% |
cpu | sys 3% | user 20% | irq 0% | | idle 75% | cpu011 w 2% | | steal 0% | guest 0% |
cpu | sys 4% | user 18% | irq 0% | | idle 76% | cpu013 w 2% | | steal 0% | guest 0% |
cpu | sys 4% | user 18% | irq 0% | | idle 76% | cpu009 w 2% | | steal 0% | guest 0% |
cpu | sys 2% | user 18% | irq 0% | | idle 75% | cpu008 w 5% | | steal 0% | guest 0% |
cpu | sys 3% | user 17% | irq 0% | | idle 67% | cpu000 w 13% | | steal 0% | guest 0% |
cpu | sys 2% | user 17% | irq 0% | | idle 76% | cpu012 w 4% | | steal 0% | guest 0% |
cpu | sys 3% | user 17% | irq 0% | | idle 68% | cpu004 w 12% | | steal 0% | guest 0% |
cpu | sys 2% | user 13% | irq 0% | | idle 80% | cpu001 w 4% | | steal 0% | guest 0% |
cpu | sys 2% | user 13% | irq 0% | | idle 80% | cpu005 w 5% | | steal 0% | guest 0% |
cpu | sys 1% | user 11% | irq 0% | | idle 85% | cpu006 w 3% | | steal 0% | guest 0% |
cpu | sys 1% | user 11% | irq 0% | | idle 84% | cpu002 w 4% | | steal 0% | guest 0% |
cpu | sys 1% | user 10% | irq 0% | | idle 85% | cpu003 w 4% | | steal 0% | guest 0% |
cpu | sys 1% | user 10% | irq 0% | | idle 84% | cpu007 w 5% | | steal 0% | guest 0% |
CPL | avg1 2.85 | avg5 4.32 | avg15 4.81 | | | csw 197278e5 | intr 99200e5 | | numcpu 16 |
Re: Event Manager (eventman) stale
Posted: Tue Sep 27, 2016 2:36 pm
by tgriep
Can you run the following commands on the XI server and post the output?
Code: Select all
su nagios -c '/usr/bin/php /usr/local/nagiosxi/cron/eventman.php'
echo 'select * from xi_events;' | psql nagiosxi nagiosxi
I want to see if there are any errors generated when running the script manually and what is logged in the events table.
Thanks
Re: Event Manager (eventman) stale
Posted: Tue Sep 27, 2016 2:53 pm
by WVUhealth
Its a good bit of data.. the dbase event shows 1329 rows
i have them attached to your PM via a reply
Re: Event Manager (eventman) stale
Posted: Tue Sep 27, 2016 4:17 pm
by tgriep
I never received your PM. Can you post the output here?
Re: Event Manager (eventman) stale
Posted: Thu Sep 29, 2016 8:35 am
by WVUhealth
attached is the output from the two commands,..
su nagios -c '/usr/bin/php /usr/local/nagiosxi/cron/eventman.php'
echo 'select * from xi_events;' | psql nagiosxi nagiosxi
Re: Event Manager (eventman) stale
Posted: Thu Sep 29, 2016 4:37 pm
by tgriep
No errors in that file.
Can you run the following and post the output?
Code: Select all
ls -l /usr/local/nagioxi/var/
ls -l /usr/local/nagiosxi/cron/
cat /etc/cron.d/nagiosxi