Page 1 of 1

EC2 100% CPU usage for Nagios XI

Posted: Thu Jul 10, 2014 5:04 pm
by mengerink
I have deployed an instance per the following instructions:

http://assets.nagios.com/downloads/nagi ... -Cloud.pdf

I have configured Nagios to monitor my mysql instance as well as a website. When I click on the monitoring tab of the instance running NagiosXI, it shows that the CPU has been at 100% nearly the entire time the service has been up.

Has anyone else suffered this? Does anyone know what to do to fix this so that the CPU isn't getting chewed up so much by Nagios.

Re: EC2 100% CPU usage for Nagios XI

Posted: Fri Jul 11, 2014 9:23 am
by tmcdonald
Can you run the "top" command on the CLI and post the results?

Re: EC2 100% CPU usage for Nagios XI

Posted: Mon Jul 14, 2014 2:42 pm
by mengerink
Here goes. This is showing a 2.6% usage, but when I go to the AWS monitoring tab, it is still saying 100% which you can see in the attached image.

Code: Select all

top - 12:32:29 up 3 days, 22:25,  1 user,  load average: 0.72, 0.75, 0.72
Tasks: 131 total,   1 running, 130 sleeping,   0 stopped,   0 zombie
Cpu(s):  2.6%us,  0.9%sy,  0.0%ni, 96.2%id,  0.3%wa,  0.0%hi,  0.0%si,  0.0%st
Mem:    604940k total,   432352k used,   172588k free,    10120k buffers
Swap:        0k total,        0k used,        0k free,    60880k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
16723 nagios    20   0  182m  13m 3324 S  5.0  2.2   0:00.15 check_mysql_hea
 2164 nagios    20   0 50412 1096  340 S  0.3  0.2   0:42.70 ndo2db
16561 nagios    20   0  217m  22m 7788 S  0.3  3.7   0:00.21 php
16663 ec2-user  20   0 15028 1328 1000 R  0.3  0.2   0:00.05 top
    1 root      20   0 19224  468  180 S  0.0  0.1   0:00.36 init
    2 root      20   0     0    0    0 S  0.0  0.0   0:00.00 kthreadd
    3 root      RT   0     0    0    0 S  0.0  0.0   0:00.00 migration/0
    4 root      20   0     0    0    0 S  0.0  0.0   0:00.24 ksoftirqd/0
    5 root      RT   0     0    0    0 S  0.0  0.0   0:00.00 migration/0
    6 root      RT   0     0    0    0 S  0.0  0.0   0:00.00 watchdog/0
    7 root      20   0     0    0    0 S  0.0  0.0   0:00.17 events/0
    8 root      20   0     0    0    0 S  0.0  0.0   0:00.00 cpuset
    9 root      20   0     0    0    0 S  0.0  0.0   0:00.00 khelper
   10 root      20   0     0    0    0 S  0.0  0.0   0:00.00 netns
   11 root      20   0     0    0    0 S  0.0  0.0   0:00.00 async/mgr
   12 root      20   0     0    0    0 S  0.0  0.0   0:00.00 pm
   13 root      20   0     0    0    0 S  0.0  0.0   0:00.00 xenwatch
   14 root      20   0     0    0    0 S  0.0  0.0   0:00.00 xenbus
   15 root      20   0     0    0    0 S  0.0  0.0   0:00.00 sync_supers
   16 root      20   0     0    0    0 S  0.0  0.0   0:00.00 bdi-default
   17 root      20   0     0    0    0 S  0.0  0.0   0:00.00 kintegrityd/0
   18 root      20   0     0    0    0 S  0.0  0.0   0:03.88 kblockd/0
   19 root      20   0     0    0    0 S  0.0  0.0   0:00.00 ata/0
   20 root      20   0     0    0    0 S  0.0  0.0   0:00.00 ata_aux
   21 root      20   0     0    0    0 S  0.0  0.0   0:00.00 ksuspend_usbd  
   22 root      20   0     0    0    0 S  0.0  0.0   0:00.00 khubd
   23 root      20   0     0    0    0 S  0.0  0.0   0:00.00 kseriod
   24 root      20   0     0    0    0 S  0.0  0.0   0:00.00 md/0
   25 root      20   0     0    0    0 S  0.0  0.0   0:00.00 md_misc/0
   26 root      20   0     0    0    0 S  0.0  0.0   0:00.00 khungtaskd
   27 root      20   0     0    0    0 S  0.0  0.0   0:08.57 kswapd0
   28 root      25   5     0    0    0 S  0.0  0.0   0:00.00 ksmd
   29 root      20   0     0    0    0 S  0.0  0.0   0:00.00 aio/0
   30 root      20   0     0    0    0 S  0.0  0.0   0:00.00 crypto/0
   35 root      20   0     0    0    0 S  0.0  0.0   0:00.00 kthrotld/0
   37 root      20   0     0    0    0 S  0.0  0.0   0:00.00 khvcd
   38 root      20   0     0    0    0 S  0.0  0.0   0:00.00 kpsmoused
   39 root      20   0     0    0    0 S  0.0  0.0   0:00.00 usbhid_resumer
  147 root      20   0     0    0    0 S  0.0  0.0   0:17.48 jbd2/xvde1-8
  148 root      20   0     0    0    0 S  0.0  0.0   0:00.00 ext4-dio-unwrit
  213 root      16  -4 10780  516   52 S  0.0  0.1   0:00.03 udevd
  404 postgres  20   0  212m 5732 3532 S  0.0  0.9   0:01.32 postmaster  
  412 root      18  -2 10776  516   52 S  0.0  0.1   0:00.00 udevd
  415 root      20   0     0    0    0 S  0.0  0.0   0:00.00 kstriped                
========================

Re: EC2 100% CPU usage for Nagios XI

Posted: Mon Jul 14, 2014 4:01 pm
by sreinhardt
Your top looks pretty darn normal, but I agree the ec2 screen is a bit odd. If you wouldn't mind, let's go ahead and run the nagiostats and nagios xi server wizards against localhost. This should allow us to collect some additional system info on a continual basis and see if the ec2 metrics match up. Just a note, you don't need to set notifications on any of this, it's purely for statistics unless you want to be notified for these.

Re: EC2 100% CPU usage for Nagios XI

Posted: Tue Jul 15, 2014 12:40 pm
by mengerink
Ok, I've run for a day.

Here's the current info:

Last Update: Tue Jul 15 2014 10:38:38 GMT-0700 (PDT)
Nagios BPI v2.31
Nagios Enterprises, LLC
BPI Documentation
Manually Edit Config

Essential group members are denoted with: **
Ok Local Services URL Group health is 100.00% with 0 problem(s) Example BPI Group Edit Delete
Ok localhost Current Load
OK - load average: 0.67, 0.61, 0.57
Ok localhost Current Users USERS OK - 0 users currently logged in
Ok localhost HTTP HTTP OK: HTTP/1.1 200 OK - 3193 bytes in 0.002 second response time
Ok** localhost PING PING OK - Packet loss = 0%, RTA = 0.07 ms

Re: EC2 100% CPU usage for Nagios XI

Posted: Tue Jul 15, 2014 4:09 pm
by Box293
mengerink wrote:When I click on the monitoring tab of the instance running NagiosXI, it shows that the CPU has been at 100% nearly the entire time the service has been up
Are you able to view a graph that shows actual CPU usage as MHz instead of a percentage?

Just trying to look at things from a different angle.

Re: EC2 100% CPU usage for Nagios XI

Posted: Tue Jul 15, 2014 7:37 pm
by mengerink
No, I do not seem to be able to change the units on the graph.

Re: EC2 100% CPU usage for Nagios XI

Posted: Wed Jul 16, 2014 3:09 pm
by tmcdonald
I believe these two articles are of interest:

http://www.axibase.com/cloud/2010/07/22 ... tolen-cpu/
http://stackoverflow.com/questions/1170 ... -going-off

Essentially, top might be lying because the VM "steals" CPU time in order to run other things. The guest doesn't notice but the host does.