Nagios Load very high
Posted: Thu Feb 13, 2020 11:22 am
Team,
We faced an issue on our primary Nagios system with a very high load.
We are not sure what caused it. At the time of the issue we captured the output of the top command, which is included below.
Both our primary and secondary servers are physical servers.
We currently monitor 3663 hosts and 14678 services.
Attached file contains:
load average screenshot, system profile, "ps -ef|grep Nagios" output, mysqld.log
Kindly let us know what might have caused the issue, and whether everything looks fine in our setup.
-----------
top - 15:00:38 up 111 days, 21:22, 1 user, load average: 225.08, 433.34, 328.94
Tasks: 732 total, 24 running, 673 sleeping, 0 stopped, 35 zombie
Cpu(s): 49.6%us, 29.8%sy, 0.0%ni, 20.5%id, 0.1%wa, 0.0%hi, 0.1%si, 0.0%st
Mem: 65953056k total, 55149932k used, 10803124k free, 1861404k buffers
Swap: 33554424k total, 65852k used, 33488572k free, 45598156k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
5701 mysql 23 0 570m 83m 4668 S 119.9 0.1 16971:43 mysqld
14428 nagios 25 0 10180 1348 672 R 99.9 0.0 0:42.71 nagios
14431 nagios 25 0 10184 1348 672 R 99.9 0.0 2:10.12 nagios
14403 nagios 25 0 10192 1336 696 R 99.2 0.0 2:11.50 nagios
14452 nagios 25 0 10180 1308 672 R 98.2 0.0 2:02.73 nagios
14449 nagios 25 0 10168 1352 696 R 97.2 0.0 1:37.45 nagios
26744 apache 20 0 779m 438m 4024 S 97.2 0.7 0:53.15 httpd
14413 nagios 25 0 10224 1332 672 R 96.9 0.0 2:05.33 nagios
14419 nagios 25 0 10212 1348 672 R 96.3 0.0 0:28.79 nagios
---------------
top - 15:12:30 up 111 days, 21:34, 3 users, load average: 106.61, 101.41, 181.74
Tasks: 1466 total, 477 running, 845 sleeping, 0 stopped, 144 zombie
Cpu(s): 62.5%us, 9.1%sy, 0.0%ni, 28.1%id, 0.0%wa, 0.0%hi, 0.3%si, 0.0%st
Mem: 65953056k total, 59548628k used, 6404428k free, 1872404k buffers
Swap: 33554424k total, 65848k used, 33488576k free, 45817000k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
5701 mysql 15 0 570m 83m 4668 S 191.8 0.1 16990:15 mysqld
16315 apache 15 0 911m 568m 5036 S 63.8 0.9 7:09.76 httpd
13610 apache 16 0 475m 135m 4504 S 60.9 0.2 0:51.48 httpd
7952 apache 15 0 445m 105m 4996 S 59.9 0.2 6:53.76 httpd
27294 apache 17 0 944m 598m 5144 S 58.9 0.9 10:39.81 httpd
1125 apache 16 0 398m 57m 4856 S 52.0 0.1 2:16.10 httpd
23301 apache 16 0 393m 52m 3788 S 51.4 0.1 0:30.66 httpd
24886 apache 15 0 439m 99m 4916 S 49.4 0.2 4:45.83 httpd
24547 apache 15 0 911m 568m 4952 S 46.5 0.9 5:01.02 httpd
1126 apache 16 0 447m 107m 4824 S 42.2 0.2 2:07.66 httpd
3472 apache 16 0 452m 113m 4972 R 40.6 0.2 4:45.21 httpd
22660 apache 16 0 450m 111m 4668 S 40.3 0.2 4:24.32 httpd
22705 apache 15 0 986m 645m 4892 S 38.0 1.0 5:05.08 httpd
30377 apache 15 0 463m 123m 4824 S 36.3 0.2 2:25.59 httpd
-------------------
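In case it helps with comparing the two snapshots above, here is a minimal sketch that pulls the load averages and task counts out of the top summary lines. The function name parse_top_header is only illustrative and not part of any Nagios tooling; it assumes the header format shown in the output above.

```python
import re


def parse_top_header(text):
    """Extract load averages and task counts from the summary lines of top output."""
    load = re.search(
        r"load average:\s*([\d.]+),\s*([\d.]+),\s*([\d.]+)", text
    )
    tasks = re.search(
        r"Tasks:\s*(\d+) total,\s*(\d+) running,\s*(\d+) sleeping,"
        r"\s*(\d+) stopped,\s*(\d+) zombie",
        text,
    )
    if not load or not tasks:
        raise ValueError("not a recognizable top header")
    keys = ("total", "running", "sleeping", "stopped", "zombie")
    return {
        # 1, 5 and 15 minute load averages, in that order
        "load": tuple(float(x) for x in load.groups()),
        "tasks": dict(zip(keys, (int(x) for x in tasks.groups()))),
    }


# Header lines copied from the two snapshots in this post.
first = parse_top_header(
    "top - 15:00:38 up 111 days, 21:22, 1 user, "
    "load average: 225.08, 433.34, 328.94\n"
    "Tasks: 732 total, 24 running, 673 sleeping, 0 stopped, 35 zombie"
)
second = parse_top_header(
    "top - 15:12:30 up 111 days, 21:34, 3 users, "
    "load average: 106.61, 101.41, 181.74\n"
    "Tasks: 1466 total, 477 running, 845 sleeping, 0 stopped, 144 zombie"
)

# Change between the 15:00 and 15:12 snapshots.
zombie_delta = second["tasks"]["zombie"] - first["tasks"]["zombie"]
running_delta = second["tasks"]["running"] - first["tasks"]["running"]
print("zombie change:", zombie_delta)
print("running change:", running_delta)
```

Between the two snapshots the 1 minute load average dropped, while the number of running and zombie tasks went up.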