Page 1 of 1

high CPU, database connection error, blue Nagvis map

Posted: Mon Dec 24, 2018 1:12 am
by zaji_nms
Dear Expert

We facing high CPU Load on our NagiosXI server intermittently (random time) and becoming okay without any action from our end....we observing below
Nagvis MAP going blue for 30 to 50 seconds
on Home Operations Center giving error : database connection error has been detected....repair_databases.sh

10009 apache 20 0 160m 6016 3968 R 30.0 0.0 0:00.91 rrdtool
10059 apache 20 0 180m 17m 15m R 24.1 0.1 0:00.73 rrdtool
15263 apache 20 0 497m 36m 19m S 20.4 0.1 0:03.86 httpd
9054 mysql 20 0 5291m 198m 6556 S 16.5 0.6 2496:41 mysqld
14399 apache 20 0 497m 36m 19m S 14.5 0.1 0:02.47 httpd
4531 apache 20 0 497m 36m 19m S 13.5 0.1 0:02.87 httpd
30943 apache 20 0 497m 35m 19m S 6.6 0.1 0:03.29 httpd
10267 apache 20 0 175m 13m 12m R 4.9 0.0 0:00.15 rrdtool

above is extract from TOP command, on some instant rrdtool/httpd taking very high CPU and sometime very less.

please note this happened now only and it was working great on old Centos old NagiosXI , so no need to upgrade it
CentOS release 6.6 (Final)
Nagios XI 5.2.7

Is there any general topic/guideline to check why? who is taking to much process?
Is there any issue with DataStorage? How to find?

Regards

Re: high CPU, database connection error, blue Nagvis map

Posted: Wed Dec 26, 2018 12:45 pm
by cdienger
rrdtool is used for performance graphs. If one hasn't been set up already, I would recommend a RAM disk which moves much of the processing to memory to improve performance - https://assets.nagios.com/downloads/nag ... giosXI.pdf

https://assets.nagios.com/downloads/nag ... ios-XI.pdf has some additional things that can help to improve performance.

As far as general things to check:

logs:
/var/log/messages
/var/log/httpd/*
/var/log/mysqdl.log
/usr/local/nagios/var/*.log


processes(while load is high):
top command ouptut
"ps aux"