CPU load || Fake Alert
Posted: Thu Jul 19, 2018 12:49 am
As checked, we are getting fake alerts for few node.
[root@nagiosxi libexec]# su - nagios
[nagios@nagiosxi ~]$ cd /usr/local/nagios/libexec
[nagios@nagiosxi libexec]$ ./check_snmp_load_wizard.pl -H HCSMS1 -C public --v2c -w 90 -c 95 -T netsc -f CPU used 100.0% (>95) : CRITICAL | cpu_prct_used=100%;90;95
[nagios@nagiosxi libexec]$ date
Thu Jul 19 12:00:47 ICT 2018
[nagios@nagiosxi libexec]$ ssh HCSMS1
Last login: Wed Jul 18 12:21:03 2018 from 10.10.164.52
[nagios@HCSMS1 ~]$ uptime
11:55:35 up 905 days, 15:57, 2 users, load average: 0.86, 0.69, 0.61
[nagios@HCSMS1 ~]$ date
Thu Jul 19 11:55:38 ICT 2018
[nagios@HCSMS1 ~]$
[nagios@HCSMS1 ~]$
[nagios@HCSMS1 ~]$ logout
Connection to HCSMS1 closed.
[nagios@nagiosxi libexec]$ ./check_snmp_load_wizard.pl -H HCSMS1X4 -C public --v2c -w 90 -c 95 -T netsc -f
CPU used 100.0% (>95) : CRITICAL | cpu_prct_used=100%;90;95
[nagios@nagiosxi libexec]$ ssh HCSMS1X4
Last login: Wed Jun 13 18:16:52 2018 from 10.10.164.52
[nagios@hcsms1x4 ~]$ uptime
12:02:26 up 735 days, 10:56, 2 users, load average: 0.38, 0.42, 0.31
[nagios@hcsms1x4 ~]$ date
Thu Jul 19 12:02:28 ICT 2018
[nagios@hcsms1x4 ~]$
we are getting 100% CPU load in the alert window. As checked, the load is normal. Please suggest. This check works fine for all the nodes except these nodes. It is considering idle CPU. Please suggest the way forward to get rid of the same.
[root@nagiosxi libexec]# su - nagios
[nagios@nagiosxi ~]$ cd /usr/local/nagios/libexec
[nagios@nagiosxi libexec]$ ./check_snmp_load_wizard.pl -H HCSMS1 -C public --v2c -w 90 -c 95 -T netsc -f CPU used 100.0% (>95) : CRITICAL | cpu_prct_used=100%;90;95
[nagios@nagiosxi libexec]$ date
Thu Jul 19 12:00:47 ICT 2018
[nagios@nagiosxi libexec]$ ssh HCSMS1
Last login: Wed Jul 18 12:21:03 2018 from 10.10.164.52
[nagios@HCSMS1 ~]$ uptime
11:55:35 up 905 days, 15:57, 2 users, load average: 0.86, 0.69, 0.61
[nagios@HCSMS1 ~]$ date
Thu Jul 19 11:55:38 ICT 2018
[nagios@HCSMS1 ~]$
[nagios@HCSMS1 ~]$
[nagios@HCSMS1 ~]$ logout
Connection to HCSMS1 closed.
[nagios@nagiosxi libexec]$ ./check_snmp_load_wizard.pl -H HCSMS1X4 -C public --v2c -w 90 -c 95 -T netsc -f
CPU used 100.0% (>95) : CRITICAL | cpu_prct_used=100%;90;95
[nagios@nagiosxi libexec]$ ssh HCSMS1X4
Last login: Wed Jun 13 18:16:52 2018 from 10.10.164.52
[nagios@hcsms1x4 ~]$ uptime
12:02:26 up 735 days, 10:56, 2 users, load average: 0.38, 0.42, 0.31
[nagios@hcsms1x4 ~]$ date
Thu Jul 19 12:02:28 ICT 2018
[nagios@hcsms1x4 ~]$
we are getting 100% CPU load in the alert window. As checked, the load is normal. Please suggest. This check works fine for all the nodes except these nodes. It is considering idle CPU. Please suggest the way forward to get rid of the same.