CPU load triggered all linux machines

reincarne
Posts: 146
Joined: Wed Jun 26, 2013 4:39 am

Post by reincarne »

Hi,
I have a weird problem with the CPU load check. Once it is triggered on the Nagios machine, almost all Linux machines are triggered as well and alert on high CPU load.
I'm sure there is no real problem, and that this is caused by some kind of bug or bad configuration.


Thanks in advance.
BanditBBS
Posts: 2474
Joined: Tue May 31, 2011 12:57 pm
Location: Scio, OH

Re: CPU load triggered all linux machines

Post by BanditBBS »

What command do you have set up for checking the CPU_Load on the remote machines? Are you using check_cpu or something similar? If so, that checks the local machine only, even though you think it is checking the remote machines. You need to use nrpe or some other agent to check the remote machines.
sreinhardt
-fno-stack-protector
Posts: 4366
Joined: Mon Nov 19, 2012 12:10 pm

Re: CPU load triggered all linux machines

Post by sreinhardt »

Sounds like bandit is right on as usual. Are you using nrpe or another command to check the remote systems? Do all the systems have the same load, which also happens to be the same as the nagios machine?
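
One rough way to answer that question is to compare the load values in the status text of each check. The following is only a sketch: it assumes the status lines contain the usual "load average: a, b, c" text, and the host names and sample values are made up for illustration.

```python
import re

# Assumed status text format: "... load average: 1.00, 2.00, 3.00"
LOAD_RE = re.compile(r"load average:\s*([\d.]+),\s*([\d.]+),\s*([\d.]+)")


def parse_load(output):
    """Return the (1, 5, 15 minute) load values, or None if none are found."""
    match = LOAD_RE.search(output)
    if match is None:
        return None
    return tuple(float(value) for value in match.groups())


def hosts_matching_server(server_output, remote_outputs):
    """List remote hosts that report exactly the same load as the Nagios server.

    An exact match on all three values suggests the check is really running
    locally on the Nagios machine rather than on the remote host.
    """
    server_load = parse_load(server_output)
    if server_load is None:
        return []
    return sorted(
        host
        for host, output in remote_outputs.items()
        if parse_load(output) == server_load
    )


if __name__ == "__main__":
    # Hypothetical sample data, not taken from the thread.
    server = "CRITICAL - load average: 4.10, 3.90, 3.70"
    remotes = {
        "web01": "CRITICAL - load average: 4.10, 3.90, 3.70",
        "db01": "OK - load average: 0.20, 0.15, 0.10",
    }
    print(hosts_matching_server(server, remotes))
```

If every remote host shows up in that list, the remote checks are most likely measuring the Nagios machine itself.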
reincarne
Posts: 146
Joined: Wed Jun 26, 2013 4:39 am

Re: CPU load triggered all linux machines

Post by reincarne »

sreinhardt wrote:Sounds like bandit is right on as usual. Are you using nrpe or another command to check the remote systems? Do all the systems have the same load, which also happens to be the same as the nagios machine?
I will check it.
But from what I see now, I'm checking the load on the Nagios machine using "check_local_load", which makes sense as it checks the local CPU load.
As for the rest of the servers, I'm using check_nrpe --> check_load.
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: CPU load triggered all linux machines

Post by slansing »

Your other service checks for load should not trigger based on another one unless this has been set up that way; it's likely a coincidence.
reincarne
Posts: 146
Joined: Wed Jun 26, 2013 4:39 am

Re: CPU load triggered all linux machines

Post by reincarne »

slansing wrote:Your other service checks for load should not trigger based on another one unless this is set up, its likely coincidence.
The only reasonable explanation I have come up with for the CPU load alert firing on all the Linux machines is something network-related, or some kind of service that triggers it.
Anyways, I will check that.
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: CPU load triggered all linux machines

Post by slansing »

That is what I would suspect. Were they going into a state that had nothing to do with CPU usage, say, throwing an out-of-bounds error?
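
A quick way to check this is to look at the status text of the alerts and separate real load results from agent or network errors. This is a minimal sketch only: the marker strings are typical NRPE error fragments chosen for illustration, not a complete list, and the category names are made up.

```python
import re

# A genuine check_load result is assumed to contain "load average: <number>".
LOAD_RE = re.compile(r"load average:\s*[\d.]+")

# Illustrative fragments of NRPE-side errors; extend as needed.
NRPE_HINTS = (
    "CHECK_NRPE",
    "Connection refused",
    "NRPE: Unable to read output",
)


def classify(output):
    """Classify a status line as 'load', 'agent', or 'unknown'."""
    if LOAD_RE.search(output):
        return "load"      # the plugin ran and reported load values
    if any(hint in output for hint in NRPE_HINTS):
        return "agent"     # the check failed before measuring anything
    return "unknown"


if __name__ == "__main__":
    # Hypothetical status lines for illustration.
    for line in (
        "CRITICAL - load average: 31.20, 30.10, 28.40",
        "CHECK_NRPE: Socket timeout after 10 seconds.",
    ):
        print(classify(line), "<-", line)
```

If most of the simultaneous alerts fall into the "agent" group, the problem is probably on the network or NRPE side and not actual CPU load.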