A server goes down
-
ancovington
- Posts: 175
- Joined: Fri Sep 13, 2013 1:32 pm
A server goes down
One of my VM hosts went down!
Is there a page on Nagios that will tell me why it went down?
Re: A server goes down
It can definitely tell you *that* it went down, but the *why* is a lot harder. If a host goes down it can't really send any data to Nagios, and aside from state and perfdata Nagios doesn't have much to go by. Try taking a look at the data just before it went down and see if there is any indication of a problem (high CPU, slow disk, etc).
Former Nagios employee
-
ancovington
- Posts: 175
- Joined: Fri Sep 13, 2013 1:32 pm
Re: A server goes down
Thank you. I don't think management will be too happy knowing that we bought a tool that really doesn't tell us what's going on, but we will try to make do. Thank you again.
-
slansing
- Posts: 7698
- Joined: Mon Apr 23, 2012 4:28 pm
- Location: Travelling through time and space...
Re: A server goes down
It would be hard for a host to tell you what went wrong when it is offline. Perhaps you should set up some checks that are designed to warn you properly before an issue occurs. For instance, you might want to lower your warning threshold on certain checks. Depending on why the VM went down, I would focus on a plugin or check that can monitor that. Are you a sysadmin? Or do you have any sysadmins in the house who can look into the syslogs or kernel messages so you can tell why it went down? Did someone just shut the VM down?
If you are only monitoring memory, partition space, CPU, etc., and you were not warned by those checks, either something happened IMMEDIATELY, or your thresholds are far off from what they should be. Another possibility is, as I noted above, that you are not running any checks that would be of value for what caused that VM to crash; maybe it had nothing to do with CPU, memory, or share space.
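To make the threshold point concrete, here is a rough sketch of how a check turns a measured value into a state. It assumes the usual Nagios plugin convention (exit codes 0, 1, 2, 3 for OK, WARNING, CRITICAL, UNKNOWN) and a metric where higher is worse; the function names and the 80/90 thresholds are made up for illustration and are not taken from any shipped plugin.

```python
# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def evaluate(value, warn, crit):
    """Map a metric to a plugin state, assuming higher values are worse."""
    if value >= crit:
        return CRITICAL
    if value >= warn:
        return WARNING
    return OK


def status_line(label, value, warn, crit):
    """Return (exit_code, output) in the usual 'text | perfdata' shape."""
    state = evaluate(value, warn, crit)
    name = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL"}[state]
    # Everything after the pipe is perfdata: label=value;warn;crit
    output = f"{label} {name} - {value}% | {label.lower()}={value}%;{warn};{crit}"
    return state, output


# Hypothetical usage: a disk at 95% with thresholds of 80 (warn) and 90 (crit).
code, text = status_line("DISK", 95, 80, 90)
print(code, text)
```

Lowering `warn` is what buys you earlier notice: with the same data, a warning threshold of 70 would have fired well before the disk reached 95%.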
Re: A server goes down
ancovington wrote: Thank you. I don't think management will be too happy knowing that we bought a tool that really doesn't tell us what's going on, but we will try to make do. Thank you again.
There isn't a tool in the world that is capable of telling you why everything in your network happens. If there were, then 70% of tech support would be out of business. It's not a software shortcoming, it's a technological constraint. Sorry if this is a bit morbid, but you can compare the situation to an autopsy. To determine cause of death you can't rely on the person telling you why they died; you have to have a professional diagnosis. To keep the analogy going, if the person calls you every day and says "I'm not feeling well", then you can start to get an idea of what might be wrong, but you can't know for sure without going to a doctor.
Some things are obvious like "Disk Usage 95%" or "CPU Load 5, 6, 6" but if a server just unexpectedly stops you have to take a look at the logs.
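If it helps, a first pass over the logs can be sketched as a simple keyword scan. This is only an illustration: the pattern list below is an assumption about what is worth flagging (OOM kills, kernel panics, I/O errors, shutdowns), not a complete list, and the log path in the usage note depends on your distribution.

```python
import re

# Illustrative patterns only; extend or replace them for your own systems.
PATTERNS = re.compile(
    r"out of memory|oom-killer|kernel panic|i/o error|shutting down",
    re.IGNORECASE,
)


def suspicious_lines(lines):
    """Return the log lines that match any of the patterns above."""
    return [line.rstrip("\n") for line in lines if PATTERNS.search(line)]
```

You would feed it an open log file, for example `suspicious_lines(open("/var/log/messages"))` on a system that logs there, and then read the entries just before the outage by hand.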
Former Nagios employee
-
ancovington
- Posts: 175
- Joined: Fri Sep 13, 2013 1:32 pm
Re: A server goes down
Thank you. After working with support Friday, the Nagios server is only seeing one CPU, but 4 were added to the OS at the start of the build. Could you please explain why this is?
-
slansing
- Posts: 7698
- Joined: Mon Apr 23, 2012 4:28 pm
- Location: Travelling through time and space...
Re: A server goes down
What do you mean 4 were added to your OS? Are you running a VM? Did you make sure to properly add the new hardware allocations with the instructions VMware provides?
Re: A server goes down
What does ESX report? Is the VM still provisioned with 4 cores?
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
-
ancovington
- Posts: 175
- Joined: Fri Sep 13, 2013 1:32 pm
Re: A server goes down
Yes, the VM is still provisioned for 4 cores.
-
slansing
- Posts: 7698
- Joined: Mon Apr 23, 2012 4:28 pm
- Location: Travelling through time and space...
Re: A server goes down
Are you sure it was not 4 cores that were provisioned when the VM was created, and not 4 CPUs? It sounds like you provisioned 1 CPU, 4 cores.
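One way to check this from inside the guest, assuming it is an x86 Linux VM, is to compare the number of logical processors with the number of distinct sockets in `/proc/cpuinfo`. The helper below is a sketch with a made-up name; it relies on the `processor` and `physical id` fields, and `physical id` may be missing on some platforms, in which case the socket count comes back as 0.

```python
def cpu_topology(cpuinfo_text):
    """Return (logical_cpus, sockets) from /proc/cpuinfo-style text."""
    logical = 0
    sockets = set()
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        key = key.strip()
        if key == "processor":
            # One "processor" entry per logical CPU the OS can schedule on.
            logical += 1
        elif key == "physical id":
            # Distinct physical ids correspond to distinct sockets.
            sockets.add(value.strip())
    return logical, len(sockets)
```

On the guest you could call `cpu_topology(open("/proc/cpuinfo").read())`. A result of `(4, 1)` would mean one socket presenting four logical processors, while `(4, 4)` would mean four single-core sockets.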