Dell R910

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Scratchmang
Posts: 7
Joined: Thu Nov 04, 2021 8:05 pm

Dell R910

Post by Scratchmang »

So Apparantly I have access to this forum for the next 30 Days. I'm hoping it take a lot less time to figure out my issue, so here go...

Copy and pasted from the Community Support forums..

===========================================
Hey everyone,

OK, so first off, I'm a windows guy at heart, so a lot of the configuring and adding of plugins, 'pip' etc are rather confusing to me (And I'll not even get into the horror of 'vi') but there you go, so please, type really slowly when replying as I'm slow ok :P

So here's the deal, I've got a Dell R910 in the house for my own entertainment, 4 Processors, 256 Gig of ram and 8 HDD's in a RAID 5 Configuration running ESXi 6.5. There's and iDRAC v6 in there too. About a month ago one of the processors went south, I say 'about' as with nothing monitoring it I have no idea when it really went, I just noticed the front panel flashing orange at me the other day when I was in the 'Server Room'. Long story short I replaced the CPU and all is well but it got me to thinking that I really need something to monitor the CPU's, Memory and more importantly the 8 individual HDD's in case one of them fails sometime in the future. Of course if I get this all running it will never happen but that's a whole different battle with Murphy and his Law's....

Anyway I downloaded the NagiosXI ova (nagiosxi-5.8.7-64.ova) file and installed it on the Server and got to configuring. Managed to get it doing 'some' SNMP monitoring but it's not really giving me what I want which as mentioned is checking the CPU's, memory and Disks in the RAID array. I've downloaded the Dell R910 plugin and all the associated MIB's tried doing SNMP walks, adding whatever I could, but to no avail. I'm at the limits of my technical expertise for this. SO I'm turning tot he 'Hive Mind'; here in the hopes that someone may have some insight as to how to get it to show me what I'm after....

For simplicity sake, here's a bit of a breakdown of the setup, which in and oif itself may be wrong, who knows...

Dell R910, ESXi Server on IP - 192.168.42.127
vCenter Server on IP - 192.1368.42.128
Dell iDRAC v6 on IP - 192.168.42.129
-- From what I can tell the iDRAC is not the enterprise version, or at least not registered / licensed / activate as such.
NagiosXI Install on IP - 192.168.42.15

Installed Modules (Plugins?)
Dell_EMC_OpenManage_Plugin_v3.1_Nagios_XI_A00.tar.gz
- Contained - Dell_EMC_OM_NagiosXI_monitoring_wizard.zip - A Dell specific monitoring plugin
VMware-vSphere-Perl-SDK-7.0.0-17698549.x86_64.tar.gz - This I had to ''pip' in it was successful but did not seem to help at all.
dellopenmanage.zip - A Dell specific monitoring plugin
Dell-OM-MIBS-850_A00.zip (MIB's)
DCMIB65.zip (Another set of MIB's)

I've pointed NagiosXI at each of theses IP's individually using the Configuration Modules of "SNMP" and the 'DELL iDRAC' with limited success, some things report back, like Uptime (Really useful) and responses back form the vCenter IP will show me the different VM's I have running, but that's really not what I'm after here. Really all I want is CPU 1-4 OK or Bad, Memory slot "X" Good or Bad, and Physical Disk 1-8 Good or Bad.

I have been playing around with some other SNMP Monitoring solutions, most notable an On-Line one called Site24x7 which did give me a breakdown of all the individual drives as to how I'm unsure, so I know it is possible, however do to the costs of Site24x7 I would prefer to use NagiosXI if at all possible.

So, and thoughts or insights form the Hive mind? And please remember type slowly when you reply eh?

Thanks in advance.

Scratch
Alberta, Canada.
gsmith
Posts: 1253
Joined: Tue Mar 02, 2021 11:15 am

Re: Dell R910

Post by gsmith »

Hey there,

We'll get you up and running. A few questions:

1. Are these four items all on one physical machine:
Dell R910, ESXi Server on IP - 192.168.42.127
Dell iDRAC v6 on IP - 192.168.42.129
vCenter Server on IP - 192.168.42.128
NagiosXI Install on IP - 192.168.42.15

2. My understanding is that the iDRAC sits on the motherboard and is used to manage the server,
is that correct? I'm not sure if we have to monitor it but we can save this guy for last.

3. Looks like you were trying to use SNMP to monitor things, is that a requirement or can we install
a Nagios software agent on the 192.168.42.127 node?

4. What OS is the NagiosXI image running on?

We should be OK with monitoring the vCenter node if you were able to get the perl SDK
installed on the Nagios XI server.

If you are running a trial version you could request a QuickStart session, which is a Webex
meeting for an hour. You can ask questions, and we could walk you through setting some
of this up.

Or you can try to get things running via this forum, and save the QuickStart in case
you really get stumped.
Scratchmang
Posts: 7
Joined: Thu Nov 04, 2021 8:05 pm

Re: Dell R910

Post by Scratchmang »

Hey,

So in answer to your questions....

1 - Are these four items all on one physical machine:
Yes, they are all one single machine. The Dell R910 is pretty much the only system I have.

2 - My understanding is that the iDRAC sits on the motherboard
Yes, that is correct, it's an older version (v6) I think they are up to v9 now. I'd have to verify.

3 - Looks like you were trying to use SNMP to monitor things.
Yes, it's pretty much the only option as far as I can tell to monitor the hardware as it's ESXi that talks to / controls the hardware, everything else (The Windows and Linux Machines) only talk to ESXi.

4 - What OS is the NagiosXI image running on?
CentOS I believe. Whatever OS in contained within the nagiosxi-5.8.7-64.ova image...


-- "We should be OK with monitoring the vCenter node"
from what I have been discovering, that might be the way to go. I've been farting around with "Site24x7" that a buddy recommended but I'm so not happy with SaaS. However it did find all my HDD's and individual CPU's etc. And as a bonus it actually had what I believe to be OID's? Or at least partial OID's?

Disk Drive Bay 1 Drive 12 --- 0.26.1.140
Disk Drive Bay 1 Drive 13 --- 0.26.1.141

Processor 1 Status 0 --- 0.3.1.96
Processor 1 Presence 1 --- 0.3.1.80

Not sure if that helps at all....
gsmith
Posts: 1253
Joined: Tue Mar 02, 2021 11:15 am

Re: Dell R910

Post by gsmith »

Hi

Sounds like you installed the VMWare SDK. Please take a look at this:
https://assets.nagios.com/downloads/nag ... ios-XI.pdf

Did you follow these instructions?

If not please do so.

To find out what version of Linux Nagios is running on please enter this command on a Linux
CLI (command line interface):

Code: Select all

sudo cat /etc/os-release
This will let us know which SDK we need installed (page 3 of the above document)

It's all Linux stuff so let me know if you need help with any of it.

Thanks
Scratchmang
Posts: 7
Joined: Thu Nov 04, 2021 8:05 pm

Re: Dell R910

Post by Scratchmang »

Hey gsmith,

So I followed the instructions as per your recommendation. It appears to be working, but it's not what I'm trying to monitor. That VMWare plugin gives me an overview of the Host itself, so things like "Overall" CPU Usage and "Overall" Datastore usage, which although useful, as I am the only one who actually adds VM's to the host it's not really all that useful to me.

What I am more interested in is monitoring the Hardware that resides below the OS Level, so the state of individual Drives, DIMM's and Physical HDD's

What I really want to see is something similar this (If this makes sense)

Physical Hard Drives
drives.png
Processors
Processors.png
So everything at a hardware level, from Power supplies, memory, Fans the works really, like this...
All-Hardware.png
As to the OS Version....

NAME="CentOS Linux"
VERSION="7 (Core)"
ID="centos"
ID_LIKE="rhel fedora"
VERSION_ID="7"
PRETTY_NAME="CentOS Linux 7 (Core)"
ANSI_COLOR="0;31"
CPE_NAME="cpe:/o:centos:centos:7"
HOME_URL="https://www.centos.org/"
BUG_REPORT_URL="https://bugs.centos.org/"

CENTOS_MANTISBT_PROJECT="CentOS-7"
CENTOS_MANTISBT_PROJECT_VERSION="7"
REDHAT_SUPPORT_PRODUCT="centos"
REDHAT_SUPPORT_PRODUCT_VERSION="7"
You do not have the required permissions to view the files attached to this post.
gsmith
Posts: 1253
Joined: Tue Mar 02, 2021 11:15 am

Re: Dell R910

Post by gsmith »

Hi

OK, then it looks like it's going to be snmp ;)

Unzip the mib files on your local machine (the machine you run your web browser from to connect to
the Nagios gui.

Dell-OM-MIBS-850_A00.zip (MIB's)
DCMIB65.zip (Another set of MIB's)

Access Nagios via the web browser.
Follow this video to load the mib files:
import_mib.zip
Next we'll use the snmpwalk wizard to access 192.168.42.127, which should give us a ton of
metrics on the physical machine (Dell R910, ESXi Server). Follow this video:
snmpwalk.zip
When you get through that let me know what results you get please.

Thanks
You do not have the required permissions to view the files attached to this post.
Scratchmang
Posts: 7
Joined: Thu Nov 04, 2021 8:05 pm

Re: Dell R910

Post by Scratchmang »

Sorry for the long over due reply...

SNMP Walk on 192.168.42.127 - Returns nothing.
SNMP Walk on 192.168.42.128 - Returns nothing.

SNMP Walk on 192.168.42.129 returns about 11 lines of "DELL-RAC-MIB" such as the following...

String
DELL-RAC-MIB
drsProductShortName.0
STRING "iDRAC6"
gsmith
Posts: 1253
Joined: Tue Mar 02, 2021 11:15 am

Re: Dell R910

Post by gsmith »

Hi

From the Nagios server please use the CLI to run:

Code: Select all

nmap  192.168.42.127 -sU -p 161
nmap  192.168.42.128 -sU -p 161
Please send me the outout.

Thanks
Scratchmang
Posts: 7
Joined: Thu Nov 04, 2021 8:05 pm

Re: Dell R910

Post by Scratchmang »

[root@localhost ~]# nmap 192.168.42.127 -sU -p 161

Starting Nmap 6.47 ( http://nmap.org ) at 2021-11-23 08:48 MST
Nmap scan report for ******.******.*** (192.168.42.127)
Host is up (0.00048s latency).
PORT STATE SERVICE
161/udp closed snmp
MAC Address: D4:AE:52:78:7C:61 (Dell)

============================

OK, This is strange to say the least, as when I check from another machine (Ubuntu on the same subnet) I get this as a result.....

nmap -Pn 192.168.42.127 -sU -p 161

Starting Nmap 7.60 ( https://nmap.org ) at 2021-11-23 15:51 UTC
Nmap scan report for 192.168.42.127
Host is up.

PORT STATE SERVICE
161/udp open|filtered snmp

Nmap done: 1 IP address (1 host up) scanned in 2.16 seconds


So odd as it seems, NAGIOS sees the SNMP Port as closed, where other devices on the same subnet see it as open?


====================================================================================
====================================================================================


Nmap done: 1 IP address (1 host up) scanned in 6.57 seconds
[root@localhost ~]# nmap 192.168.42.128 -sU -p 161

Starting Nmap 6.47 ( http://nmap.org ) at 2021-11-23 08:48 MST
Nmap scan report for ******.******.*** (192.168.42.128)
Host is up (0.00058s latency).
PORT STATE SERVICE
161/udp open|filtered snmp
MAC Address: 00:0C:29:46:0C:A1 (VMware)

Nmap done: 1 IP address (1 host up) scanned in 6.78 seconds
gsmith
Posts: 1253
Joined: Tue Mar 02, 2021 11:15 am

Re: Dell R910

Post by gsmith »

Hi

On 192.168.42.127 can you check the SNMP configuration? It may be only
allowing connections from certain hosts


Thanks
Locked