Page 1 of 1

Inconsistant results from check_esx3_host

Posted: Fri Nov 15, 2019 12:46 pm
by Matthew.Cary
I have multiple VMWare hosts being monitored by Nagios XI & for various reasons I am monitoring the same datastore from each of them.

I've noticed that the same datastore, when checked from different hosts, is showing wildly different statistics.

VMWare Host 1:
[NAGIOSHOST ~]$ /usr/local/nagios/libexec/check_esx3.pl -H "prdoracle-esxi1.company.com" -f "/usr/local/nagiosxi/etc/components/vmware/esxi_6_auth.txt" -l "VMFS" -w 15%: -c 5%:
CHECK_ESX3.PL WARNING - Storages : 'ntap_n2_1200_SAS_ORACLE_01'(free)=438330.21 MB (6.97%), 'ntap_n1_SATA_iso'(free)=897752.47 MB (85.62%) | ntap_n2_1200_SAS_ORACLE_01=6.97%;15:;5: ntap_n1_SATA_iso=85.62%;15:;5:

[NAGIOSHOST ~]$ /usr/local/nagios/libexec/check_esx3.pl -H "prdoracle-esxi2.company.com" -f "/usr/local/nagiosxi/etc/components/vmware/esxi_6_auth.txt" -l "VMFS" -w 15%: -c 5%:
CHECK_ESX3.PL OK - Storages : 'ntap_n2_1200_SAS_ORACLE_01'(free)=2437531.50 MB (29.06%), 'ntap_n1_SATA_iso'(free)=907882.21 MB (86.58%) | ntap_n2_1200_SAS_ORACLE_01=29.06%;15:;5: ntap_n1_SATA_iso=86.58%;15:;5:

These are the same host, taken seconds apart, and they don't even agree on the total size of the volume.
If I look at the volume using VSPHERE each host reports the same space in use/free.

What am I missing?

Re: Inconsistant results from check_esx3_host

Posted: Fri Nov 15, 2019 3:56 pm
by mbellerue
That is very interesting. I will have to try to replicate that on our side. I'm not sure if it matters, but just in case, how is the storage hooked up to the ESXi servers? Is it like an FC connection, or iSCSI or something?

Re: Inconsistant results from check_esx3_host

Posted: Mon Nov 18, 2019 10:33 am
by Matthew.Cary
These are Netapp Luns connected to the ESXI Hosts via twinax/fiber but it’s a nfs datastore

Re: Inconsistant results from check_esx3_host

Posted: Mon Nov 18, 2019 5:24 pm
by ssax
Could you be hitting this?

Code: Select all

https://kb.vmware.com/s/article/52985
The only other thing I can think of is different block size on each system:

What is the output of this command on both systems?

Code: Select all

fdisk -l
What is the exact version of the plugin you are running?

Code: Select all

/usr/local/nagios/libexec/check_esx3.pl -V