ESX3 CRITICAL - VMware machine "VMXX" is not running

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
strmon
Posts: 31
Joined: Mon Dec 26, 2016 3:42 am

ESX3 CRITICAL - VMware machine "VMXX" is not running

Post by strmon »

Hi support,

After successful integration of ESX{i} host using vmware wizard we are trying now to monitor all VMS running under that particular hosts. we are facing strange response :

for some we are getting below error:-
RES1:
ESX3 CRITICAL - HOST-VM CPU Unknown error
But for some we are getting:
RES2:
ESX3 CRITICAL - VMware machine "VMXX" is not running. Current state is "poweredOff"

My Q is: how RES1 differs with RES2?
: RES2 should be in state "OK" as machine is powered off.
:How to fix both?
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Re: ESX3 CRITICAL - VMware machine "VMXX" is not running

Post by rkennedy »

We'll need a few things to look further into this.
- Could you show us a screenshot of the exact errors?
- Also post your service definitions for the affected ones?
- We'll also need the command definition associated with the check_commands.
Former Nagios Employee
strmon
Posts: 31
Joined: Mon Dec 26, 2016 3:42 am

Re: ESX3 CRITICAL - VMware machine "VMXX" is not running

Post by strmon »

Details Against error: UNKNOWN ERROR:
Attachment file:VM-UNKNOWN.pNG
define command {
command_name check_esx3_guest
command_line $USER1$/check_esx3.pl -t 60 -H "$HOSTADDRESS$" -f "$ARG1$" -N "$ARG2$" -l "$ARG3$" $ARG4$
}

ARG1:/usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt
ARG2:CDOT8.3.1-1
ARG3:CPU

Details Against error: Critical error:
Attachment file:Critical-VMS.PNG
define command {
command_name check_esx3_guest
command_line $USER1$/check_esx3.pl -t 60 -H "$HOSTADDRESS$" -f "$ARG1$" -N "$ARG2$" -l "$ARG3$" $ARG4$
}
ARG1:/usr/local/nagiosxi/etc/components/vmware/usmmisadm002_auth.txt
ARG2:usmmisadm009
ARG3:CPU
You do not have the required permissions to view the files attached to this post.
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: ESX3 CRITICAL - VMware machine "VMXX" is not running

Post by tgriep »

This message
ESX3 CRITICAL - VMware machine "VMXX" is not running. Current state is "poweredOff"
If the guest if powered off, the check will return a Critical error which is normal.
There is not an option to ignore Powered Off guests so for now, you would have to stop checking it.

Can I have you run the command in verbose mode for the unknown error so we can see if we get more details on the issue?
Login to the XI server, run the following commands and post the output.

Code: Select all

/usr/local/nagios/libexec/check_esx3.pl -t 60 -H xxx.xxx.xxx.xxx -f "/usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt" -N "CDOT8.3.1-1" -l "CPU" -v
/usr/local/nagios/libexec/check_esx3.pl --help
Be sure to check out our Knowledgebase for helpful articles and solutions!
strmon
Posts: 31
Joined: Mon Dec 26, 2016 3:42 am

Re: ESX3 CRITICAL - VMware machine "VMXX" is not running

Post by strmon »

Here we got the O/p :-
/usr/local/nagios/libexec/check_esx3.pl -t 60 -H xxx.xxx.xxx.xxx -f "/usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt" -N "CDOT8.3.1-1" -l "CPU" -v
ESX3 OK - "CDOT8.3.1-1" cpu usage=124.00 MHz(2.39%) wait=18883.00 ms | cpu_usagemhz=124.00Mhz;; cpu_usage=2.39%;; cpu_wait=18883.00ms;;
Strange thing is: i am getting (+)ve o/p even when i try same command from "core config manager>>service>>test_command" but failed if i do "Force check immediately".

Just to mention: currently i am running out of license and integrated numb of nodes are more than the license limit. Not sure if its making any efffet?

Attaching all definition snapshots again.
Command_Deiniation.PNG
Test_cmdOutput.PNG
Service_definiation.PNG
You do not have the required permissions to view the files attached to this post.
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: ESX3 CRITICAL - VMware machine "VMXX" is not running

Post by tgriep »

The licensing issue could be it. It may not be running the check as it is over the limit.
Also, I think the command I had you run was for the wrong guest. Try this one from a command prompt and see if this one fails.

Code: Select all

/usr/local/nagios/libexec/check_esx3.pl -t 60 -H xxx.xxx.xxx.xxx -f "/usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt" -N "CDOT8.3.1-2" -l "CPU" -v
Be sure to check out our Knowledgebase for helpful articles and solutions!
strmon
Posts: 31
Joined: Mon Dec 26, 2016 3:42 am

Re: ESX3 CRITICAL - VMware machine "VMXX" is not running

Post by strmon »

Yes, it result error now:

/usr/local/nagios/libexec/check_esx3.pl -t 60 -H xxx.xxx.xxx.xxx -f "/usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt" -N "CDOT8.3.1-2" -l "CPU" -v
ESX3 CRITICAL - HOST-VM CPU Unknown error
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: ESX3 CRITICAL - VMware machine "VMXX" is not running

Post by tgriep »

Can you run the following and post the output so we can see what version of the check_esx3.pl plugin you are running?

Code: Select all

/usr/local/nagios/libexec/check_esx3.pl -V
Also, can you describe any of the differences between a guest that works and one that doesn't?
Be sure to check out our Knowledgebase for helpful articles and solutions!
strmon
Posts: 31
Joined: Mon Dec 26, 2016 3:42 am

Re: ESX3 CRITICAL - VMware machine "VMXX" is not running

Post by strmon »

/usr/local/nagios/libexec/check_esx3.pl -V
check_esx3.pl 0.2.1
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: ESX3 CRITICAL - VMware machine "VMXX" is not running

Post by tgriep »

You could try and upgrade to a newer version of the plugin to see if that fixes the issue. I have heard reports that after the upgrade, this issue was resolved but it is not guaranteed in all combinations.

To upgrade the plugin, follow these instrustions.

First, login to the XI server as root and make a backup of the existing one by running the following.

Code: Select all

cp /usr/local/nagios/libexec/check_esx3.pl /usr/local/nagios/libexec/check_esx3.pl.old
Then run the following to install a required Perl Module for the new plugin.

Code: Select all

yum install perl-Nagios-Plugin
Then download this updated plugin to your PC.
https://github.com/shinken-monitoring/p ... ck_esx3.pl

After it is downloaded, login to the XI GUI and go to the Admin > Manage Plugins menu
Browse to the new plugin and then click on the Upload Button.

Wait for the next check to run in the XI GUI to see if this fixes the issue or you can run the command manually.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Locked