ESX3 CRITICAL - VMware machine "VMXX" is not running
Posted: Wed Dec 28, 2016 12:37 am
by strmon
Hi support,
After successfully integrating an ESX(i) host using the VMware wizard, we are now trying to monitor all VMs running on that host, and we are seeing strange responses.
For some VMs we get the error below:
RES1:
ESX3 CRITICAL - HOST-VM CPU Unknown error
But for others we get:
RES2:
ESX3 CRITICAL - VMware machine "VMXX" is not running. Current state is "poweredOff"
My questions are:
- How does RES1 differ from RES2?
- RES2 should be in state "OK", since the machine is powered off.
- How do we fix both?
Re: ESX3 CRITICAL - VMware machine "VMXX" is not running
Posted: Wed Dec 28, 2016 10:20 am
by rkennedy
We'll need a few things to look further into this.
- Could you show us a screenshot of the exact errors?
- Could you also post the service definitions for the affected services?
- We'll also need the command definition associated with the check_commands.
Re: ESX3 CRITICAL - VMware machine "VMXX" is not running
Posted: Fri Dec 30, 2016 1:57 am
by strmon
Details for the Unknown error:
Attachment file: VM-UNKNOWN.pNG
define command {
    command_name    check_esx3_guest
    command_line    $USER1$/check_esx3.pl -t 60 -H "$HOSTADDRESS$" -f "$ARG1$" -N "$ARG2$" -l "$ARG3$" $ARG4$
}
ARG1: /usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt
ARG2: CDOT8.3.1-1
ARG3: CPU
Details for the Critical error:
Attachment file: Critical-VMS.PNG
define command {
    command_name    check_esx3_guest
    command_line    $USER1$/check_esx3.pl -t 60 -H "$HOSTADDRESS$" -f "$ARG1$" -N "$ARG2$" -l "$ARG3$" $ARG4$
}
ARG1: /usr/local/nagiosxi/etc/components/vmware/usmmisadm002_auth.txt
ARG2: usmmisadm009
ARG3: CPU
Re: ESX3 CRITICAL - VMware machine "VMXX" is not running
Posted: Fri Dec 30, 2016 12:36 pm
by tgriep
Regarding this message:
ESX3 CRITICAL - VMware machine "VMXX" is not running. Current state is "poweredOff"
If the guest is powered off, the check returns a Critical error, which is normal.
There is no option to ignore powered-off guests, so for now you would have to stop checking it.
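Since the plugin has no switch for this, one possible workaround is a small wrapper that runs the check and remaps only the powered-off case to OK. The sketch below is an assumption, not something shipped with check_esx3.pl or Nagios XI: the function name `remap_powered_off` is hypothetical, and it relies on the message text quoted above staying the same.

```shell
#!/bin/sh
# Hypothetical wrapper (not part of Nagios XI or check_esx3.pl).
# Runs the check command passed as arguments and turns the "poweredOff"
# CRITICAL into OK; every other result is passed through unchanged.

remap_powered_off() {
    # Capture the plugin output (stdout and stderr) and its exit status.
    output=$("$@" 2>&1)
    status=$?

    case "$output" in
        *'Current state is "poweredOff"'*)
            # Assumption: match on the message text quoted in this thread.
            echo "OK - guest is powered off (remapped from CRITICAL)"
            return 0
            ;;
    esac

    # Anything else keeps its original text and exit code.
    echo "$output"
    return "$status"
}

# When called with arguments, act as a drop-in wrapper around the real check.
if [ "$#" -gt 0 ]; then
    remap_powered_off "$@"
    exit $?
fi
```

Saved under a name of your choosing, the script would take the full original check_esx3.pl command line as its arguments, for example from a copy of the `check_esx3_guest` command definition. The trade-off is that a guest that is powered off unexpectedly would no longer raise an alert.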
Can I have you run the command in verbose mode for the unknown error so we can see if we get more details on the issue?
Log in to the XI server, run the following commands and post the output.
Code:
/usr/local/nagios/libexec/check_esx3.pl -t 60 -H xxx.xxx.xxx.xxx -f "/usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt" -N "CDOT8.3.1-1" -l "CPU" -v
/usr/local/nagios/libexec/check_esx3.pl --help
Re: ESX3 CRITICAL - VMware machine "VMXX" is not running
Posted: Tue Jan 03, 2017 2:42 am
by strmon
Here is the output we got:
/usr/local/nagios/libexec/check_esx3.pl -t 60 -H xxx.xxx.xxx.xxx -f "/usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt" -N "CDOT8.3.1-1" -l "CPU" -v
ESX3 OK - "CDOT8.3.1-1" cpu usage=124.00 MHz(2.39%) wait=18883.00 ms | cpu_usagemhz=124.00Mhz;; cpu_usage=2.39%;; cpu_wait=18883.00ms;;
The strange thing is that I get a positive result even when I run the same command from "core config manager >> service >> test_command", but the check fails when I use "Force check immediately".
Just to mention: I am currently over my license limit, as the number of integrated nodes is higher than the license allows. I am not sure whether that has any effect.
Attaching all definition snapshots again.
Command_Deiniation.PNG
Test_cmdOutput.PNG
Service_definiation.PNG
Re: ESX3 CRITICAL - VMware machine "VMXX" is not running
Posted: Tue Jan 03, 2017 11:27 am
by tgriep
The licensing issue could be the cause. XI may not be running the check because the system is over the limit.
Also, I think the command I had you run was for the wrong guest. Try this one from a command prompt and see whether it fails.
Code:
/usr/local/nagios/libexec/check_esx3.pl -t 60 -H xxx.xxx.xxx.xxx -f "/usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt" -N "CDOT8.3.1-2" -l "CPU" -v
Re: ESX3 CRITICAL - VMware machine "VMXX" is not running
Posted: Wed Jan 04, 2017 5:35 am
by strmon
Yes, it returns an error now:
/usr/local/nagios/libexec/check_esx3.pl -t 60 -H xxx.xxx.xxx.xxx -f "/usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt" -N "CDOT8.3.1-2" -l "CPU" -v
ESX3 CRITICAL - HOST-VM CPU Unknown error
Re: ESX3 CRITICAL - VMware machine "VMXX" is not running
Posted: Wed Jan 04, 2017 11:46 am
by tgriep
Can you run the following and post the output so we can see what version of the check_esx3.pl plugin you are running?
Code:
/usr/local/nagios/libexec/check_esx3.pl -V
Also, can you describe any of the differences between a guest that works and one that doesn't?
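One way to compare a working and a failing guest side by side is to loop over the guest names with the same options and print each exit status. This is only a sketch: `check_guests` and `state_name` are hypothetical helper names, `PLUGIN`, `ESX_HOST` and `AUTH_FILE` are assumed variables, and the options are the ones already used in this thread.

```shell
#!/bin/sh
# Sketch: run the same CPU check against several guests and label each result.
# PLUGIN, ESX_HOST and AUTH_FILE are assumed variables; adjust to your setup.
PLUGIN=${PLUGIN:-/usr/local/nagios/libexec/check_esx3.pl}
ESX_HOST=${ESX_HOST:-xxx.xxx.xxx.xxx}
AUTH_FILE=${AUTH_FILE:-/usr/local/nagiosxi/etc/components/vmware/usmdisadm001_auth.txt}

# Map a standard Nagios plugin exit code to its state name.
state_name() {
    case "$1" in
        0) echo OK ;;
        1) echo WARNING ;;
        2) echo CRITICAL ;;
        3) echo UNKNOWN ;;
        *) echo "UNEXPECTED($1)" ;;
    esac
}

# Run the check once for every guest name given as an argument.
check_guests() {
    for guest in "$@"; do
        output=$("$PLUGIN" -t 60 -H "$ESX_HOST" -f "$AUTH_FILE" -N "$guest" -l "CPU" -v 2>&1)
        status=$?
        echo "== $guest: exit $status ($(state_name "$status"))"
        echo "$output"
    done
}

# Example call, using the two guest names from this thread:
# check_guests "CDOT8.3.1-1" "CDOT8.3.1-2"
```

Seeing the exit codes next to each other makes it easier to tell a plugin-side failure from a difference in how XI schedules the check.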
Re: ESX3 CRITICAL - VMware machine "VMXX" is not running
Posted: Thu Jan 05, 2017 3:23 am
by strmon
/usr/local/nagios/libexec/check_esx3.pl -V
check_esx3.pl 0.2.1
Re: ESX3 CRITICAL - VMware machine "VMXX" is not running
Posted: Thu Jan 05, 2017 12:02 pm
by tgriep
You could try upgrading to a newer version of the plugin to see if that fixes the issue. I have heard reports that this issue was resolved after the upgrade, but that is not guaranteed in all combinations.
To upgrade the plugin, follow these instructions.
First, log in to the XI server as root and make a backup of the existing plugin by running the following.
Code:
cp /usr/local/nagios/libexec/check_esx3.pl /usr/local/nagios/libexec/check_esx3.pl.old
Then run the following to install a required Perl Module for the new plugin.
Then download this updated plugin to your PC.
https://github.com/shinken-monitoring/p ... ck_esx3.pl
After it is downloaded, log in to the XI GUI and go to the Admin > Manage Plugins menu.
Browse to the new plugin and then click the Upload button.
Wait for the next check to run in the XI GUI to see if this fixes the issue or you can run the command manually.
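Before running the check manually, it may help to confirm that the uploaded file at least compiles, and to fall back to the backup if it does not. The following is a sketch under assumptions: `verify_or_restore` is a hypothetical helper, `PLUGIN` and `PERL` are assumed variables, and `perl -c` only compiles the script, so it catches a missing module loaded with `use` but says nothing about whether the check itself succeeds against the ESX host.

```shell
#!/bin/sh
# Sketch: compile-check the uploaded plugin and restore the backup if it
# does not compile. PLUGIN and PERL are assumed, overridable variables.
PLUGIN=${PLUGIN:-/usr/local/nagios/libexec/check_esx3.pl}
PERL=${PERL:-perl}

verify_or_restore() {
    plugin=$1
    backup=$plugin.old   # backup name used in the cp step above

    # perl -c compiles the script without running it; a missing module
    # pulled in with "use" makes the compile fail.
    if "$PERL" -c "$plugin" >/dev/null 2>&1; then
        echo "compile check passed: $plugin"
        return 0
    fi

    echo "compile check failed: $plugin"
    if [ -f "$backup" ]; then
        # -p preserves the mode and timestamps of the backup copy.
        cp -p "$backup" "$plugin" && echo "restored from $backup"
    fi
    return 1
}

# Example call:
# verify_or_restore "$PLUGIN"
```

If the compile check passes, the manual run is the same command line used earlier in the thread with `-v` added.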