Alerts not clearing correctly

Posted: Thu May 30, 2013 8:34 am
by isadmin
Our EMC CX4 alerted the other day, but when I checked there were no alerts on the SAN.
The status shows everything OK except SPS faulted on both SPs.
If I run the check manually I receive:
./check_emc_clariion.pl -H [ip address] -u ******* -p ****** -t faults
The array is operating normally.
I have deleted the files from the tmp directory and rebooted, but nothing has changed in the web interface under ALL SERVICE PROBLEMS.
Thanks

Re: Alerts not clearing correctly

Posted: Thu May 30, 2013 12:06 pm
by isadmin
I reviewed the SPSs and they are functioning normally.

Re: Alerts not clearing correctly

Posted: Thu May 30, 2013 1:08 pm
by abrist
What is the output for the check on the service details page in XI?

Re: Alerts not clearing correctly

Posted: Thu May 30, 2013 1:41 pm
by isadmin
Service detail page results

Critical
SP A Present,Power ok,Power ok,SPS failed,Cabling ok

But in Unisphere on the SAN, under Hardware, the SPSs have no alerts and are working correctly.

Re: Alerts not clearing correctly

Posted: Thu May 30, 2013 2:17 pm
by isadmin
EMC does run SPS failure checks every weekend, in which the SPSs are bounced to test them. That is what triggered the original alerts, but it has done this before without the alerting hanging.

Re: Alerts not clearing correctly

Posted: Thu May 30, 2013 2:49 pm
by abrist
This is strange. What does the command you configured in XI look like?

Re: Alerts not clearing correctly

Posted: Fri May 31, 2013 9:14 am
by isadmin
check_emc_clariion!$HOSTADDRESS$!-u *******!-p **********!-t sp!--sp A!!!

Re: Alerts not clearing correctly

Posted: Fri May 31, 2013 9:18 am
by isadmin
Results from the command line:
./check_emc_clariion.pl -H IPADDRESS -u ******* -p ******** -t sp --sp A
SP A Present,Power ok,SPS ok,Cabling ok
Something on the web interface side is not updating correctly.
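
Editor's note: the status line on the service detail page and the one from the command line differ in more than the SPS state. The web line also has one extra "Power ok" field. A minimal Python sketch for comparing two such lines field by field is below. The function names are made up for illustration, and it assumes the plugin output is a plain comma-separated list, as in the two lines quoted in this thread.

```python
from collections import Counter


def status_fields(line):
    """Split a comma-separated status line into trimmed, non-empty fields."""
    return [field.strip() for field in line.split(",") if field.strip()]


def compare_status(web_line, cli_line):
    """Return (fields only in web_line, fields only in cli_line).

    Counter is used instead of set so that a repeated field
    (for example two "Power ok" entries) is not silently collapsed.
    """
    web = Counter(status_fields(web_line))
    cli = Counter(status_fields(cli_line))
    only_web = sorted((web - cli).elements())
    only_cli = sorted((cli - web).elements())
    return only_web, only_cli


if __name__ == "__main__":
    web = "SP A Present,Power ok,Power ok,SPS failed,Cabling ok"
    cli = "SP A Present,Power ok,SPS ok,Cabling ok"
    only_web, only_cli = compare_status(web, cli)
    print("only on web page:", only_web)
    print("only on command line:", only_cli)
```

Whether the extra field matters is not established in this thread. The sketch only makes the difference easy to see.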

Re: Alerts not clearing correctly

Posted: Fri May 31, 2013 9:46 am
by abrist
How big is this script? Could you post it? It seems really odd that only 1 of the 4 or so checks is behaving differently. Is the SPS check the only one that requires a password? Do you have any special, non-alphanumeric, nasty meta chars in your username or password?
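
Editor's note: one way a special character could matter here is that Nagios separates the command name and its arguments in a check_command definition with `!`, mapping the arguments to $ARG1$, $ARG2$, and so on. The Python sketch below only illustrates that split for a definition like the one posted earlier in this thread. It is not Nagios's own parser, the function name is hypothetical, USER and PASS are placeholders for the masked values, and escaped exclamation marks are not handled.

```python
def split_check_command(definition):
    """Split a check_command string into (command_name, {macro: value}).

    Illustration only: fields are separated on every "!" with no
    support for escaping.
    """
    name, *args = definition.split("!")
    # Arguments are numbered from 1: $ARG1$, $ARG2$, ...
    macros = {f"$ARG{i}$": value for i, value in enumerate(args, start=1)}
    return name, macros


if __name__ == "__main__":
    # USER and PASS stand in for the masked credentials.
    line = "check_emc_clariion!$HOSTADDRESS$!-u USER!-p PASS!-t sp!--sp A!!!"
    name, macros = split_check_command(line)
    print(name)
    for macro, value in macros.items():
        print(macro, repr(value))
```

With this naive split, a `!` inside a password would end the argument early and shift everything after it into the next $ARGn$ slot.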

Re: Alerts not clearing correctly

Posted: Fri May 31, 2013 10:13 am
by isadmin
No username or password issue, and I had tested this script for a few months with Troy ([email protected]) now [email protected]
The script is working fine; it is the web portion of Nagios that is not accurately resetting/updating or getting the data.
I have even rebuilt the DB, thinking maybe there was an error.
The script is from the EMC Wizard and is too large to post.
Version : 2013-02-09
# Date : February 9th 2013