Page 1 of 1
Problem of fault alert
Posted: Tue Nov 29, 2016 6:20 am
by larry.lo
Hi Support,
We found that there are some fault alert appeared in our hosts, these fault alert were happened before.
It is a critical bug in Nagios, could you please advise the solution for us to fix it?
For example:
1st alert (original) was found on 2016/08/09, but 2nd & 3rd alert (fault) were found on 2016/08/19 and 2016/11/28.
These alerts are same and should be appear once, please see the following screenshot.
FaultAlert.png
Re: Problem of fault alert
Posted: Tue Nov 29, 2016 11:40 am
by rkennedy
I believe this is an issue with how the plugin is configured, could you please post the service definition for us to look at?
Re: Problem of fault alert
Posted: Tue Nov 29, 2016 12:00 pm
by gormank
He's looking at the state history report, which is a list of states over time.
Re: Problem of fault alert
Posted: Tue Nov 29, 2016 3:51 pm
by avandemore
To be a bit more pedantic, the State History Report is a list of state transitions.
The state of that service obviously changed over time, so we'd like to see the service definition. This will tell us directly or indirectly on what can cause the object to change state.
Roughly the same type of info can be accessed with
http://yournagios/nagios/jsonquery.html > Archive > Statechangelist.
Re: Problem of fault alert
Posted: Tue Nov 29, 2016 9:29 pm
by larry.lo
Hi all,
Our settings as below, thanks.
/usr/local/nagios/etc/nrpe/common.cfg
Code: Select all
command[check_log_vcs]=/usr/local/nagios/libexec/check_logfiles -f /usr/local/nagios/etc/check_log_vcs.cfg
/usr/local/nagios/etc/check_log_vcs.cfg
Code: Select all
$options = 'report=long, maxlength=65535';
@searches = ({
tag => '/var/VRTSvcs/log/engine_A.log',
logfile => '/var/VRTSvcs/log/engine_A.log',
criticalpatterns => [
'VCS WARNING',
'VCS NOTICE',
'VCS ERROR',
'VCS CRITICAL',
],
criticalexceptions => [
'ID 915785 daemon.error',
'resource control assignment failed',
],
warningpatterns => [
'WARN',
'Warn',
'warn',
'NOTICE',
'Notice',
'notice',
],
warningexceptions => [
'ID 627629 daemon.notice',
],
options => 'noprotocol, noperfdata, sticky=1800',
},
);
Re: Problem of fault alert
Posted: Wed Nov 30, 2016 2:49 pm
by avandemore
Let's start with some basics. Nagios doesn't provide technical support for 3rd party products, all we can do is verify Nagios itself is working correctly.
The plugin you are using is not a Nagios plugin. It's provided by this company:
https://labs.consol.de/nagios/check_logfiles/index.html
This is the entity who wrote and maintains the plugin and the best place to seek support. We can try to help with it, but you haven't provided the service definition. It would look something like the example from their site:
Code: Select all
check_logfiles --tag=ssh --logfile=/var/adm/messages \
--rotation SOLARIS \
--criticalpattern 'Failed password for root'
This is an example of a service definition:
service_def.png