Problem of fault alert

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
larry.lo
Posts: 24
Joined: Tue May 17, 2016 10:28 pm

Problem of fault alert

Post by larry.lo »

Hi Support,

We found some false alerts on our hosts; these alerts repeat events that had already been reported earlier.
This is a critical bug in Nagios; could you please advise how we can fix it?

For example:
The 1st alert (original) was raised on 2016/08/09, but the 2nd and 3rd alerts (false) were raised on 2016/08/19 and 2016/11/28.
These alerts are identical and should appear only once; please see the following screenshot.
FaultAlert.png
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Re: Problem of fault alert

Post by rkennedy »

I believe this is an issue with how the plugin is configured. Could you please post the service definition for us to look at?
Former Nagios Employee
gormank
Posts: 1114
Joined: Tue Dec 02, 2014 12:00 pm

Re: Problem of fault alert

Post by gormank »

He's looking at the state history report, which is a list of states over time.
avandemore
Posts: 1597
Joined: Tue Sep 27, 2016 4:57 pm

Re: Problem of fault alert

Post by avandemore »

To be a bit more pedantic, the State History Report is a list of state transitions.

The state of that service obviously changed over time, so we'd like to see the service definition. This will tell us, directly or indirectly, what can cause the object to change state.

Roughly the same type of info can be accessed with http://yournagios/nagios/jsonquery.html > Archive > Statechangelist.
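
For anyone who wants to pull the same state change list from a script rather than the query page, here is a minimal sketch in Python that only builds the request URL. It assumes the JSON query page is backed by `archivejson.cgi` under `cgi-bin` and that the parameter names below match what the page generates; verify both against the URL that jsonquery.html shows for your own install. The host name and service description are hypothetical placeholders.

```python
from datetime import datetime, timezone
from urllib.parse import urlencode


def statechangelist_url(base_url, hostname, service, start, end):
    """Build a statechangelist query URL for a single service.

    Assumption: the endpoint path and parameter names follow the
    Nagios Core JSON CGI conventions; check them on your own system.
    """
    params = {
        "query": "statechangelist",
        "objecttype": "service",
        "hostname": hostname,
        "servicedescription": service,
        # The CGI is assumed to take Unix timestamps for the time window.
        "starttime": int(start.timestamp()),
        "endtime": int(end.timestamp()),
    }
    return f"{base_url.rstrip('/')}/cgi-bin/archivejson.cgi?{urlencode(params)}"


# Hypothetical host and service names; the window covers the dates in this thread.
url = statechangelist_url(
    "http://yournagios/nagios",
    "vcs-host-01",
    "VCS engine log",
    datetime(2016, 8, 1, tzinfo=timezone.utc),
    datetime(2016, 12, 1, tzinfo=timezone.utc),
)
print(url)
```

Fetching that URL would still need valid Nagios credentials; the sketch stops at URL construction so it can be checked without a live server.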
Previous Nagios employee
larry.lo
Posts: 24
Joined: Tue May 17, 2016 10:28 pm

Re: Problem of fault alert

Post by larry.lo »

Hi all,
Our settings are below, thanks.

/usr/local/nagios/etc/nrpe/common.cfg

Code: Select all

command[check_log_vcs]=/usr/local/nagios/libexec/check_logfiles -f /usr/local/nagios/etc/check_log_vcs.cfg

/usr/local/nagios/etc/check_log_vcs.cfg

Code: Select all

$options = 'report=long, maxlength=65535';
@searches = ({
    tag => '/var/VRTSvcs/log/engine_A.log',
    logfile => '/var/VRTSvcs/log/engine_A.log',
    criticalpatterns => [
        'VCS WARNING',
        'VCS NOTICE',
        'VCS ERROR',
        'VCS CRITICAL',
    ],
    criticalexceptions => [
        'ID 915785 daemon.error',
        'resource control assignment failed',
    ],
    warningpatterns => [
        'WARN',
        'Warn',
        'warn',
        'NOTICE',
        'Notice',
        'notice',
    ],
    warningexceptions => [
        'ID 627629 daemon.notice',
    ],
    options => 'noprotocol, noperfdata, sticky=1800',
  },
);
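
As a reading aid for the configuration above, the following Python sketch models how the pattern and exception lists might classify a single log line. It is a simplified model, not check_logfiles' actual code: it assumes a line counts at a level when it matches any pattern of that level and none of that level's exceptions, and it reports only the worst level reached. The sample log lines are made up for illustration.

```python
import re

# Pattern lists copied from check_log_vcs.cfg above.
CRITICAL_PATTERNS = ["VCS WARNING", "VCS NOTICE", "VCS ERROR", "VCS CRITICAL"]
CRITICAL_EXCEPTIONS = ["ID 915785 daemon.error", "resource control assignment failed"]
WARNING_PATTERNS = ["WARN", "Warn", "warn", "NOTICE", "Notice", "notice"]
WARNING_EXCEPTIONS = ["ID 627629 daemon.notice"]


def _hits(line, patterns, exceptions):
    """True if the line matches a pattern and no exception (assumed semantics)."""
    matched = any(re.search(p, line) for p in patterns)
    excepted = any(re.search(e, line) for e in exceptions)
    return matched and not excepted


def worst_level(line):
    """Return the worst level this simplified model assigns to one line."""
    if _hits(line, CRITICAL_PATTERNS, CRITICAL_EXCEPTIONS):
        return "CRITICAL"
    if _hits(line, WARNING_PATTERNS, WARNING_EXCEPTIONS):
        return "WARNING"
    return "OK"


# Made-up sample lines, not taken from a real engine_A.log.
samples = [
    "VCS NOTICE example message",
    "VCS ERROR resource control assignment failed",
    "disk Warning example",
    "VCS INFO example message",
]
for line in samples:
    print(worst_level(line), "|", line)
```

Under this reading, any line containing `VCS NOTICE` or `VCS WARNING` is reported as CRITICAL rather than WARNING, because both strings appear in the critical list.
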
avandemore
Posts: 1597
Joined: Tue Sep 27, 2016 4:57 pm

Re: Problem of fault alert

Post by avandemore »

Let's start with some basics. Nagios doesn't provide technical support for 3rd party products; all we can do is verify that Nagios itself is working correctly.

The plugin you are using is not a Nagios plugin. It's provided by this company:

https://labs.consol.de/nagios/check_logfiles/index.html

This is the entity that wrote and maintains the plugin, and it is the best place to seek support. We can try to help with it, but you haven't provided the service definition. It would look something like the example from their site:

Code: Select all

check_logfiles --tag=ssh --logfile=/var/adm/messages \
     --rotation SOLARIS \
     --criticalpattern 'Failed password for root'

This is an example of a service definition:
service_def.png
Previous Nagios employee