Page 1 of 1

In Maintenanve Window able to see Node down critical states

Posted: Fri Apr 17, 2020 1:10 pm
by lgaddam
Hi Team,

We got a request that to move 500 servers to Maintenance window for performing monthly patching activities on remote windows servers.
For this, we used "schedule Maintenance" option in Nagiosxi.

But here the issue is that, this option just mutes the Email alerting but still DOWN state of the node appears in Nagios console.We do not want these to be generated by Nagios.

Because our Command center team monitors "Latest Alerts" component 24*7.
They complained that they were able to see Dpwn alerts.

Even I tried changing Time Period option for some nodes and tested but still the DOWN Critical state appears in Nagiosxi.

Help me how to stop these down critical severity states in NagiosXI in the MAintenance window.

Re: In Maintenanve Window able to see Node down critical sta

Posted: Fri Apr 17, 2020 2:18 pm
by ssax
They should still show as "Is Being Handled" when it's in downtime or if it's acknowledged, is it not showing like that?

Did you schedule the host AND the services? You need to do both.

Re: In Maintenanve Window able to see Node down critical sta

Posted: Sat Apr 18, 2020 2:10 am
by lgaddam
In "Latest Alerts" components it wont show any status of the node.

Yes, we have chosen Hosts as well as its all services in Maintenance Window.

Could you please test it and let us know.

Manually to disable 500 nodes and enable 500 nodes becoming big task for us. Need to minimize this manual effort of work.

Re: In Maintenanve Window able to see Node down critical sta

Posted: Mon Apr 20, 2020 4:23 pm
by ssax
I did test it, what I'm saying is that the Latest Alerts component will show "Is Being Handled" next to hosts/services that are in downtime OR if they are just acknowledged. It will still list them, it just will say "Is Being Handled" like the one in downtime I have in the picture attached.
Capture.PNG
Are they not showing as being handled? I can submit a feature request to exclude downtime/acknowledged but it currently will show them (albeit with "being handled" addded to let you know it's either in downtime or acknowledged already).

Re: In Maintenanve Window able to see Node down critical sta

Posted: Tue Apr 21, 2020 1:01 am
by lgaddam
Thanks Sean.

Let me check this with few nodes putting them in Maintenance.
I have to take approval for few machines and do it, will update you.

Re: In Maintenanve Window able to see Node down critical sta

Posted: Tue Apr 21, 2020 2:15 pm
by ssax
Ok, we'll keep an eye out for your results.

Re: In Maintenanve Window able to see Node down critical sta

Posted: Mon May 11, 2020 9:41 am
by lgaddam
Hi Sean,

I have tested and did not worked out as you said. Not able to see "Being Handled" at Latest Alert component.
Need to provide the results but not in public. How to share the doc in private.

Re: In Maintenanve Window able to see Node down critical sta

Posted: Mon May 11, 2020 4:52 pm
by ssax
Please click PM to the right of one of my posts and send it that or you can create a ticket for this and include a link back to this forum thread and attach it to the ticket:

https://support.nagios.com/tickets/

Re: In Maintenanve Window able to see Node down critical sta

Posted: Wed May 13, 2020 1:34 am
by lgaddam
Thanks Sean.

I have done PM to you. Kindly check and help.

Re: In Maintenanve Window able to see Node down critical sta

Posted: Wed May 13, 2020 9:27 am
by ssax
What version of XI are you running?

What version of Core are you running?

Code: Select all

/usr/local/nagios/bin/nagios -V
The scheduled downtime screenshots show a start time of 18:45 to 22:29 so those alerts (showing from 16:XX) would not have been suppressed based on that downtime alone.

Another thing to remember is that only state changes are logged, if it's in the same state since a long time ago those alerts are not logged, see here:

Code: Select all

https://assets.nagios.com/downloads/nagioscore/docs/nagioscore/3/en/stalking.html
But that ping service definitely is showing it's in downtime so that should have shown that. Something must be wrong.

Please PM me a copy of your profile, include this file as well:

Code: Select all

/usr/local/nagios/var/status.dat
Or if you have a RAMDisk setup:

Code: Select all

/var/nagiosramdisk/status.dat