Page 1 of 1

Services showing alert in state history

Posted: Thu Jan 03, 2019 11:54 am
by manimurugesan
Hello Team,

we are facing one issue on new version i.e nagiosxi 5.5.5

If host went down and it came up next few minits but all the services again showing ok with hard alert at 1 of 5 but all the services working fine only it's not showing any alert previously host only went down but why all the services showing ok with hard alert in state history ?

For example A is host name and it has 50 services if host went down at 5'o clock and it came up at 5:10 but after that all the 50 services started showing OK with hard alert at 1 of 5 attempt in state history.

Could you please help us to understand ?

Re: Services showing alert in state history

Posted: Thu Jan 03, 2019 5:58 pm
by npolovenko
Hello, @manimurugesan.

If a host went down and came back up before the service check was due, then the service check will stay in a hard OK state.
50 services started showing OK with hard alert at 1 of 5 attempt in state history
That just means that all services were in OK state, it's normal.

Does this answer your question? If not please provide some screenshots to help us better understand the problem.

Thank you.

Re: Services showing alert in state history

Posted: Fri Jan 04, 2019 12:37 pm
by manimurugesan
This is happening in only new version nagiosxi 5.5.5. I never see this in older version.

Could you please confirm this functionality has been changed ?

Re: Services showing alert in state history

Posted: Fri Jan 04, 2019 1:22 pm
by npolovenko
@manimurugesan, I believe this functionality has been around for a while. But I'm not 100% sure if we're talking about the same thing. Host checks and service checks can have different check intervals. For example, if a host check has a retry interval of 1 minute and its service checks have a retry interval of 1 hour, then if that host goes down and recovers within 2 minutes none of the checks will execute during that time and so they'll stay in OK state. Was it different for you before the upgrade? Would checks go Critical immediately when the host goes critical?
Could you attach a state history report? That way I can take a look at the state changes.
Also, please post the host and service checks definitions.

Re: Services showing alert in state history

Posted: Tue Jan 08, 2019 12:41 pm
by manimurugesan
Hello

Please find the state history and host checks, service checks
Still we are receiving Ok hard alert for all services .

Please clarify about this issue

Re: Services showing alert in state history

Posted: Tue Jan 08, 2019 4:30 pm
by npolovenko
@manimurugesan, I see. Looks like there was a bug that we fixed in XI 5.5.7.
5.5.7
==================
- Fixed Core issue (#572) causing service recovery emails to be sent when an initial notification wasn't sent.
You can upgrade now or wait a few more weeks for the 5.5.8 release. It will have this plus some more bug fixes for the latest Core.

Re: Services showing alert in state history

Posted: Fri Feb 15, 2019 12:05 pm
by manimurugesan
Hello ,


Recently i have upgraded to Nagiosxi 5.5.9 but still i am facing below issue and 5.5.5 also i have checked

1. all the services getting hard alert at 1 of 5 attempt
2. but previous version 5.4.13 i was used in that version hard alert will come at 5 of 5
3.previous post you have replied next version this issue will fix but even i tried 5.5.9 the same came again
4. please confirm if host alert is fluctuating or it's went down means all the services getting hard at 1 of 5 ?
In Nagiosxi 5.5.8 version this will resolve ?

let me know which version will fix this issue ?

Re: Services showing alert in state history

Posted: Fri Feb 15, 2019 2:30 pm
by npolovenko
@manimurugesan
1. all the services getting hard alert at 1 of 5 attempt
Was the host already down when services went to HARD 1 of 5 right away? If so, services will skip the soft state count 1/5, 2/5, 3/5, etc and go to the HARD 1/5 state right away.
3.previous post you have replied next version this issue will fix but even i tried 5.5.9 the same came again
4. please confirm if host alert is fluctuating or it's went down means all the services getting hard at 1 of 5 ?
This is the expected behavior. That way if a host is down Nagios won't spend resources on checking its services. The logic is that if a server(host) is down there is no need to check its services, such as CPU, memory, etc. We assume that these services will be down as well because the server is offline. That's why nagios puts these services in a HARD state right away, skipping soft states and retries.

5.5.9 fixed the issue where services will send email notifications when the host is down.

Please let me know if there is something I didn't cover.

Re: Services showing alert in state history

Posted: Wed Feb 20, 2019 12:05 pm
by manimurugesan
Hello ,

Is there any way to copy the only acknowledgements(schedule downtime,recurring downtime)
Because right now we have 2 server's one primary and secondary

Actually issue is we have enabled snmp trap sender so two server's has to be in same page .
actually primary we are using nagiosxi 5.5.5 version but secondary server we have recently upgraded to 5.5.8
So please let me know how to copy the all acknowledgements from primary to secondary ?

Re: Services showing alert in state history

Posted: Thu Feb 21, 2019 5:20 pm
by npolovenko
@manimurugesan, This is a bit different question and may need a separate thread. Scheduled downtime is stored in the retention.dat file as well as in the database so there is no easy way to transfer it to the secondary server.
Recurring downtime is stored in the recurring downtime file and you can copy the contents of that file and paste into the same file on the secondary server.
/usr/local/nagios/etc/recurringdowntime.cfg
Are your primary and secondary server checking same hosts and services? It may be easier to upgrade the primary server to 5.5.8 and then use the backup-restore.