nagios.cmd acknowledgements killing nagios service

Posted: Thu Jul 29, 2021 8:41 am
by optionstechnology
When I try to acknowledge an alert using Thruk, it kills the nagios box and writes the following to the log:

[1627565739] NDO-3: ndo_return = 1 (Commands out of sync; you can't run this command now)
[1627565739] NDO-3: ndo_get_object_id_name2(ndo.c:1312): Unable to store results
[1627565739] Caught SIGSEGV, shutting down...
[1627565744] Caught SIGTERM, shutting down...

Re: nagios.cmd acknowledgements killing nagios service

Posted: Thu Jul 29, 2021 11:29 am
by gsmith
Hi,

Is this repeatable? If so, next time you get an alert please try
to acknowledge it via the Nagios web gui and let me know what
the results are.

Thanks

Re: nagios.cmd acknowledgements killing nagios service

Posted: Thu Jul 29, 2021 11:40 am
by optionstechnology
Yeah, it happens every time I try to ack something via nagios.cmd.

I can acknowledge the alerts inside nagios ok - sometimes it can take up to ten minutes for the acknowledgement to actually show up in the service though... not sure if it's related to this issue.

Could it be something to do with ndo3? This is the only nagios server I have that has not been downgraded back to ndo2... and the only one that has this issue.

Re: nagios.cmd acknowledgements killing nagios service

Posted: Thu Jul 29, 2021 2:08 pm
by gsmith
Hi

Yeah, it seems like it is an NDO3 issue that Thruk is somehow triggering consistently.

I don't know Thruk, but I am guessing nagios.cmd is where you define a command to be run in Nagios?

If so could you share the command please?
What OS are you running?
I ask because I am going to try and replicate the error for my Dev guys.

As for your original problem go ahead and downgrade from NDO3 --> NDO2 and let me know if that solves it.

Thanks

Re: nagios.cmd acknowledgements killing nagios service

Posted: Thu Aug 19, 2021 9:40 am
by optionstechnology
The downgrade to ndo2 did fix it. I've tested on multiple servers and it seems to be the case everywhere.

Sorry - not sure of the command structure - I just know that this is the way Thruk sends its acknowledgements to the nagios back end. At least I think it does; I'll see if I can find out.

Re: nagios.cmd acknowledgements killing nagios service

Posted: Thu Aug 19, 2021 11:18 am
by gsmith
Hi,

Glad the downgrade worked.

I really appreciate you looking for the command Thruk is issuing. If it takes too much time it's
OK to bail on it.

Thanks!

Re: nagios.cmd acknowledgements killing nagios service

Posted: Fri Aug 20, 2021 3:47 am
by optionstechnology
The command is sent via livestatus -

e.g.

[external_command][user][token] [nagserver] cmd: COMMAND [1629447329] SCHEDULE_SVC_DOWNTIME;hostname;Servicename;1629447300;1668597300;1;0;7200;user;comment
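
For anyone trying to reproduce this, here is a rough Python sketch of how a line like the one above breaks down, and how an acknowledgement might be built in the same shape. The function names are made up for illustration, and the ACKNOWLEDGE_SVC_PROBLEM argument order (host;service;sticky;notify;persistent;author;comment) and the values 1;1;0 are assumptions taken from the Nagios Core external command docs as I understand them, not something captured from Thruk. Please check against your own logs before relying on it.

```python
def format_external_command(name, args, timestamp):
    """Build a Nagios external command line: [epoch] NAME;arg1;arg2;...

    Hypothetical helper for illustration only.
    """
    fields = [name] + [str(a) for a in args]
    return "[{}] {}".format(int(timestamp), ";".join(fields))


def parse_external_command(line):
    """Split '[epoch] NAME;arg1;...' into (timestamp, name, args).

    Naive split on ';', so a comment containing ';' would be broken up.
    """
    ts_part, _, rest = line.strip().partition("] ")
    timestamp = int(ts_part.lstrip("["))
    name, *args = rest.split(";")
    return timestamp, name, args


# The downtime command from the log line above, without the Thruk prefix.
logged = ("[1629447329] SCHEDULE_SVC_DOWNTIME;hostname;Servicename;"
          "1629447300;1668597300;1;0;7200;user;comment")
timestamp, name, args = parse_external_command(logged)

# An acknowledgement in the same format (argument values are assumptions).
ack = format_external_command(
    "ACKNOWLEDGE_SVC_PROBLEM",
    ["hostname", "Servicename", 1, 1, 0, "user", "comment"],
    1629447329,
)

# Livestatus takes external commands as "COMMAND <line>" plus a newline.
livestatus_request = "COMMAND " + ack + "\n"
```

The same `ack` string is what would be written to the nagios.cmd pipe directly (with a trailing newline), so it may be a quick way to test whether the crash depends on Thruk at all.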

Re: nagios.cmd acknowledgements killing nagios service

Posted: Fri Aug 20, 2021 9:04 am
by gsmith
Hi,

Thanks for that.

Have a great weekend!