nagios.cmd acknowledgements killing nagios service

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
optionstechnology
Posts: 234
Joined: Thu Nov 17, 2016 11:26 am

nagios.cmd acknowledgements killing nagios service

Post by optionstechnology »

When I try to acknowledge an alert using Thruk it kills the nagios box and writes the following to the log-

[1627565739] NDO-3: ndo_return = 1 (Commands out of sync; you can't run this command now)
[1627565739] NDO-3: ndo_get_object_id_name2(ndo.c:1312): Unable to store results
[1627565739] Caught SIGSEGV, shutting down...
[1627565744] Caught SIGTERM, shutting down...
gsmith
Posts: 1253
Joined: Tue Mar 02, 2021 11:15 am

Re: nagios.cmd acknowledgements killing nagios service

Post by gsmith »

Hi,

Is this repeatable? If so, next time you get an alert please try
to acknowledge it via the Nagios web gui and let me know what
the results are.

Thanks
optionstechnology
Posts: 234
Joined: Thu Nov 17, 2016 11:26 am

Re: nagios.cmd acknowledgements killing nagios service

Post by optionstechnology »

yeah it happens every time I try to ack something via nagios.cmd

I can acknowledge the alerts inside nagios ok - sometimes it can take up to ten minutes for the acknowledgement to actually show up in the service though... not sure if its related to this issue

Could it be something to do with the ndo3? This is the only nagios server I have that is not downgrated back to ndo2.... and the only one that has this issue
gsmith
Posts: 1253
Joined: Tue Mar 02, 2021 11:15 am

Re: nagios.cmd acknowledgements killing nagios service

Post by gsmith »

Hi

Yeah it seems like it is the NDO3 issue, that Thruk is somehow triggering consistently.

I don't know Thruk, but I am guessing nagios.cmd is where you define a command to be run in Nagios?

If so could you share the command please?
What OS are you running?
I ask because I am going to try and replicate the error for my Dev guys.

As for your original problem go ahead and downgrade from NDO3 --> NDO2 and let me know if that solves it.

Thanks
optionstechnology
Posts: 234
Joined: Thu Nov 17, 2016 11:26 am

Re: nagios.cmd acknowledgements killing nagios service

Post by optionstechnology »

The downgrade to ndo2 did fix it, ive tested on multiple servers and it seems the be the case everywhere

Sorry - not sure of the command structure - I just know that this is the way Thruk sends it acknowledgements to the nagios back end, at least I think it does, ill see if I can find out
gsmith
Posts: 1253
Joined: Tue Mar 02, 2021 11:15 am

Re: nagios.cmd acknowledgements killing nagios service

Post by gsmith »

Hi,

Glad the downgrade worked.

I really appreciate you looking for the command Thruk is issuing. If it takes too much time it's
OK to bail on it.

Thanks!
optionstechnology
Posts: 234
Joined: Thu Nov 17, 2016 11:26 am

Re: nagios.cmd acknowledgements killing nagios service

Post by optionstechnology »

The command is sent via livestatus -

e.g.

[external_command][user][token] [nagserver] cmd: COMMAND [1629447329] SCHEDULE_SVC_DOWNTIME;hostname;Servicename;1629447300;1668597300;1;0;7200;user;comment
gsmith
Posts: 1253
Joined: Tue Mar 02, 2021 11:15 am

Re: nagios.cmd acknowledgements killing nagios service

Post by gsmith »

Hi,

Thanks for that.

Have a great weekend!
Locked