Passive checks reams to miss a beat
Posted: Thu Sep 24, 2020 6:51 am
Hi
We have a questions regarding passive control, NRDP,
Our Nagios XI 5.7.2 i running on Redhat6 64 in vmware.
We have created a host with 10 checks in Nagios XI, all passive.
They are updated from a client using a custom script.
If we run the script to update one service one time, all works as expected.
and if we run the script to update all 10 services once, all works as well.
But if we update the same service with a new status or status information more than once in quick succession.
It seems random which update is displayed.
If we delay updates of same service to 10 seconds, the last update is shows. Almost all the time.
The question is
How fast or slow can we update the same service, and expect the last of the updates to be shown. Every time ?
We have even tried to use send_nrpd.sh as a test, to rule out any error in our custom code.
% cat test10.sh
PAUSE=10
for t in 1 2 3 4 5 6 7 8 9 10; do
./send_nrdp.sh -u https://nagiosserver/nrdp/ -t <very secret token> -H testserver -s "batch job 1" -S 1 -o "Job run warning $t"
sleep $PAUSE
./send_nrdp.sh -u https://nagiosserver/nrdp/ -t <very secret token> -H testserver -s "batch job 1" -S 0 -o "Job run ok $t"
sleep $PAUSE
done
This run with similar results, PAUSE under 10 seems to give random results.
10 and above seems to work as expected, last update i shown.
I have ensured that send_nrdp.sh returns 1 every time
Sent 1 checks to https://nagiosserver/nrdp/
Sent 1 checks to https://nagiosserver/nrdp/
Sent 1 checks to https://nagiosserver/nrdp/
Sent 1 checks to https://nagiosserver/nrdp/
......
Please advice to the inner workings in Nagios, to figure out reasonable timings.
Regards.
Henrik
We have a questions regarding passive control, NRDP,
Our Nagios XI 5.7.2 i running on Redhat6 64 in vmware.
We have created a host with 10 checks in Nagios XI, all passive.
They are updated from a client using a custom script.
If we run the script to update one service one time, all works as expected.
and if we run the script to update all 10 services once, all works as well.
But if we update the same service with a new status or status information more than once in quick succession.
It seems random which update is displayed.
If we delay updates of same service to 10 seconds, the last update is shows. Almost all the time.
The question is
How fast or slow can we update the same service, and expect the last of the updates to be shown. Every time ?
We have even tried to use send_nrpd.sh as a test, to rule out any error in our custom code.
% cat test10.sh
PAUSE=10
for t in 1 2 3 4 5 6 7 8 9 10; do
./send_nrdp.sh -u https://nagiosserver/nrdp/ -t <very secret token> -H testserver -s "batch job 1" -S 1 -o "Job run warning $t"
sleep $PAUSE
./send_nrdp.sh -u https://nagiosserver/nrdp/ -t <very secret token> -H testserver -s "batch job 1" -S 0 -o "Job run ok $t"
sleep $PAUSE
done
This run with similar results, PAUSE under 10 seems to give random results.
10 and above seems to work as expected, last update i shown.
I have ensured that send_nrdp.sh returns 1 every time
Sent 1 checks to https://nagiosserver/nrdp/
Sent 1 checks to https://nagiosserver/nrdp/
Sent 1 checks to https://nagiosserver/nrdp/
Sent 1 checks to https://nagiosserver/nrdp/
......
Please advice to the inner workings in Nagios, to figure out reasonable timings.
Regards.
Henrik