check_ntp_time fails for first hour after restart
Posted: Mon Jun 17, 2013 4:56 am
Hi,
I've a strange issue, that I can't seem to find any links to on the web
I'm running nagios 3.4.1 on a VM (virtual box), under Ubuntu 12.04.
it's been up and running fine for the last 9 months (approx), and I've only seen real issues come out of it.
The machine recently suffered a disk issue (remounted root as r/o), but seems to be ok after a reboot & fsck. I'll replace the disk in due course, but for now here's where I am.
Whenever I restart nagios, _all_ the check_ntp_time scripts on the clients (various ubuntu flavours, running nrpe), give me "CRITICAL: Offset Unknown" for about an hour or so (I've not timed exactly), before they start to work.
If I run manually on a client I get:
/usr/lib/nagios/plugins/check_ntp_time -H lemon -w 1 -c 10 -v
sending request to peer 0
response from peer 0: offset -0.5986770391
sending request to peer 0
response from peer 0: offset -0.6005151272
sending request to peer 0
response from peer 0: offset -0.6005502939
sending request to peer 0
response from peer 0: offset -0.6005481482
discarding peer 0: flags=3
overall average offset: 0
NTP CRITICAL: Offset unknown|
(lemon is the name of my nagios host).
for reference on this machine (so it's not the leap-second bug), but it should be noted I've got many versions of the scripts from 1.4.10 onwards:
/usr/lib/nagios/plugins/check_ntp_time -V
check_ntp_time v1.4.14 (nagios-plugins 1.4.14)
Running the reverse on the nagios-server itself shows that I've got a bit of an offset, but within the thresholds:
/usr/local/nagios/libexec/check_ntp_time -v -H fandango
sending request to peer 0
response from peer 0: offset 0.7899760008
sending request to peer 0
response from peer 0: offset 0.7899907827
sending request to peer 0
response from peer 0: offset 0.7900112867
sending request to peer 0
response from peer 0: offset 0.7887747288
overall average offset: 0.7899760008
NTP OK: Offset 0.7899760008 secs|offset=0.789976s;60.000000;120.000000;
But a problem when running against localhost:
/usr/local/nagios/libexec/check_ntp_time -v -H localhost
sending request to peer 0
response from peer 0: offset -7.510185242e-06
sending request to peer 0
response from peer 0: offset -9.894371033e-06
sending request to peer 0
response from peer 0: offset -5.722045898e-06
sending request to peer 0
response from peer 0: offset -5.841255188e-06
discarding peer 0: flags=3
overall average offset: 0
NTP CRITICAL: Offset unknown|
Has anyone got any ideas on where to go from here?
I've a strange issue, that I can't seem to find any links to on the web
I'm running nagios 3.4.1 on a VM (virtual box), under Ubuntu 12.04.
it's been up and running fine for the last 9 months (approx), and I've only seen real issues come out of it.
The machine recently suffered a disk issue (remounted root as r/o), but seems to be ok after a reboot & fsck. I'll replace the disk in due course, but for now here's where I am.
Whenever I restart nagios, _all_ the check_ntp_time scripts on the clients (various ubuntu flavours, running nrpe), give me "CRITICAL: Offset Unknown" for about an hour or so (I've not timed exactly), before they start to work.
If I run manually on a client I get:
/usr/lib/nagios/plugins/check_ntp_time -H lemon -w 1 -c 10 -v
sending request to peer 0
response from peer 0: offset -0.5986770391
sending request to peer 0
response from peer 0: offset -0.6005151272
sending request to peer 0
response from peer 0: offset -0.6005502939
sending request to peer 0
response from peer 0: offset -0.6005481482
discarding peer 0: flags=3
overall average offset: 0
NTP CRITICAL: Offset unknown|
(lemon is the name of my nagios host).
for reference on this machine (so it's not the leap-second bug), but it should be noted I've got many versions of the scripts from 1.4.10 onwards:
/usr/lib/nagios/plugins/check_ntp_time -V
check_ntp_time v1.4.14 (nagios-plugins 1.4.14)
Running the reverse on the nagios-server itself shows that I've got a bit of an offset, but within the thresholds:
/usr/local/nagios/libexec/check_ntp_time -v -H fandango
sending request to peer 0
response from peer 0: offset 0.7899760008
sending request to peer 0
response from peer 0: offset 0.7899907827
sending request to peer 0
response from peer 0: offset 0.7900112867
sending request to peer 0
response from peer 0: offset 0.7887747288
overall average offset: 0.7899760008
NTP OK: Offset 0.7899760008 secs|offset=0.789976s;60.000000;120.000000;
But a problem when running against localhost:
/usr/local/nagios/libexec/check_ntp_time -v -H localhost
sending request to peer 0
response from peer 0: offset -7.510185242e-06
sending request to peer 0
response from peer 0: offset -9.894371033e-06
sending request to peer 0
response from peer 0: offset -5.722045898e-06
sending request to peer 0
response from peer 0: offset -5.841255188e-06
discarding peer 0: flags=3
overall average offset: 0
NTP CRITICAL: Offset unknown|
Has anyone got any ideas on where to go from here?