Page 2 of 3

Re: check_bind.sh pluguin

Posted: Tue Sep 22, 2015 11:50 am
by linuser
++ Name Server Statistics ++
598 IPv4 requests received
250 requests with EDNS(0) received
598 responses sent
250 responses with EDNS(0) sent
479 queries resulted in successful answer
15 queries resulted in authoritative answer
581 queries resulted in non authoritative answer
31 queries resulted in nxrrset
2 queries resulted in SERVFAIL
86 queries resulted in NXDOMAIN
46 queries caused recursion

Yes, the nagios user can read the file. One thing I did notice, and this may or may not be releated, is that I have a file system loop in the /var/named folder. For instance if I were to run " find -type f -name rndc" I would get back "File system loop detected; `/var/named/chroot/var/named' is part of the same file system loop as `/var/named'."

I wonder how that happened and how I can safely fix it.

Re: check_bind.sh pluguin

Posted: Tue Sep 22, 2015 12:12 pm
by linuser
I don't think the chroot loop is a problem according to this:

http://unix.stackexchange.com/questions ... p-detected

Also the nagios user has access to all the files under chroot that it needs.

Re: check_bind.sh pluguin

Posted: Tue Sep 22, 2015 2:36 pm
by jdalrymple
I'm not 100% clear - if you just run the plugin from the command line (no nrpe or nagios involved at all) do you get output besides a bunch of zeros?

Did you note these comments from the Exchange?
The script required a few tweeks to work. 1-put nagios in the named group, 2-add "sudo rndc stats"

That was it and it worked fine.

Re: check_bind.sh pluguin

Posted: Tue Sep 22, 2015 2:46 pm
by linuser
No, command line run only still returns all values zeroed out.

I did see that comment, however I took it to mean that it took him a couple weeks to get it to work at all. Thanks for pointing that out I suppose I'll just wait it out.

Re: check_bind.sh pluguin

Posted: Wed Sep 23, 2015 11:29 am
by hsmith
Let us know if there is anything else we can do, in the mean time we'll be waiting for your feedback.

Thanks!

Re: check_bind.sh pluguin

Posted: Wed Sep 23, 2015 12:14 pm
by linuser
Actually yes. Apparently I am misreading stuff all over the place. In the quote that jdalrymple posted, I read "tweeks" as weeks. I Have already done these 2 things. My nagios user is in the "named" group and the nagios user can sudo rndc stats. Now, I may be misinterpreting something again, and would like more help.

What exactly does the person mean when they say "add sudo rndc stats"? Add that to what? Where?

Re: check_bind.sh pluguin

Posted: Wed Sep 23, 2015 1:59 pm
by linuser
I really don't think what I described above is the problem. I have verified those 2 things. However - I was looking back over the check_bind.sh script and noticed it said something about using this in a chroot-bind environment, which I am doing. It says:

"If you're running BIND in a chroot environment, use this"
echo " option to define a path to be chrooted. Please adjust also"
echo " your sudo configuration to enable the Nagios user to chroot!"

Could this be the problem? If so how do I enable the nagios user to "chroot"?

Re: check_bind.sh pluguin

Posted: Wed Sep 23, 2015 3:02 pm
by Box293
linuser wrote: Problem is, this still does not append data to /tmp/named.stats.tmp and /var/named/data/named.stats.
Check the permissions on /tmp/named.stats.tmp and make sure the nagios user has permission to update it. This problem can occur because you may have originally tested it as the root user and the nagios user does not have permission to access that file. Just delete it if you want and then execute it via check_nrpe and see if it works.

Re: check_bind.sh pluguin

Posted: Wed Sep 23, 2015 3:29 pm
by linuser
Box293 wrote: Check the permissions on /tmp/named.stats.tmp and make sure the nagios user has permission to update it. This problem can occur because you may have originally tested it as the root user and the nagios user does not have permission to access that file. Just delete it if you want and then execute it via check_nrpe and see if it works.
Yea, I got that to work several posts back. The nagios user has access and can dump stats to the files in both the /tmp and the /var/named/data folders. The problem now is that when nagios polls or when I manually run the check_bind.sh scripts all the performance data has "0" values.

For example:

Bind9 is running. 0 successfull requests, 0 referrals, 0 nxdomains since last check. | 'success'=0 'referral'=0 'nxrrset'=0 'nxdomain'=0 'recursion'=0 'failure'=0 'duplicate'=0 'dropped'=0

So zeros across the board even though stats are being dumped in /tmp/named.stats.tmp and /var/named/data/named.stats.

Re: check_bind.sh pluguin

Posted: Wed Sep 23, 2015 3:36 pm
by linuser
Now here is something interesting. When there is a stat dump, the named.stats.tmp file in /tmp get modified however the "statistics dump" at the timestamp has no data compared to the statistics dump in /var/named/data/named.stats.

i.e., from named.stats.tmp at 1443039719

Code: Select all

+++ Statistics Dump +++ (1443039719)
This is the last line of the file, no data after that. However, in /var/named/data/named.stats I see this at that same timestamp.

Code: Select all

+++ Statistics Dump +++ (1443039719)
++ Incoming Requests ++
                  15 QUERY
++ Incoming Queries ++
                  15 A
++ Outgoing Queries ++
[View: default]
                  44 A
                   7 NS
                  36 AAAA
                   3 DS
                   8 DNSKEY
                  12 DLV
[View: _bind]
++ Name Server Statistics ++
                  15 IPv4 requests received
                  15 responses sent
                  15 queries resulted in successful answer
                  15 queries resulted in non authoritative answer
                   2 queries caused recursion
++ Zone Maintenance Statistics ++
                   2 IPv4 notifies sent
++ Resolver Statistics ++
[Common]
[View: default]
                  64 IPv4 queries sent
                  46 IPv6 queries sent
                  62 IPv4 responses received
                   2 NXDOMAIN received
                  43 query retries
                   2 query timeouts
                  22 IPv4 NS address fetches
                  24 IPv6 NS address fetches
                  13 DNSSEC validation attempted
                   9 DNSSEC validation succeeded
                   4 DNSSEC NX validation succeeded
                  43 queries with RTT 10-100ms
                  19 queries with RTT 100-500ms
Now I wonder if this is normal or a problem.?