Sudden inability to monitor SQL

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
dlukinski
Posts: 1130
Joined: Tue Oct 06, 2015 9:42 am

Sudden inability to monitor SQL

Post by dlukinski »

Hello XI support

We are suddenly unable to monitor SQL Databases and Instances on the SQL sever 2012 SP3

DB: "Caught unexpected error. This could be caused by your sysperfinfo not containing the proper entries for this query, and you may delete this service check."

INSTANCE: "Connection to the database failed for an unknown reason."

This worked until it stopped 3 days ago.
I could still login to DB & instance with same credentials (local SQL, no meta chars in password)

CMD sample: -U 'nagiosxi' -P 'checkME123' -I 'FIKCM4DBD01S1' --averagewait --warning 150000 --critical 200000

Would this trouble be known to you?
avandemore
Posts: 1597
Joined: Tue Sep 27, 2016 4:57 pm

Re: Sudden inability to monitor SQL

Post by avandemore »

What is the full output from the Run Check Command?
Previous Nagios employee
dlukinski
Posts: 1130
Joined: Tue Oct 06, 2015 9:42 am

Re: Sudden inability to monitor SQL

Post by dlukinski »

avandemore wrote:What is the full output from the Run Check Command?
[nagios@fikc-nagxiprod01.konecranes.com ~]$ /usr/local/nagios/libexec/check_mssql_database.py -H fikc-m4dbd01 -U 'nagiosxi' -P 'checkNAG1' -T 'Siebeldb_T1' -I FIKCM4DBD01S1 --datasize --warning 1500000000 --critical 1700000000
<class &apos;pymssql.InterfaceError&apos;>
Caught unexpected error. This could be caused by your sysperfinfo not containing the proper entries for this query, and you may delete this service check.

That server is accessible (NSclient checks are operational)
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Re: Sudden inability to monitor SQL

Post by rkennedy »

Are any mssql checks working at all, or did they all stop working between a few different machines?

A couple things to verify -
1. database is accepting mixed auth.
2. any luck if you try the IP? Wondering if it's trying to go over IPv6 and failing.
Former Nagios Employee
dlukinski
Posts: 1130
Joined: Tue Oct 06, 2015 9:42 am

Re: Sudden inability to monitor SQL

Post by dlukinski »

rkennedy wrote:Are any mssql checks working at all, or did they all stop working between a few different machines?

A couple things to verify -
1. database is accepting mixed auth.
2. any luck if you try the IP? Wondering if it's trying to go over IPv6 and failing.
Yes, authentication is mixed

Same results via IPv4
[nagios@fikc-nagxiprod01.konecranes.com ~]$ /usr/local/nagios/libexec/check_mssql_database.py -H 10.102.74.84 -U 'nagiosxi' -P 'checkNAG1' -T 'Siebeldb_T1' -I FIKCM4DBD01S1 --logshrinks --warning 30 --critical 50
<class &apos;pymssql.InterfaceError&apos;>
Caught unexpected error. This could be caused by your sysperfinfo not containing the proper entries for this query, and you may delete this service check.
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Sudden inability to monitor SQL

Post by tmcdonald »

And as for @rkennedy's question regarding other checks, is it just this one that is failing or are all SQL checks against that machine failing? Are other machines' SQL checks failing as well? We're trying to see if this problem is local to just one machine in which case something might have changed there, or if it is on the Nagios side of things which could point to a broken upgrade of libraries or something else the plugin relies on.
Former Nagios employee
dlukinski
Posts: 1130
Joined: Tue Oct 06, 2015 9:42 am

Re: Sudden inability to monitor SQL

Post by dlukinski »

tmcdonald wrote:And as for @rkennedy's question regarding other checks, is it just this one that is failing or are all SQL checks against that machine failing? Are other machines' SQL checks failing as well? We're trying to see if this problem is local to just one machine in which case something might have changed there, or if it is on the Nagios side of things which could point to a broken upgrade of libraries or something else the plugin relies on.
Yes all the checks are failing (same across multiple machines):

DB-Lib error message 20009, severity 9:
Unable to connect: Adaptive Server is unavailable or does not exist
Net-Lib error during Operation now in progress Error 115 - Operation now in progress

I think I have make a ticket out of this one

Thanks,
Dimitri
User avatar
cdienger
Support Tech
Posts: 5045
Joined: Tue Feb 07, 2017 11:26 am

Re: Sudden inability to monitor SQL

Post by cdienger »

Hi Dimitri, Have you opened a ticket? If so, we'd like to close this thread while you continue to work with us through the ticket.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
dlukinski
Posts: 1130
Joined: Tue Oct 06, 2015 9:42 am

Re: Sudden inability to monitor SQL

Post by dlukinski »

cdienger wrote:Hi Dimitri, Have you opened a ticket? If so, we'd like to close this thread while you continue to work with us through the ticket.

OK, lets close this thread

/Dimitri
Locked