NRDS Monitoring Problem

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
cklayus
Posts: 62
Joined: Tue May 27, 2014 8:42 am

NRDS Monitoring Problem

Post by cklayus »

We are monitoring hundreds of servers. in order to ease the deployment we are using nrds.
But WE HAVE A PROBLEM. NRDS client is a scheduled base vb script and some time it fails and stops sending data. And the services are in unknown state
Most of the time I just reinstall the client. But it's becoming annoying. Instead of nagios helping me I am chasing monitoring related issues.

Is there a way to fix this?
Or maybe there is a better simpler way to monitor 300 servers?
User avatar
lmiltchev
Former Nagios Staff
Posts: 13587
Joined: Mon May 23, 2011 12:15 pm

Re: NRDS Monitoring Problem

Post by lmiltchev »

NRDS client is a scheduled base vb script and some time it fails and stops sending data.
Did you try to determine why this is happening, instead of reinstalling the client? Is there anything in your environment that prevents the scheduled jobs from running?

You can try the following - open the "Task Scheduler", select the NRDS task, click on the "Action" menu and go to "Properties". Under the "General" tab, select the "Run with highest privileges" checkbox, and click "OK".
Be sure to check out our Knowledgebase for helpful articles and solutions!
cklayus
Posts: 62
Joined: Tue May 27, 2014 8:42 am

Re: NRDS Monitoring Problem

Post by cklayus »

Yes I tried in some cases the task wasnt running because of other task. which is understandable.
But in other cases the task would run but the client would just not send the data. The issues is as follows:
At firs at works fine but after some time it just stops sending data. in the scheduled task i see that i had run successfully but in the nagios log i dont see nothing. (I have to point out that it happens to only 5% of the servers most of them work just fine)
This is how the end of log file looks like:
12/16/2014 9:46:37 PM ---NRDS Transaction Finished--- :
12/16/2014 10:26:32 PM return results from plugin : WINPING+OK+%2D+Round+trip%3A+0+ms%2C+Packet+loss%3A+0%25%2C+packets%3A+5%7C%27round+trip%27%3D0ms%3B200%3B400%3B+%27packet+loss+rate%27%3D0%25%3B40%3B80%3B+packets%3D5%3B
12/16/2014 10:26:32 PM return code from plugin : 0
12/16/2014 10:26:32 PM command to be run : C:\PROGRA~1\Nagios\NRDS_Win\plugins\check_pdm --processor --warning 85 --critical 95 --psamples 10 --pinterval 5
12/16/2014 10:26:32 PM command line executing : cmd /c C:\PROGRA~1\Nagios\NRDS_Win\plugins\check_pdm --processor --warning 85 --critical 95 --psamples 10 --pinterval 5 > %temp%\output.txt
12/16/2014 10:26:34 PM return results from plugin : PROCESSOR+OK+%2D+usage+0%2E00%25%7C%27processor+usage%27%3D0%2E00%25%3B85%3B95%3B
12/16/2014 10:26:34 PM return code from plugin : 0
12/16/2014 10:26:34 PM command to be run : C:\PROGRA~1\Nagios\NRDS_Win\plugins\check_pdm.exe --memory -w 96 -c 99
12/16/2014 10:26:34 PM command line executing : cmd /c C:\PROGRA~1\Nagios\NRDS_Win\plugins\check_pdm.exe --memory -w 96 -c 99 > %temp%\output.txt
12/16/2014 10:26:36 PM return results from plugin : MEMORY+OK+%2D+Physical+usage%3A+846+MB+%2882%25+of+1023+MB%29%7C%27memory+in+use%27%3D846MB%3B+%27memory+usage%27%3D82%25%3B96%3B99%3B+%27memory+total%27%3D1023MB%3B
12/16/2014 10:26:36 PM return code from plugin : 0
12/16/2014 10:26:36 PM command to be run : C:\PROGRA~1\Nagios\NRDS_Win\plugins\check_pdm.exe --memory pagefile -w 80 -c 95
12/16/2014 10:26:36 PM command line executing : cmd /c C:\PROGRA~1\Nagios\NRDS_Win\plugins\check_pdm.exe --memory pagefile -w 80 -c 95 > %temp%\output.txt
12/16/2014 10:26:38 PM return results from plugin : MEMORY+OK+%2D+Pagefile+usage%3A+1997+MB+%2860%25+of+3328+MB%29%7C%27memory+in+use%27%3D1997MB%3B+%27memory+usage%27%3D60%25%3B80%3B95%3B+%27memory+total%27%3D3328MB%3B
12/16/2014 10:26:38 PM return code from plugin : 0
12/16/2014 10:26:38 PM command to be run : cscript //nologo C:\PROGRA~1\Nagios\NRDS_Win\plugins\check_all_disk.vbs /w:10 /c:5
12/16/2014 10:26:38 PM command line executing : cmd /c cscript //nologo C:\PROGRA~1\Nagios\NRDS_Win\plugins\check_all_disk.vbs /w:10 /c:5 > %temp%\output.txt
12/16/2014 10:26:38 PM return results from plugin : All+DISKS+OK+%7C+C%3A%5C+Free+in+%25%3D20%3B10%3B5%2C
12/16/2014 10:26:38 PM return code from plugin : 0
12/16/2014 10:26:38 PM XML to be posted to NRDP : token=VzgGWzIdN1eFE87vqv1okE481vB3zj6H&cmd=submitcheck&XMLDATA=<?xml version='1.0'?><checkresults><checkresult type='host' checktype='1'><hostname>GEMS_DC01</hostname><state>0</state><output>WINPING+OK+%2D+Round+trip%3A+0+ms%2C+Packet+loss%3A+0%25%2C+packets%3A+5%7C%27round+trip%27%3D0ms%3B200%3B400%3B+%27packet+loss+rate%27%3D0%25%3B40%3B80%3B+packets%3D5%3B</output></checkresult><checkresult type='service' checktype='1'><hostname>GEMS_DC01</hostname><servicename>Cpu Load</servicename><state>0</state><output>PROCESSOR+OK+%2D+usage+0%2E00%25%7C%27processor+usage%27%3D0%2E00%25%3B85%3B95%3B</output></checkresult><checkresult type='service' checktype='1'><hostname>GEMS_DC01</hostname><servicename>Memory Usage</servicename><state>0</state><output>MEMORY+OK+%2D+Physical+usage%3A+846+MB+%2882%25+of+1023+MB%29%7C%27memory+in+use%27%3D846MB%3B+%27memory+usage%27%3D82%25%3B96%3B99%3B+%27memory+total%27%3D1023MB%3B</output></checkresult><checkresult type='service' checktype='1'><hostname>GEMS_DC01</hostname><servicename>Page File Usage</servicename><state>0</state><output>MEMORY+OK+%2D+Pagefile+usage%3A+1997+MB+%2860%25+of+3328+MB%29%7C%27memory+in+use%27%3D1997MB%3B+%27memory+usage%27%3D60%25%3B80%3B95%3B+%27memory+total%27%3D3328MB%3B</output></checkresult><checkresult type='service' checktype='1'><hostname>GEMS_DC01</hostname><servicename>Disks Free Space</servicename><state>0</state><output>All+DISKS+OK+%7C+C%3A%5C+Free+in+%25%3D20%3B10%3B5%2C</output></checkresult></checkresults>
12/16/2014 10:26:38 PM running Post routine.. :
12/16/2014 10:26:38 PM ignoring certificate errors : 13056
12/16/2014 10:26:38 PM Post Data.... : token=VzgGWzIdN1eFE87vqv1okE481vB3zj6H&cmd=submitcheck&XMLDATA=<?xml version='1.0'?><checkresults><checkresult type='host' checktype='1'><hostname>GEMS_DC01</hostname><state>0</state><output>WINPING+OK+%2D+Round+trip%3A+0+ms%2C+Packet+loss%3A+0%25%2C+packets%3A+5%7C%27round+trip%27%3D0ms%3B200%3B400%3B+%27packet+loss+rate%27%3D0%25%3B40%3B80%3B+packets%3D5%3B</output></checkresult><checkresult type='service' checktype='1'><hostname>GEMS_DC01</hostname><servicename>Cpu Load</servicename><state>0</state><output>PROCESSOR+OK+%2D+usage+0%2E00%25%7C%27processor+usage%27%3D0%2E00%25%3B85%3B95%3B</output></checkresult><checkresult type='service' checktype='1'><hostname>GEMS_DC01</hostname><servicename>Memory Usage</servicename><state>0</state><output>MEMORY+OK+%2D+Physical+usage%3A+846+MB+%2882%25+of+1023+MB%29%7C%27memory+in+use%27%3D846MB%3B+%27memory+usage%27%3D82%25%3B96%3B99%3B+%27memory+total%27%3D1023MB%3B</output></checkresult><checkresult type='service' checktype='1'><hostname>GEMS_DC01</hostname><servicename>Page File Usage</servicename><state>0</state><output>MEMORY+OK+%2D+Pagefile+usage%3A+1997+MB+%2860%25+of+3328+MB%29%7C%27memory+in+use%27%3D1997MB%3B+%27memory+usage%27%3D60%25%3B80%3B95%3B+%27memory+total%27%3D3328MB%3B</output></checkresult><checkresult type='service' checktype='1'><hostname>GEMS_DC01</hostname><servicename>Disks Free Space</servicename><state>0</state><output>All+DISKS+OK+%7C+C%3A%5C+Free+in+%25%3D20%3B10%3B5%2C</output></checkresult></checkresults>
12/16/2014 10:26:38 PM address..... : https://monitor.edpcloud.co.il/nrdp//?t ... 3zj6H&cmd=
12/16/2014 10:26:38 PM post response from NRDP : <?xml version="1.0" encoding="utf-8"?>
<result>
<status>0</status>
<message>OK</message>
<meta>
<output>5 checks processed.</output>
</meta>
</result>

12/16/2014 10:26:38 PM ---NRDS Transaction Finished--- :
12/16/2014 10:46:00 PM base NRDP address locked and loaded : https://monitor.edpcloud.co.il/nrdp//?t ... 3zj6H&cmd=
12/16/2014 10:46:00 PM XML being passed to check for updates : https://monitor.edpcloud.co.il/nrdp//?t ... DATA=<?xml version='1.0' ?><configs><config><name>WIN_GENERIC</name><version>3.9</version></config></configs>
12/16/2014 10:46:00 PM running Post routine.. :
12/16/2014 10:46:00 PM ignoring certificate errors : 13056
12/16/2014 10:46:00 PM Post Data.... :
12/16/2014 10:46:00 PM address..... : https://monitor.edpcloud.co.il/nrdp//?t ... DATA=<?xml version='1.0' ?><configs><config><name>WIN_GENERIC</name><version>3.9</version></config></configs>
12/21/2014 7:21:26 AM Diagnostic mode : False
12/22/2014 6:27:04 PM Diagnostic mode : False
12/25/2014 11:46:47 PM Diagnostic mode : False
12/26/2014 8:36:44 AM Diagnostic mode : False
12/26/2014 8:36:44 AM base NRDP address locked and loaded : https://monitor.edpcloud.co.il/nrdp//?t ... 3zj6H&cmd=


as you can see the last entry is couple of weeks ago.....
User avatar
lmiltchev
Former Nagios Staff
Posts: 13587
Joined: Mon May 23, 2011 12:15 pm

Re: NRDS Monitoring Problem

Post by lmiltchev »

What is the size of the "NRDS_Debug.log" on the Windows box and when was the last time it was modified?
Be sure to check out our Knowledgebase for helpful articles and solutions!
cklayus
Posts: 62
Joined: Tue May 27, 2014 8:42 am

Re: NRDS Monitoring Problem

Post by cklayus »

its 84 KB it is constantly changes the last mod date. so now its today..... but i dont see any changes in the actual log
User avatar
lmiltchev
Former Nagios Staff
Posts: 13587
Joined: Mon May 23, 2011 12:15 pm

Re: NRDS Monitoring Problem

Post by lmiltchev »

but i dont see any changes in the actual log
Do you mean you only see the "old" entries from a couple of weeks ago?

Double click on the scheduled task, go to the "Settings" tab, and see if the "Stop the task if it runs longer than" check-box is selected. I've seen cases when the task stops running after a certain period of time (1 day, 3 days, etc.).
Be sure to check out our Knowledgebase for helpful articles and solutions!
cklayus
Posts: 62
Joined: Tue May 27, 2014 8:42 am

Re: NRDS Monitoring Problem

Post by cklayus »

yes

this box is cheked
User avatar
lmiltchev
Former Nagios Staff
Posts: 13587
Joined: Mon May 23, 2011 12:15 pm

Re: NRDS Monitoring Problem

Post by lmiltchev »

Have you compared the configs from the Nagios XI server (Admin->NRDS Config Manager) and the Windows box (C:\Program Files\Nagios\NRDS_Win\config.ini)? Any inconsistencies (IP address, token, config name, etc.)?
Be sure to check out our Knowledgebase for helpful articles and solutions!
cklayus
Posts: 62
Joined: Tue May 27, 2014 8:42 am

Re: NRDS Monitoring Problem

Post by cklayus »

The version is the same, everything else is ok. its not just this one server its about 10% of all my servers.

I tried using the NCPA client. The host appeared in the unconfigured objects but the never sent any data.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: NRDS Monitoring Problem

Post by scottwilkerson »

cklayus wrote: I tried using the NCPA client. The host appeared in the unconfigured objects but the never sent any data.
Did you configure the services after you saw it in unconfigured objects? No data will appe4ar until you run the items through the wizard. they appear in unconfigured objects if the XI server doesn't have the checks configured.

If it still appears in unconfigured objects, select the objects and click the blue arrow to configure them
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
Locked