Page 1 of 3

How to monitor following things in a single server

Posted: Tue Mar 18, 2014 3:23 am
by Rupender
Hi All,

Anyone can tell me how to monitor these following things in a single server with the help of Nagiosxi server:-

Current Load (load average more than 75% of number of CPU cores is critical)
HTTPS (site accessibility)
Disk space (less than 30% of amount is critical)
Swap Usage (more than 10% is critical)
RAID Array State (consistency of the array for your adaptor)
Running Processes when using RADIUS (vcs-bbradiusd at least 3 processes running)
Running Processes when using SIP Redirect (vcs-bbsipd at least 3 processes running)


Please provide me some guidelines so that i am able to monitor those things?

Thanks,

Re: How to monitor following things in a single server

Posted: Tue Mar 18, 2014 9:41 am
by abrist
The decision of which plugins to use for these checks relies on the agent/check method and subsequently on the platform itself (especially for hardware/system checks like memory/cpu/hd). What OS are these servers running?

Re: How to monitor following things in a single server

Posted: Tue Mar 18, 2014 9:44 am
by sreinhardt
Depending on what options you have available to you, and the os of the system you wish to monitor. This could be done via ncpa or wmi for windows, or ncpa, snmp, or check by ssh on linux systems. Almost all of these checks will be included in either the ncpa wizard (windows and linux use the same), linux server wizard, wmi wizard, or by setting up manual check by ssh commands and services. The only two I would point out as being separate, is if you wish to monitor the http server from the nagios machine, the website wizard would probably be best, and the raid monitoring will also be separate as you will likely need a custom check setup for that. How do you think you would like to proceed?

Re: How to monitor following things in a single server

Posted: Tue Mar 18, 2014 11:23 am
by Rupender
Hi All,

Thanks for update,

I added one Centos server with the help of SNMP agent , all things are going good except following process monitoring.

==Running Processes when using RADIUS (vcs-bbradiusd at least 3 processes running)
== Running Processes when using SIP Redirect (vcs-bbsipd at least 3 processes running)


As you can see below Nagiosxi is not able to find the process while when i grep in terminal here is the output:-

[root@billberry snmp]# ps -aux |grep vcs-bbsipd
Warning: bad syntax, perhaps a bogus '-'? See /usr/share/doc/procps-3.2.8/FAQ
vcs 10585 0.0 0.0 257468 13004 ? S Mar17 0:28 vcs-bbsipd [/usr/local/vcs/bin/bbsipd.py]
vcs 10588 0.0 0.0 257468 11940 ? S Mar17 0:07 vcs-bbsipd: worker 1
vcs 10589 0.0 0.0 257468 11940 ? S Mar17 0:06 vcs-bbsipd: worker 2
root 12299 0.0 0.0 103252 828 pts/2 S+ 12:22 0:00 grep vcs-bbsipd

Here is the Alert:---------

***** Nagios XI Alert *****

Nagios has detected a problem with this service.

Notification Type: PROBLEM

Service: vcs-bbsipd
Host: PRT_INT_Jerasoft_195
Address: 169.131.241.195
State: CRITICAL
Info:
0 process matching vcs-bbsipd (= 0 : CRITICAL) (= 10):OK
Date/Time: 2014-03-18 12:09:17

Re: How to monitor following things in a single server

Posted: Tue Mar 18, 2014 1:28 pm
by sreinhardt
I do believe it is looking for exact process name matches. Please try reconfiguring those checks with the exact name such as vcs-bbsipd

Re: How to monitor following things in a single server

Posted: Wed Mar 19, 2014 7:20 am
by Rupender
Hi sreinhardt,

I checked with different ways but no luck.



[root@localusarad libexec]# ./check_snmp_process_wizard.pl -H 169.131.241.195 -C public --v2c -n 'vcs-bbradiusd' -w '2,9'
0 process matching vcs-bbradiusd (<= 0 : CRITICAL) (<= 9):OK


Could you please give me some more suggestion.

Re: How to monitor following things in a single server

Posted: Wed Mar 19, 2014 12:28 pm
by sreinhardt
Try doing an snmp walk against the processes oid, and seeing what you come up with:

Code: Select all

snmpwalk -v2c -c public 169.131.241.195 1.3.6.1.2.1.25.4.2.1
snmpwalk -v2c -c public 169.131.241.195 1.3.6.1.2.1.25.4.2.1 | grep -i vcs

Re: How to monitor following things in a single server

Posted: Thu Mar 20, 2014 4:05 am
by Rupender
I ran this command and here is the output


[root@localusarad ~]# snmpwalk -v2c -c public 169.131.241.195 1.3.6.1.2.1.25.4.2.1 | grep -i vcs
HOST-RESOURCES-MIB::hrSWRunName.8943 = STRING: "vcs-xmlrpcd [/u"
HOST-RESOURCES-MIB::hrSWRunName.9365 = STRING: "vcs-bbradiusd ["
HOST-RESOURCES-MIB::hrSWRunName.10585 = STRING: "vcs-bbsipd [/us"
HOST-RESOURCES-MIB::hrSWRunName.10588 = STRING: "vcs-bbsipd: wor"
HOST-RESOURCES-MIB::hrSWRunName.10589 = STRING: "vcs-bbsipd: wor"

HOST-RESOURCES-MIB::hrSWRunName.19238 = STRING: "vcs-bbradiusd: "
HOST-RESOURCES-MIB::hrSWRunName.19239 = STRING: "vcs-bbradiusd: "
HOST-RESOURCES-MIB::hrSWRunName.19240 = STRING: "vcs-bbradiusd: "

HOST-RESOURCES-MIB::hrSWRunPath.8943 = STRING: "vcs-xmlrpcd [/usr/local/vcs/bin/

I need to monitor the highlighted process.
What should i do now? please give me the full details

Re: How to monitor following things in a single server

Posted: Thu Mar 20, 2014 10:38 am
by sreinhardt
After doing some testing on my machine, I found that the plugin does fuzzy matching against the name given, so depending on how specific you want to get you can do different checks for similarly named processes such as:

All vcs apps:

Code: Select all

./check_snmp_process_wizard.pl -H 169.131.241.195 -C public --v2c -n "vcs" -w '2,9'
Just vcs-bb apps: (If you are looking for the ones you bolded, I would suggest this)

Code: Select all

./check_snmp_process_wizard.pl -H 169.131.241.195 -C public --v2c -n "vcs-bb" -w '2,9'
Just vcs-bbsip apps:

Code: Select all

./check_snmp_process_wizard.pl -H 169.131.241.195 -C public --v2c -n "vcs-bbsip" -w '2,9'
Just vcs-bbradius apps:

Code: Select all

./check_snmp_process_wizard.pl -H 169.131.241.195 -C public --v2c -n "vcs-bbradius" -w '2,9'
Try those out and see if they work properly for you.

Re: How to monitor following things in a single server

Posted: Thu Mar 20, 2014 10:50 am
by Rupender
No luck....