Alerting on Unix filesystems separately
Posted: Mon Sep 05, 2016 7:58 am
Hi
We have unix systems with multiple mount points which are supported by different teams.
We want to be able to alert individually for these filesystems. At the moment, when one filesystem goes dritical, all filesystems are sent in the alert.
This is the command that we are using $USER1$/check_nrpe -H $HOSTADDRESS$ -t 30 -c check_disk -a '-w 20% -c 10%'
This is the command output:
COMMAND: /usr/local/nagios/libexec/check_nrpe -H 10.54.125.81 -t 30 -c check_disk -a '-w 20% -c 10%'
OUTPUT: DISK CRITICAL - free space: / 621 MB (60% inode=85%); /usr 1004 MB (17% inode=68%); /var 1243 MB (48% inode=94%); /tmp 1129 MB (55% inode=98%); /home 1325 MB (64% inode=98%); /proc 0 MB (0% inode=-); /opt 1283 MB (31% inode=90%); /admin 511 MB (99% inode=99%); /var/adm/ras/livedump 511 MB (99% inode=99%); /usr/local 966 MB (94% inode=99%); /audit 500 MB (97% inode=99%); /DoOnceAix 511 MB (99% inode=99%); /opt/IBM/SCM 511 MB (99% inode=99%); /opt/Tivoli 1023 MB (99% inode=99%); /oracleinstall 20476 MB (99% inode=99%); /u001 311252 MB (72% inode=99%); /u101 38739 MB (14% inode=99%); /u102 93295 MB (15% inode=99%); /u103 158522 MB (99% inode=99%); /u104 158545 MB (99% inode=99%); /u105 455134 MB (96% inode=99%); /u106 110856 MB (24% inode=99%); /backup01 305105 MB (99% inode=99%); /backup02 305617 MB (99% inode=99%); /tmp2 101360 MB (99% inode=99%);| /=402MB;819;921;0;1024 /usr=4627MB;4505;5068;0;5632 /var=1316MB;2048;2304;0;2560 /tmp=918MB;1638;1843;0;2048 /home=722MB;1638;1843;0;2048 /proc=2147483647MB;214748
It's very difficult to find which disk has gone critical from this output.
Also we don;t want to manually add in a check for each mount.
Any ideas?
Thanks
Pete
We have unix systems with multiple mount points which are supported by different teams.
We want to be able to alert individually for these filesystems. At the moment, when one filesystem goes dritical, all filesystems are sent in the alert.
This is the command that we are using $USER1$/check_nrpe -H $HOSTADDRESS$ -t 30 -c check_disk -a '-w 20% -c 10%'
This is the command output:
COMMAND: /usr/local/nagios/libexec/check_nrpe -H 10.54.125.81 -t 30 -c check_disk -a '-w 20% -c 10%'
OUTPUT: DISK CRITICAL - free space: / 621 MB (60% inode=85%); /usr 1004 MB (17% inode=68%); /var 1243 MB (48% inode=94%); /tmp 1129 MB (55% inode=98%); /home 1325 MB (64% inode=98%); /proc 0 MB (0% inode=-); /opt 1283 MB (31% inode=90%); /admin 511 MB (99% inode=99%); /var/adm/ras/livedump 511 MB (99% inode=99%); /usr/local 966 MB (94% inode=99%); /audit 500 MB (97% inode=99%); /DoOnceAix 511 MB (99% inode=99%); /opt/IBM/SCM 511 MB (99% inode=99%); /opt/Tivoli 1023 MB (99% inode=99%); /oracleinstall 20476 MB (99% inode=99%); /u001 311252 MB (72% inode=99%); /u101 38739 MB (14% inode=99%); /u102 93295 MB (15% inode=99%); /u103 158522 MB (99% inode=99%); /u104 158545 MB (99% inode=99%); /u105 455134 MB (96% inode=99%); /u106 110856 MB (24% inode=99%); /backup01 305105 MB (99% inode=99%); /backup02 305617 MB (99% inode=99%); /tmp2 101360 MB (99% inode=99%);| /=402MB;819;921;0;1024 /usr=4627MB;4505;5068;0;5632 /var=1316MB;2048;2304;0;2560 /tmp=918MB;1638;1843;0;2048 /home=722MB;1638;1843;0;2048 /proc=2147483647MB;214748
It's very difficult to find which disk has gone critical from this output.
Also we don;t want to manually add in a check for each mount.
Any ideas?
Thanks
Pete