Page 1 of 1

Error when checking linux disks

Posted: Mon Nov 04, 2019 12:22 pm
by paulol
Sometimes Nagios XI returns this error in some disks checks "is not accessible: No such file or directory". But it's not all the time.

Re: Error when checking linux disks

Posted: Mon Nov 04, 2019 4:37 pm
by cdienger
What does the configuration for these service checks look like? What happens if you try running the commands directly on the XI command line?

Re: Error when checking linux disks

Posted: Tue Nov 05, 2019 12:14 pm
by paulol
[nagios@TREVOUX ~]$ /usr/local/nagios/libexec/check_nrpe -2 -H 10.0.72.73 -t 50 -u -c check_disk -a '-w 10% -c 5% -W 5% -K 2% -p /data'
DISK OK - free space: /data 27489 MB (28.77% inode=100%);| /data=68060MB;90617;95651;0;100686

Re: Error when checking linux disks

Posted: Tue Nov 05, 2019 5:34 pm
by cdienger
Can we get a screenshot of this check's settings as seen in the CCM(edit the "/usr Disk Usage" service under Configure > Core Config Manager > Monitoring > Services).

Re: Error when checking linux disks

Posted: Mon Nov 11, 2019 11:09 am
by paulol
Please find attached the screenshot.

Re: Error when checking linux disks

Posted: Mon Nov 11, 2019 3:42 pm
by mbellerue
That's a lot of $ARG#$ defined. Is that a custom command? Can you trim that command down so it only has $ARG1$ and $ARG2$?

Re: Error when checking linux disks

Posted: Tue Nov 26, 2019 7:21 am
by paulol
ok.

Re: Error when checking linux disks

Posted: Tue Nov 26, 2019 3:31 pm
by mbellerue
We might need a profile for this one. If you could go to Admin -> System Profile -> Download Profile, and send that file to me in a PM, that would be great. Also, about how often do these soft errors occur? Is it on any one particular host, or is it spread out across all of the servers you monitor?

Re: Error when checking linux disks

Posted: Tue Nov 26, 2019 3:36 pm
by ssax
Interesting, never seen this before.

On 10.0.72.73, what version of the check_disk plugin are you running?

Code: Select all

/usr/local/nagios/libexec/check_disk -V
Are you seeing anything in dmesg on that server? (run it from the XI server as well)

Code: Select all

dmesg
What about in /var/log/messages?

What version of the NRPE agent are you running on that system?

Code: Select all

/usr/local/nagios/libexec/check_nrpe -H 10.0.72.73 -2
Please PM me a copy of your profile as well so that we can see if there is anything nagios related going on, you can download it from Admin > System Profile > Download Profile.