Primary partition out of space
Primary partition out of space
My primary partition ran out of space over night and it looks like the culprit is a single log file located in Var/log/elasticsearch. The name of the file is 6146005a-2b46-45cd-acf7-d6c4d20f245b.log and it is about 32gb. Wondering if this file is safe to delete and also what may have caused it to grow so fast to prevent this from happening again. Unfortunately the file is too large to open to see what is inside.
Re: Primary partition out of space
What I would do is first figure out what's filling up that file:
This should clean that out:
Then monitor it, if it continues to grow fast, please send us a snippet of what's going into the log that is filling it up.
Once you've done that, please PM me a copy of your profile, you can download it from Admin > System Status by clicking the Download System Profile button.
Let us know the results.
Thank you
Code: Select all
tail -n1000 /var/log/elasticsearch/6146005a-2b46-45cd-acf7-d6c4d20f245b.logCode: Select all
service elasticsearch stop
rm -f /var/log/elasticsearch/6146005a-2b46-45cd-acf7-d6c4d20f245b.log
service elasticsearch start
Once you've done that, please PM me a copy of your profile, you can download it from Admin > System Status by clicking the Download System Profile button.
Let us know the results.
Thank you
Re: Primary partition out of space
Sean,
After removing the log file and restarting elastic search the machine will no longer boot up. I am assuming this is due to an issue caused by the primary partition running out of space. On boot I get the following messages over and over.
systemd-fsck[689] /dev/sdb1: recovering journal
Info: task fsck.ext4:700 blocked for more than 120 seconds
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disabled this message
After removing the log file and restarting elastic search the machine will no longer boot up. I am assuming this is due to an issue caused by the primary partition running out of space. On boot I get the following messages over and over.
systemd-fsck[689] /dev/sdb1: recovering journal
Info: task fsck.ext4:700 blocked for more than 120 seconds
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disabled this message
Re: Primary partition out of space
If that's not working on boot you likely will need to boot into recovery mode and run an fsck.
Is this system a VM or a physical machine? What OS/version?
What specifically was done? Deleting the file and rebooting would not cause filesystem corruption unless you are uncleanly shutting it down (hard power off, etc). This is almost acting like a disk issue.
What is the output of these commands (if you're able):
What files are filling up the system?
Please send me a copy of your profile (if you're able to), you can download it from Admin > System Status by clicking the Download System Profile button.
Create a ticket for this and include a link back to this forum thread so we can get a remote session setup:
https://support.nagios.com/tickets/
Is this system a VM or a physical machine? What OS/version?
What specifically was done? Deleting the file and rebooting would not cause filesystem corruption unless you are uncleanly shutting it down (hard power off, etc). This is almost acting like a disk issue.
What is the output of these commands (if you're able):
Code: Select all
df -h
df -i
dmesg
uname -a
cat /etc/*release
Code: Select all
du -sh /usr/local/nagioslogserver/var/log/*
du -sh /var/log/*Create a ticket for this and include a link back to this forum thread so we can get a remote session setup:
https://support.nagios.com/tickets/
Re: Primary partition out of space
Hello Sean,
I have created ticket 694429 and included a link back to this thread.
I have created ticket 694429 and included a link back to this thread.
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Primary partition out of space
Great, Locking thread, moving to ticket.jvaira wrote:Hello Sean,
I have created ticket 694429 and included a link back to this thread.