Backups failing | OutOfMemoryError[Java heap space]
Posted: Tue Nov 24, 2015 6:38 pm
I have two servers in a cluster each with a backup share mounted locally.
It managed to backup the first days backup then stopped working. No "backup snapshots" are present in Backup and Maintenance.
Both servers can access the share. I have reset all jobs and rerun but still doesn't work.
I can see in the logs that it shows the following:
2015-11-25 11:55:05,553 INFO Job starting...
2015-11-25 11:55:05,556 INFO Beginning SNAPSHOT operations...
Traceback (most recent call last):
File "/usr/lib/python2.7/site-packages/curator/curator.py", line 736, in <module>
main()
File "/usr/lib/python2.7/site-packages/curator/curator.py", line 731, in main
arguments.func(client, **argdict)
File "/usr/lib/python2.7/site-packages/curator/curator.py", line 566, in command_loop
snapshot_list = get_object_list(client, data_type='snapshot', **kwargs)
File "/usr/lib/python2.7/site-packages/curator/curator.py", line 279, in get_object_list
object_list = get_snaplist(client, repository, prefix=prefix)
File "/usr/lib/python2.7/site-packages/curator/curator.py", line 234, in get_snaplist
allsnaps = client.snapshot.get(repository=repo_name, snapshot="_all")['snapshots']
File "/usr/lib/python2.7/site-packages/elasticsearch/client/utils.py", line 69, in _wrapped
return func(*args, params=params, **kwargs)
File "/usr/lib/python2.7/site-packages/elasticsearch/client/snapshot.py", line 58, in get
repository, snapshot), params=params)
File "/usr/lib/python2.7/site-packages/elasticsearch/transport.py", line 307, in perform_request
status, headers, data = connection.perform_request(method, url, params, body, ignore=ignore, timeout=timeout)
File "/usr/lib/python2.7/site-packages/elasticsearch/connection/http_urllib3.py", line 93, in perform_request
self._raise_error(response.status, raw_data)
File "/usr/lib/python2.7/site-packages/elasticsearch/connection/base.py", line 105, in _raise_error
raise HTTP_EXCEPTIONS.get(status_code, TransportError)(status_code, error_message, additional_info)
elasticsearch.exceptions.TransportError: TransportError(500, u'RemoteTransportException[[d1e4c296-b0e6-4d29-a854-62197b986998][inet[/192.168.136.131:9300]][cluster:admin/snapshot/get]]; nested: OutOfMemoryError[Java heap space]; ')
I can see in the ElasticSearch config that the heap size is:
ES_HEAP_SIZE=$(expr $(free -m|awk '/^Mem:/{print $2}') / 2 )m
Both servers have 16GB of memory with 36% free.
So close to getting the proof of concept done and tested so I can purchase and put into production.
Any help would be appreciated.
It managed to backup the first days backup then stopped working. No "backup snapshots" are present in Backup and Maintenance.
Both servers can access the share. I have reset all jobs and rerun but still doesn't work.
I can see in the logs that it shows the following:
2015-11-25 11:55:05,553 INFO Job starting...
2015-11-25 11:55:05,556 INFO Beginning SNAPSHOT operations...
Traceback (most recent call last):
File "/usr/lib/python2.7/site-packages/curator/curator.py", line 736, in <module>
main()
File "/usr/lib/python2.7/site-packages/curator/curator.py", line 731, in main
arguments.func(client, **argdict)
File "/usr/lib/python2.7/site-packages/curator/curator.py", line 566, in command_loop
snapshot_list = get_object_list(client, data_type='snapshot', **kwargs)
File "/usr/lib/python2.7/site-packages/curator/curator.py", line 279, in get_object_list
object_list = get_snaplist(client, repository, prefix=prefix)
File "/usr/lib/python2.7/site-packages/curator/curator.py", line 234, in get_snaplist
allsnaps = client.snapshot.get(repository=repo_name, snapshot="_all")['snapshots']
File "/usr/lib/python2.7/site-packages/elasticsearch/client/utils.py", line 69, in _wrapped
return func(*args, params=params, **kwargs)
File "/usr/lib/python2.7/site-packages/elasticsearch/client/snapshot.py", line 58, in get
repository, snapshot), params=params)
File "/usr/lib/python2.7/site-packages/elasticsearch/transport.py", line 307, in perform_request
status, headers, data = connection.perform_request(method, url, params, body, ignore=ignore, timeout=timeout)
File "/usr/lib/python2.7/site-packages/elasticsearch/connection/http_urllib3.py", line 93, in perform_request
self._raise_error(response.status, raw_data)
File "/usr/lib/python2.7/site-packages/elasticsearch/connection/base.py", line 105, in _raise_error
raise HTTP_EXCEPTIONS.get(status_code, TransportError)(status_code, error_message, additional_info)
elasticsearch.exceptions.TransportError: TransportError(500, u'RemoteTransportException[[d1e4c296-b0e6-4d29-a854-62197b986998][inet[/192.168.136.131:9300]][cluster:admin/snapshot/get]]; nested: OutOfMemoryError[Java heap space]; ')
I can see in the ElasticSearch config that the heap size is:
ES_HEAP_SIZE=$(expr $(free -m|awk '/^Mem:/{print $2}') / 2 )m
Both servers have 16GB of memory with 36% free.
So close to getting the proof of concept done and tested so I can purchase and put into production.
Any help would be appreciated.