This repository has been archived by the owner on Apr 27, 2018. It is now read-only.

Memory Issues on Large WARC Files #254

Open
ianmilligan1 opened this issue Oct 11, 2016 · 0 comments

@ianmilligan1
Collaborator

I've been tinkering with @dportabella's issue #246, as we also have some very large WARCs in a collection (some are 7 GB, others are 40, 50, or 60 GB). We run into Java heap space errors with these large WARC files.

Most of our development has focused on standard-size Archive-It files (~1 GB), but it looks like there are lots of larger ones out there.

Is there any tweak we can make to loadArchives to better parse large WARC files?
