There is insufficient memory for the Java Runtime Environment to continue #159
Comments
New error when re-running with swap space (I lowered the driver memory to 45GB).
@ianmilligan1 what happened with the swap? Did it all get used?
I'm not sure; it didn't generate a nice error log this time. On the bright side, it failed sooner.
Can you run it again, and watch
Will do!
OK, failed after watching
I've tried the following (and spent most of the afternoon watching logs, trying different things). In particular the following doesn't work:
We've had this issue since September 2016; see lintool/warcbase#246.
List of failed attempts:
Here's a full Spark log on the failure.
I got it to work!
Limiting the number of cores did it: the default was to use all 16 worker threads, and reducing that to 12 made it work. I'm going to try running all derivatives and, if that works, will close.
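For anyone landing here with the same error, a minimal sketch of capping the worker threads when launching a local spark-shell. The `local[N]` master URL and `--driver-memory` are standard Spark options; the values 12 and 45G come from this thread, and the variable names are only illustrative. The flags that load the toolkit jar are left out because the original command is not shown here.

```shell
# Assumed values taken from the thread: 12 of the 16 worker threads, 45G driver memory.
WORKER_THREADS=12
DRIVER_MEMORY=45G

# local[N] runs Spark locally with N worker threads instead of one per core (local[*]).
SPARK_ARGS="--master local[${WORKER_THREADS}] --driver-memory ${DRIVER_MEMORY}"

# Print the full command for review; run it by hand once it looks right.
echo "spark-shell ${SPARK_ARGS}"
```

Fewer worker threads means fewer large WARCs are held in memory at the same time, which is presumably why dropping from 16 to 12 got the job through.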
Running locally on a 16-core, 55GB Azure machine with Ubuntu 16, analyzing a 293GB collection that has some large WARCs (e.g. 7GB). This machine has previously processed ~4TB collections without trouble. It is a fairly vanilla VM with plenty of free disk space.
This command is used to start spark-shell:
And it failed on this relatively straightforward script to count domains:
The error is:
Error Logs
Here are the error logs: the terminal output and the hs_err_pid2492.log.
I will continue to tackle this tomorrow, but any thoughts or guidance would be greatly appreciated.