Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bulk loader uses too much memory #3017

Closed
codexnull opened this issue Feb 14, 2019 · 3 comments
Closed

Bulk loader uses too much memory #3017

codexnull opened this issue Feb 14, 2019 · 3 comments
Labels
area/bulk-loader Issues related to bulk loading. area/performance Performance related issues. investigate Requires further investigation kind/enhancement Something could be better. status/accepted We accept to investigate/work on it. status/needs-attention This issue needs more eyes on it, more investigation might be required before accepting/rejecting it

Comments

@codexnull
Copy link
Contributor

The bulk loader's memory usage can grow beyond the size of physical RAM and cause swap space to be used. As the image below shows, memory usage grows monotonically during the map phase, is flat during the reduce phase, and is all finally released at exit.

image

The system does not appear to thrash, though, which suggests the memory is just being held by the process without it actively being accessed. Still, a big enough load may cause it to grow greater than RAM + swap and cause system instability or crashes.

Note the system was idle other than for dgraph. I did not test what would happen if concurrent processes had significant memory demands as well.

@codexnull codexnull self-assigned this Feb 14, 2019
@codexnull codexnull added the investigate Requires further investigation label Feb 14, 2019
@campoy campoy added area/bulk-loader Issues related to bulk loading. area/performance Performance related issues. kind/enhancement Something could be better. status/accepted We accept to investigate/work on it. status/needs-attention This issue needs more eyes on it, more investigation might be required before accepting/rejecting it labels Sep 13, 2019
@shekarm
Copy link

shekarm commented Nov 20, 2019

Hi,

Thank you for the post. We have made some improvements in our software (recently) which will likely exhibit different behavior -- in other words, much less memory usage. Keep you posted on developments. Meanwhile, if you haven't already done so, would you be able to share the characteristics of your data set that you are using?

Thanks

@ashish-goswami
Copy link
Contributor

I think this issue can be closed after we fix #5361

@minhaj-shakeel
Copy link
Contributor

Github issues have been deprecated.
This issue has been moved to discuss. You can follow the conversation there and also subscribe to updates by changing your notification preferences.

drawing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/bulk-loader Issues related to bulk loading. area/performance Performance related issues. investigate Requires further investigation kind/enhancement Something could be better. status/accepted We accept to investigate/work on it. status/needs-attention This issue needs more eyes on it, more investigation might be required before accepting/rejecting it
Development

No branches or pull requests

5 participants