-
Couldn't load subscription status.
- Fork 3.9k
ARROW-3920: [plasma] Fix reference counting in custom tensorflow plasma operator. #3061
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ARROW-3920: [plasma] Fix reference counting in custom tensorflow plasma operator. #3061
Conversation
|
Thoughts:
|
|
Thanks @concretevitamin!
|
a4ff374 to
4836342
Compare
| ARROW_CHECK_OK(client_.Release(object_id)); | ||
| auto orig_stream = context->op_device_context()->stream(); | ||
| auto stream_executor = orig_stream->parent(); | ||
| CHECK(stream_executor->HostMemoryUnregister(static_cast<void*>(data))); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We probably only want this to happen in the GPU case, right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same with the other op.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I updated it (it actually doesn't compile for cpu only tensorflow).
Codecov Report
@@ Coverage Diff @@
## master #3061 +/- ##
==========================================
- Coverage 87.07% 87.07% -0.01%
==========================================
Files 489 489
Lines 69160 69161 +1
==========================================
- Hits 60222 60221 -1
- Misses 8837 8839 +2
Partials 101 101
Continue to review full report at Codecov.
|
This includes a fix so the TensorFlow op releases memory properly (apache/arrow#3061) and the possibility to store arrow data structures in plasma (apache/arrow#2832). #3404
There is an issue here where
Releasewas never being called in the plasma TF operator.Note that I also changed the release delay in the plasma operator to 0.