-
Notifications
You must be signed in to change notification settings - Fork 445
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
prototype of bulk import v2 distributed file examination #4898
Draft
keith-turner
wants to merge
21
commits into
apache:2.1
Choose a base branch
from
keith-turner:bulk-load-improvement
base: 2.1
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Draft
Changes from 3 commits
Commits
Show all changes
21 commits
Select commit
Hold shift + click to select a range
aa593ad
prototype of bulk import v2 distributed file examination
keith-turner 407358a
format code
keith-turner fd70d34
fix build
keith-turner 058328a
remove tight coupling to SortedSet by adding simple indirection
keith-turner c8c5f21
format code
keith-turner 9ef0bcf
use Rfile api to read
keith-turner 368b2a4
adds cache
keith-turner e228b68
adds ability to constuct load plan while writing to rfile
keith-turner 9328277
adds tests and javadoc
keith-turner 3190d19
fail build when local mods
keith-turner 174b4e0
update pom for including sha in version
keith-turner f82d111
revert pom change
keith-turner 9c7dc66
Revert "update pom for including sha in version"
keith-turner 4285753
cleanup
keith-turner ec0febb
more cleanup
keith-turner 97e4684
fix validation bug
keith-turner aabe2d8
Update core/src/main/java/org/apache/accumulo/core/data/LoadPlan.java
keith-turner 2003eae
Update core/src/test/java/org/apache/accumulo/core/data/LoadPlanTest.…
keith-turner 667f12e
Update core/src/test/java/org/apache/accumulo/core/data/LoadPlanTest.…
keith-turner a5ead55
Update core/src/test/java/org/apache/accumulo/core/data/LoadPlanTest.…
keith-turner 926dec7
sync w/ 3.1 changes
keith-turner File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure of the best way forward here. As a design goal was attempting to make this compute method independent of something like an accumulo client and a client context, however ran into a problem with that design goal with the crypto service.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can look at how the rfile PrintInfo command does this. It calls:
As a server-side utility, you could make certain assumptions that it has the ability to read the accumulo properties file on the server side, like that utility does. However, as a purely client-side API, you may need to just pass in the CryptoService directly, or pass in other options, so it can set up the right config (crypto, compression, etc.) to be able to read the files.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Made an update in 9ef0bcf to pass in a map of props which is passed to the Rfile api which internally calls
CryptoFactoryLoader.getServiceForClient
using that map of properties.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That could work, since it's a static entry point to building a load plan. If it was dangling off AccumuloClient, users might expect client properties to be passed. But for the static entry point, I think it's reasonable to require them to be provided explicitly.
I like how you were able to use the existing RFile.newScanner() code.