[WIP][SPARK-2883][SQL]initial support ORC in spark sql #2576
Closed
45 commits
fb14a06 initial support orc in spark sql (scwf)
ec3cdaf add unit tests (scwf)
7126290 add orc to example of spark sql (scwf)
655b23f merge with apache/master and fix conflict (scwf)
1505af4 fix according comments and move orc to hive sub project (scwf)
1db30b1 fix scala style (scwf)
9529d68 address comments_1 (scwf)
25416c3 update with apache/spark master and fix conflicts (scwf)
89421ef log.info -> logInfo (scwf)
40f9d68 address comments (scwf)
b2b5455 Merge branch 'master' of https://github.com/apache/spark into orc (scwf)
349bb92 reuse TableReader (scwf)
65483e8 fix InsertIntoOrcTable (scwf)
629f95e address comments (scwf)
d1bba23 added test for read orc schema (scwf)
b46ecb1 add test files (scwf)
f68bea0 scala style fixed (scwf)
b06e335 revert no related changes (scwf)
7bdc503 Merge branch 'test' into orc (scwf)
4d0950b more tests (scwf)
37a2192 tests suite fix (scwf)
4e1c839 minor fix (scwf)
b8495d5 add test case for reading/righting empty RDD (scwf)
f680da0 no use method (scwf)
269b15f make wrap consistent with InsertIntoHiveTable.wrapperFor (zhzhan)
d06fe5b Merge pull request #4 from zhzhan/orc1 (scwf)
9bd3c85 First draft. (marmbrus)
5f5fda8 https -> http in pom (scwf)
2421f24 Merge branch 'mvn' into parquet (scwf)
5d7f863 logging / formatting improvements. (marmbrus)
94e0d40 Add an experimental interface to data sources that exposes catalyst e… (marmbrus)
dd78aa7 Alternative implementation of parquet based on the datasources API. (marmbrus)
244ab59 Merge branch 'newParquet' of https://github.com/marmbrus/spark into p… (scwf)
a37f6a8 wf comment (scwf)
1d6856e Merge branch 'master' of https://github.com/apache/spark into mvn (scwf)
b8e6f84 Merge branch 'parquet' into newOrc (scwf)
e521e6a draft for datasource api (scwf)
9ba04ac Merge branch 'master' into newOrc (scwf)
c90ed2f test case (scwf)
b6ae12b fix test (scwf)
3ca68eb no used bin file (scwf)
abf1b78 update with apache master (scwf)
1e0c1d9 Revert "https -> http in pom" (scwf)
c5236ef ppd support (zhzhan)
601d242 Merge pull request #13 from zhzhan/orc1 (scwf)
I think `FileSystem.globStatus` can be convenient and more efficient here.

But `globStatus` does not list the files under the path; here we need to list the *.orc or *.parquet files under this directory.
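
For reference, the two approaches under discussion can be sketched side by side. This is a minimal illustration, not the code from this patch: it assumes Hadoop 2.x or later (for `FileStatus.isFile`), and `DataFileListing`, `listWithFilter`, `listWithGlob`, and the suffix list are hypothetical names introduced here. It does not settle which variant is more efficient.

```scala
import org.apache.hadoop.fs.{FileStatus, FileSystem, Path, PathFilter}

// Hypothetical helper object, for illustration only.
object DataFileListing {
  // Suffixes taken from the review comment above; an assumption, not the patch's list.
  private val suffixes = Seq(".orc", ".parquet")

  /** Variant 1: listStatus with a PathFilter that keeps only matching file names. */
  def listWithFilter(fs: FileSystem, dir: Path): Seq[Path] = {
    val filter = new PathFilter {
      override def accept(p: Path): Boolean =
        suffixes.exists(s => p.getName.endsWith(s))
    }
    // listStatus returns the direct children of dir that the filter accepts.
    fs.listStatus(dir, filter).filter(_.isFile).map(_.getPath).toSeq
  }

  /** Variant 2: globStatus with a pattern rooted at the directory. */
  def listWithGlob(fs: FileSystem, dir: Path): Seq[Path] = {
    // globStatus can return null in some cases (for example a non-glob path
    // that does not exist), so guard defensively before filtering.
    val matched = Option(fs.globStatus(new Path(dir, "*.{orc,parquet}")))
      .getOrElse(Array.empty[FileStatus])
    matched.filter(_.isFile).map(_.getPath).toSeq
  }
}
```

Given a `FileSystem` obtained with `dir.getFileSystem(conf)`, both helpers return only the data files directly under `dir`. Note that `globStatus` expands a pattern rather than listing a directory, so the pattern has to be built from the directory path, as in the second variant.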