[SPARK-6330] Fix filesystem bug in newParquet relation #5020

Closed
wants to merge 1 commit into from

Conversation

vlyubin (Contributor) commented Mar 13, 2015

If I run the new Parquet relation locally with a path that points to S3, it currently errors out because the wrong FS (Hadoop FileSystem) is used.
I tested the change in a scenario that previously failed, and it appears to fix the issue.
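
The failure mode described above can be illustrated without a Hadoop dependency. The sketch below is a simplified model, not Hadoop's implementation: `SchemeBoundFs`, `FsResolution`, and the bucket name are hypothetical. It only mimics how a filesystem handle bound to one URI scheme rejects a path from another scheme, and how deriving the handle from the path's own URI avoids the mismatch.

```scala
import java.net.URI

// Hypothetical stand-in for a filesystem handle bound to a single URI scheme.
// It only models the scheme check; it performs no I/O.
final case class SchemeBoundFs(scheme: String) {
  // Rejects paths whose scheme differs from the one this handle was created for.
  // Scheme-less paths are treated as belonging to this handle.
  def checkPath(path: URI): Unit = {
    val pathScheme = Option(path.getScheme).getOrElse(scheme)
    require(pathScheme == scheme, s"Wrong FS: $path, expected scheme: $scheme")
  }
}

object FsResolution {
  // Models a lookup driven by configuration only (the default filesystem).
  def defaultFs(defaultScheme: String): SchemeBoundFs =
    SchemeBoundFs(defaultScheme)

  // Models a lookup driven by the path's own URI,
  // falling back to the default scheme for scheme-less paths.
  def resolve(path: URI, defaultScheme: String): SchemeBoundFs =
    SchemeBoundFs(Option(path.getScheme).getOrElse(defaultScheme))
}
```

Under these assumptions, `FsResolution.defaultFs("file").checkPath(URI.create("s3n://bucket/table"))` throws an `IllegalArgumentException`, while `FsResolution.resolve(...)` on the same URI yields a handle whose check passes.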

@@ -262,6 +262,7 @@ private[sql] case class ParquetRelation2(

      // Lists `FileStatus`es of all leaf nodes (files) under all base directories.
      val leaves = baseStatuses.flatMap { f =>
        val fs = FileSystem.get(URI.create(f.getPath.toString), sparkContext.hadoopConfiguration)

If f here is a FileStatus, then I think you can do f.getPath.toUri, perhaps?
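
One possible reason to prefer an existing URI over re-parsing a path string can be shown with the JDK alone. The sketch below assumes that a path's string form may contain unescaped characters such as spaces; that is an assumption about Hadoop's `Path.toString`, not something stated in this thread. `UriRoundTrip` and the example paths are hypothetical names used only for illustration.

```scala
import java.net.URI
import scala.util.Try

object UriRoundTrip {
  // Builds a URI from components; this constructor quotes characters
  // that are illegal in a URI path, such as spaces.
  def fromComponents(scheme: String, authority: String, path: String): URI =
    new URI(scheme, authority, path, null, null)

  // Re-parses a plain, unescaped string. URI.create throws
  // IllegalArgumentException on illegal characters, captured here as a Failure.
  def fromUnescapedString(s: String): Try[URI] =
    Try(URI.create(s))
}
```

With these helpers, `fromComponents("s3n", "bucket", "/my dir/part-0.parquet")` succeeds and keeps the scheme intact, whereas `fromUnescapedString("s3n://bucket/my dir/part-0.parquet")` fails. Reusing a URI that already exists avoids the string round trip altogether.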

SparkQA commented Mar 13, 2015

Test build #28592 has started for PR 5020 at commit bae5433.

  • This patch merges cleanly.

SparkQA commented Mar 13, 2015

Test build #28594 has started for PR 5020 at commit a645ad5.

  • This patch merges cleanly.

aarondav (Contributor) commented

LGTM, thanks!

SparkQA commented Mar 13, 2015

Test build #28592 has finished for PR 5020 at commit bae5433.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

AmplabJenkins commented

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28592/

SparkQA commented Mar 13, 2015

Test build #28594 has finished for PR 5020 at commit a645ad5.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

AmplabJenkins commented

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28594/

@asfgit asfgit closed this in d19efed Mar 16, 2015
asfgit pushed a commit that referenced this pull request Mar 16, 2015
If I'm running this locally and my path points to S3, this would currently error out because of incorrect FS.
I tested this in a scenario that previously didn't work, this change seemed to fix the issue.

Author: Volodymyr Lyubinets <vlyubin@gmail.com>

Closes #5020 from vlyubin/parquertbug and squashes the following commits:

a645ad5 [Volodymyr Lyubinets] Fix filesystem bug in newParquet relation

aarondav (Contributor) commented

Merged into master and branch-1.3, thanks!

christopherbozeman pushed a commit to christopherbozeman/spark that referenced this pull request Mar 25, 2015
mingyukim pushed a commit to palantir/spark that referenced this pull request Mar 27, 2015