Skip to content

[SPARK-5852][SQL]Fail to convert a newly created empty metastore parquet table to a data source parquet table. #4655

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 7 commits into from

Conversation

yhuai
Copy link
Contributor

@yhuai yhuai commented Feb 17, 2015

The problem is that after we create an empty hive metastore parquet table (e.g. CREATE TABLE test (a int) STORED AS PARQUET), Hive will create an empty dir for us, which cause our data source ParquetRelation2 fail to get the schema of the table. See JIRA for the case to reproduce the bug and the exception.

This PR is based on #4562 from @chenghao-intel.

JIRA: https://issues.apache.org/jira/browse/SPARK-5852

@SparkQA
Copy link

SparkQA commented Feb 17, 2015

Test build #27640 has started for PR 4655 at commit b8b3450.

  • This patch merges cleanly.

@SparkQA
Copy link

SparkQA commented Feb 17, 2015

Test build #27640 has finished for PR 4655 at commit b8b3450.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27640/
Test PASSed.

asfgit pushed a commit that referenced this pull request Feb 17, 2015
…uet table to a data source parquet table.

The problem is that after we create an empty hive metastore parquet table (e.g. `CREATE TABLE test (a int) STORED AS PARQUET`), Hive will create an empty dir for us, which cause our data source `ParquetRelation2` fail to get the schema of the table. See JIRA for the case to reproduce the bug and the exception.

This PR is based on #4562 from chenghao-intel.

JIRA: https://issues.apache.org/jira/browse/SPARK-5852

Author: Yin Huai <yhuai@databricks.com>
Author: Cheng Hao <hao.cheng@intel.com>

Closes #4655 from yhuai/CTASParquet and squashes the following commits:

b8b3450 [Yin Huai] Update tests.
2ac94f7 [Yin Huai] Update tests.
3db3d20 [Yin Huai] Minor update.
d7e2308 [Yin Huai] Revert changes in HiveMetastoreCatalog.scala.
36978d1 [Cheng Hao] Update the code as feedback
a04930b [Cheng Hao] fix bug of scan an empty parquet based table
442ffe0 [Cheng Hao] passdown the schema for Parquet File in HiveContext

(cherry picked from commit 117121a)
Signed-off-by: Michael Armbrust <michael@databricks.com>
@asfgit asfgit closed this in 117121a Feb 17, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants