@yhuai yhuai commented Feb 17, 2015

The problem is that after we create an empty Hive metastore Parquet table (e.g. CREATE TABLE test (a int) STORED AS PARQUET), Hive creates an empty directory for it, which causes our data source ParquetRelation2 to fail to get the schema of the table. See the JIRA for a case that reproduces the bug and for the resulting exception.
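To illustrate the failure mode, here is a minimal, self-contained sketch of schema resolution for a table directory that may contain no data files. It is not the code in this patch and does not use Spark's API: `Field`, `Schema`, `EmptyTableSchema`, `dataFiles`, `resolveSchema`, and the `readFooter` parameter are all hypothetical names invented for the example. It assumes the schema is normally read from a data file, and that a caller-supplied schema (for instance, the one declared in the metastore) can be used when the directory is empty.

```scala
import java.nio.file.{Files, Path}
import scala.collection.JavaConverters._

// Hypothetical stand-ins for illustration only; these are not Spark classes.
final case class Field(name: String, dataType: String)
final case class Schema(fields: Seq[Field])

object EmptyTableSchema {
  // Lists the files ending in ".parquet" directly under the table directory.
  def dataFiles(tableDir: Path): List[Path] = {
    val stream = Files.list(tableDir)
    try stream.iterator().asScala
      .filter(_.getFileName.toString.endsWith(".parquet"))
      .toList // materialize before the stream is closed
    finally stream.close()
  }

  // Prefers the schema read from a data file. If the directory is empty,
  // falls back to the declared schema; if neither is available, fails
  // with an explicit error.
  def resolveSchema(
      tableDir: Path,
      declaredSchema: Option[Schema],
      readFooter: Path => Schema): Schema =
    dataFiles(tableDir).headOption
      .map(readFooter)
      .orElse(declaredSchema)
      .getOrElse(throw new IllegalStateException(
        s"No data files under $tableDir and no schema was supplied"))
}
```

With an empty directory, `EmptyTableSchema.resolveSchema(dir, Some(Schema(Seq(Field("a", "int")))), readFooter)` returns the declared schema without calling `readFooter`, while passing `None` raises the `IllegalStateException`. The latter is analogous to the situation described above, where an empty directory leaves nothing to read the schema from.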

This PR is based on #4562 from @chenghao-intel.

JIRA: https://issues.apache.org/jira/browse/SPARK-5852

SparkQA commented Feb 17, 2015

Test build #27640 has started for PR 4655 at commit b8b3450.

  • This patch merges cleanly.

SparkQA commented Feb 17, 2015

Test build #27640 has finished for PR 4655 at commit b8b3450.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27640/

asfgit pushed a commit that referenced this pull request Feb 17, 2015
…uet table to a data source parquet table.

The problem is that after we create an empty Hive metastore Parquet table (e.g. `CREATE TABLE test (a int) STORED AS PARQUET`), Hive creates an empty directory for it, which causes our data source `ParquetRelation2` to fail to get the schema of the table. See the JIRA for a case that reproduces the bug and for the resulting exception.

This PR is based on #4562 from chenghao-intel.

JIRA: https://issues.apache.org/jira/browse/SPARK-5852

Author: Yin Huai <[email protected]>
Author: Cheng Hao <[email protected]>

Closes #4655 from yhuai/CTASParquet and squashes the following commits:

b8b3450 [Yin Huai] Update tests.
2ac94f7 [Yin Huai] Update tests.
3db3d20 [Yin Huai] Minor update.
d7e2308 [Yin Huai] Revert changes in HiveMetastoreCatalog.scala.
36978d1 [Cheng Hao] Update the code as feedback
a04930b [Cheng Hao] fix bug of scan an empty parquet based table
442ffe0 [Cheng Hao] passdown the schema for Parquet File in HiveContext

(cherry picked from commit 117121a)
Signed-off-by: Michael Armbrust <[email protected]>
@asfgit asfgit closed this in 117121a Feb 17, 2015