
Conversation

@viirya
Member

@viirya viirya commented May 14, 2015

@SparkQA

SparkQA commented May 14, 2015

Test build #32707 has finished for PR 6146 at commit 4dec469.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@marmbrus
Contributor

Please explain your change. Neither the JIRA nor the PR description describes why the original code is incorrect.

Member Author


When a non-partition-key attribute doesn't exist in the partition's schema (as determined by the partition's StructObjectInspector), we should produce a null field ref. Previously, we didn't check for this and directly called getStructFieldRef to look up the field ref, which causes the error reported in the JIRA.
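The idea of the fix can be sketched in a minimal, self-contained form. The class and method names below are hypothetical stand-ins for Hive's StructObjectInspector API, not the actual Spark/Hive code:

```scala
// Hypothetical sketch of the fix: probe the partition's field list before
// resolving a field ref, instead of letting the lookup throw.
case class FieldRef(name: String)

class PartitionInspector(fields: Seq[FieldRef]) {
  // Hive's getStructFieldRef throws when the field is absent.
  def getStructFieldRef(name: String): FieldRef =
    fields.find(_.name == name)
      .getOrElse(throw new IllegalArgumentException(s"cannot find field $name"))

  // Proposed behavior: yield None for a column that was added to the table
  // after this partition was written, so the reader can emit null.
  def fieldRefOrNull(name: String): Option[FieldRef] =
    fields.find(_.name == name)
}

object Demo {
  def main(args: Array[String]): Unit = {
    // Partition written before column "c" was added to the table schema.
    val partition = new PartitionInspector(Seq(FieldRef("a"), FieldRef("b")))
    println(partition.fieldRefOrNull("a")) // Some(FieldRef(a))
    println(partition.fieldRefOrNull("c")) // None, instead of an exception
  }
}
```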

Conflicts:
	sql/hive/src/main/scala/org/apache/spark/sql/hive/TableReader.scala
@SparkQA

SparkQA commented Jun 19, 2015

Test build #35281 has finished for PR 6146 at commit 21e3c2c.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Jun 19, 2015

Test build #35292 has finished for PR 6146 at commit eed7c8b.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class SerializableConfiguration(@transient var value: Configuration) extends Serializable
    • class SerializableJobConf(@transient var value: JobConf) extends Serializable

@marmbrus
Contributor

What about a test case?

@viirya
Member Author

viirya commented Jun 20, 2015

@marmbrus Although the code looks problematic, I mentioned on the JIRA that I can't reproduce the problem. After more testing and searching through the code, I found that your PR #5876 (https://github.com/apache/spark/pull/5876/files#diff-ee66e11b56c21364760a5ed2b783f863R620) changed how Hive partition objects are produced: a partition's schema is now populated from the table schema.

So any newly added columns in a Hive table will also appear in its partitions' schemas, which is why I can't reproduce this problem: every non-partition-key attribute always exists in the table's partitions. The reported bug was therefore already fixed by PR #5876, so I'll close this PR now.
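The schema-population behavior described above can be illustrated with a tiny sketch (the names are illustrative, not the actual TableReader code):

```scala
// Illustrative sketch: after PR #5876 a partition's column list is derived
// from the table schema, so a column added to the table later still appears
// in every partition's schema and no missing-field lookup can occur.
object PartitionSchemaDemo {
  def main(args: Array[String]): Unit = {
    val tableColumns = Seq("a", "b", "newCol") // "newCol" added after the partition was written
    val partitionColumns = tableColumns        // partition schema populated from table schema
    println(partitionColumns.contains("newCol")) // true
  }
}
```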

@viirya viirya closed this Jun 20, 2015
@marmbrus
Contributor

Thanks for following up!

@viirya viirya deleted the skip_new_column branch December 27, 2023 18:17