[SPARK-23523] [SQL] [FOLLOWUP] Minor refactor of OptimizeMetadataOnlyQuery #20693

jiangxb1987 · 2018-02-28T12:47:38Z

What changes were proposed in this pull request?

Inside OptimizeMetadataOnlyQuery.getPartitionAttrs, avoid using zip to generate attribute map.
Also include other minor update of comments and format.

How was this patch tested?

Existing test cases.

jiangxb1987 · 2018-02-28T12:50:51Z

cc @gatorsmile @cloud-fan

hvanhovell · 2018-02-28T13:28:23Z

LGTM - pending jenkins

SparkQA · 2018-02-28T14:33:38Z

Test build #87775 has finished for PR 20693 at commit a3cf3ca.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
case class LocalRelation(output: Seq[Attribute],

cloud-fan · 2018-02-28T15:10:34Z

sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/LocalRelation.scala

-    data: Seq[InternalRow] = Nil,
-    // Indicates whether this relation has data from a streaming source.
-    override val isStreaming: Boolean = false)
+case class LocalRelation(output: Seq[Attribute],


although we should not include this style change in the original commit, since it's already there, let's not bother about reverting it back.

cloud-fan · 2018-02-28T15:10:57Z

retest this please

cloud-fan · 2018-02-28T16:38:41Z

LGTM

gatorsmile

LGTM

SparkQA · 2018-02-28T18:29:56Z

Test build #87783 has finished for PR 20693 at commit a3cf3ca.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
case class LocalRelation(output: Seq[Attribute],

SparkQA · 2018-02-28T18:53:48Z

Test build #87787 has finished for PR 20693 at commit 0a3d84a.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
case class LocalRelation(

dongjoon-hyun

+1, LGTM.

## What changes were proposed in this pull request? Inside `OptimizeMetadataOnlyQuery.getPartitionAttrs`, avoid using `zip` to generate attribute map. Also include other minor update of comments and format. ## How was this patch tested? Existing test cases. Author: Xingbo Jiang <[email protected]> Closes apache#20693 from jiangxb1987/SPARK-23523.

…he rule OptimizeMetadataOnlyQuery This PR is to backport #20684 and #20693 to Spark 2.3 branch --- ## What changes were proposed in this pull request? ```Scala val tablePath = new File(s"${path.getCanonicalPath}/cOl3=c/cOl1=a/cOl5=e") Seq(("a", "b", "c", "d", "e")).toDF("cOl1", "cOl2", "cOl3", "cOl4", "cOl5") .write.json(tablePath.getCanonicalPath) val df = spark.read.json(path.getCanonicalPath).select("CoL1", "CoL5", "CoL3").distinct() df.show() ``` It generates a wrong result. ``` [c,e,a] ``` We have a bug in the rule `OptimizeMetadataOnlyQuery `. We should respect the attribute order in the original leaf node. This PR is to fix it. ## How was this patch tested? Added a test case Author: Xingbo Jiang <[email protected]> Author: gatorsmile <[email protected]> Closes #20763 from gatorsmile/backport23523.

## What changes were proposed in this pull request? Inside `OptimizeMetadataOnlyQuery.getPartitionAttrs`, avoid using `zip` to generate attribute map. Also include other minor update of comments and format. ## How was this patch tested? Existing test cases. Author: Xingbo Jiang <[email protected]> Closes apache#20693 from jiangxb1987/SPARK-23523.

refactor

a3cf3ca

cloud-fan reviewed Feb 28, 2018

View reviewed changes

revert changes

0a3d84a

gatorsmile approved these changes Feb 28, 2018

View reviewed changes

dongjoon-hyun approved these changes Feb 28, 2018

View reviewed changes

asfgit closed this in 25c2776 Feb 28, 2018

gatorsmile mentioned this pull request Mar 7, 2018

[SPARK-23523] [SQL] [BACKPORT-2.3] Fix the incorrect result caused by the rule OptimizeMetadataOnlyQuery #20763

Closed

mkressirer mentioned this pull request Mar 13, 2018

[SPARK-23523][SQL][BACKPORT-2.3] Fix the incorrect result caused by t… toasttab/spark#13

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-23523] [SQL] [FOLLOWUP] Minor refactor of OptimizeMetadataOnlyQuery #20693

[SPARK-23523] [SQL] [FOLLOWUP] Minor refactor of OptimizeMetadataOnlyQuery #20693

Uh oh!

jiangxb1987 commented Feb 28, 2018

Uh oh!

jiangxb1987 commented Feb 28, 2018

Uh oh!

hvanhovell commented Feb 28, 2018 •

edited

Loading

Uh oh!

SparkQA commented Feb 28, 2018

Uh oh!

cloud-fan Feb 28, 2018

Uh oh!

cloud-fan commented Feb 28, 2018

Uh oh!

cloud-fan commented Feb 28, 2018

Uh oh!

gatorsmile left a comment

Uh oh!

SparkQA commented Feb 28, 2018

Uh oh!

SparkQA commented Feb 28, 2018

Uh oh!

dongjoon-hyun left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[SPARK-23523] [SQL] [FOLLOWUP] Minor refactor of OptimizeMetadataOnlyQuery #20693

[SPARK-23523] [SQL] [FOLLOWUP] Minor refactor of OptimizeMetadataOnlyQuery #20693

Uh oh!

Conversation

jiangxb1987 commented Feb 28, 2018

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

jiangxb1987 commented Feb 28, 2018

Uh oh!

hvanhovell commented Feb 28, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SparkQA commented Feb 28, 2018

Uh oh!

cloud-fan Feb 28, 2018

Choose a reason for hiding this comment

Uh oh!

cloud-fan commented Feb 28, 2018

Uh oh!

cloud-fan commented Feb 28, 2018

Uh oh!

gatorsmile left a comment

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Feb 28, 2018

Uh oh!

SparkQA commented Feb 28, 2018

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

hvanhovell commented Feb 28, 2018 •

edited

Loading