Skip to content

Conversation

@yhuai
Copy link
Contributor

@yhuai yhuai commented Sep 18, 2015

Intersect and Except are both set operators and they use the all the columns to compare equality between rows. When pushing their Project parent down, the relations they based on would change, therefore not an equivalent transformation.

JIRA: https://issues.apache.org/jira/browse/SPARK-10539

I added some comments based on the fix of #8742.

@yhuai
Copy link
Contributor Author

yhuai commented Sep 18, 2015

The fix is good. I only added comments. Will merge it to both master and branch 1.5 once it passes jenkins.

@SparkQA
Copy link

SparkQA commented Sep 18, 2015

Test build #42675 has finished for PR 8823 at commit 1f56d2e.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@yhuai
Copy link
Contributor Author

yhuai commented Sep 18, 2015

Merging to master and branch 1.5. Thanks @yjshen

@asfgit asfgit closed this in c6f8135 Sep 18, 2015
asfgit pushed a commit that referenced this pull request Sep 18, 2015
…ct or Except #8742

Intersect and Except are both set operators and they use the all the columns to compare equality between rows. When pushing their Project parent down, the relations they based on would change, therefore not an equivalent transformation.

JIRA: https://issues.apache.org/jira/browse/SPARK-10539

I added some comments based on the fix of #8742.

Author: Yijie Shen <[email protected]>
Author: Yin Huai <[email protected]>

Closes #8823 from yhuai/fix_set_optimization.

(cherry picked from commit c6f8135)
Signed-off-by: Yin Huai <[email protected]>

Conflicts:
	sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala
ashangit pushed a commit to ashangit/spark that referenced this pull request Oct 19, 2016
…ct or Except apache#8742

Intersect and Except are both set operators and they use the all the columns to compare equality between rows. When pushing their Project parent down, the relations they based on would change, therefore not an equivalent transformation.

JIRA: https://issues.apache.org/jira/browse/SPARK-10539

I added some comments based on the fix of apache#8742.

Author: Yijie Shen <[email protected]>
Author: Yin Huai <[email protected]>

Closes apache#8823 from yhuai/fix_set_optimization.

(cherry picked from commit c6f8135)
Signed-off-by: Yin Huai <[email protected]>

Conflicts:
	sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala

(cherry picked from commit 3df52cc)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants