[SPARK-12077] [SQL] change the default plan for single distinct #10075

davies · 2015-12-01T21:33:27Z

Use try to match the behavior for single distinct aggregation with Spark 1.5, but that's not scalable, we should be robust by default, have a flag to address performance regression for low cardinality aggregation.

cc @yhuai @nongli

SparkQA · 2015-12-01T22:11:33Z

Test build #46994 has finished for PR 10075 at commit 192a04d.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2015-12-01T23:38:19Z

sql/core/src/main/scala/org/apache/spark/sql/SQLConf.scala

I actually prefer the old name better. Two years from now, I'm not sure how much context we will have about "1.5".

After the discussion with @nongli @yhuai , we agreed that the only reason we have this flag here is to address the concern about performance regression for single distinct on low cardinality aggregation since 1.5, also this is the way Oracle usually did.

I think the big difference is that Oracle releases every x years, while we release every 3 month...

But we are doing the same thing as maintaining compatibility and no (much) regressions, otherwise we should just remove this flag (it's not public right now).

This reverts commit 192a04d.

yhuai · 2015-12-02T01:01:04Z

LGTM pending jenkins.

SparkQA · 2015-12-02T01:34:01Z

Test build #2137 has finished for PR 10075 at commit 192a04d.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2015-12-02T02:50:05Z

Test build #47023 has finished for PR 10075 at commit 0a237b9.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2015-12-02T02:50:05Z

Test build #2142 has finished for PR 10075 at commit 893b327.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

yhuai · 2015-12-02T04:15:05Z

LGTM. Merging to master and branch 1.6.

Use try to match the behavior for single distinct aggregation with Spark 1.5, but that's not scalable, we should be robust by default, have a flag to address performance regression for low cardinality aggregation. cc yhuai nongli Author: Davies Liu <[email protected]> Closes #10075 from davies/agg_15. (cherry picked from commit 96691fe) Signed-off-by: Yin Huai <[email protected]>

change the default plan for single distinct

192a04d

fix test

893b327

rxin reviewed Dec 1, 2015
View reviewed changes

davies changed the title ~~[SPARK-10277] [SQL] change the default plan for single distinct~~ [SPARK-12077] [SQL] change the default plan for single distinct Dec 2, 2015

Davies Liu added 2 commits December 1, 2015 16:55

Revert "change the default plan for single distinct"

306b8de

This reverts commit 192a04d.

change default

0a237b9

asfgit closed this in 96691fe Dec 2, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-12077] [SQL] change the default plan for single distinct #10075

[SPARK-12077] [SQL] change the default plan for single distinct #10075

Uh oh!

davies commented Dec 1, 2015

Uh oh!

SparkQA commented Dec 1, 2015

Uh oh!

rxin Dec 1, 2015

Uh oh!

davies Dec 1, 2015

Uh oh!

rxin Dec 1, 2015

Uh oh!

davies Dec 2, 2015

Uh oh!

yhuai commented Dec 2, 2015

Uh oh!

SparkQA commented Dec 2, 2015

Uh oh!

SparkQA commented Dec 2, 2015

Uh oh!

SparkQA commented Dec 2, 2015

Uh oh!

yhuai commented Dec 2, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[SPARK-12077] [SQL] change the default plan for single distinct #10075

[SPARK-12077] [SQL] change the default plan for single distinct #10075

Uh oh!

Conversation

davies commented Dec 1, 2015

Uh oh!

SparkQA commented Dec 1, 2015

Uh oh!

rxin Dec 1, 2015

Choose a reason for hiding this comment

Uh oh!

davies Dec 1, 2015

Choose a reason for hiding this comment

Uh oh!

rxin Dec 1, 2015

Choose a reason for hiding this comment

Uh oh!

davies Dec 2, 2015

Choose a reason for hiding this comment

Uh oh!

yhuai commented Dec 2, 2015

Uh oh!

SparkQA commented Dec 2, 2015

Uh oh!

SparkQA commented Dec 2, 2015

Uh oh!

SparkQA commented Dec 2, 2015

Uh oh!

yhuai commented Dec 2, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants