Revert "[SPARK-35028][SQL] ANSI mode: disallow group by aliases" #33758

gengliangwang · 2021-08-17T04:46:41Z

What changes were proposed in this pull request?

Revert [SPARK-35028][SQL] ANSI mode: disallow group by aliases

Why are the changes needed?

It turns out that many users are using the group by alias feature. Spark has its precedence rule when alias names conflict with column names in Group by clause: always use the table column. This should be reasonable and acceptable.
Also, external DBMS such as PostgreSQL and MySQL allow grouping by alias, too.

As we are going to announce ANSI mode GA in Spark 3.2, I suggest allowing the group by alias in ANSI mode.

Does this PR introduce any user-facing change?

No, the feature is not released yet.

How was this patch tested?

Unit tests

gatorsmile · 2021-08-17T05:35:38Z

sql/core/src/test/resources/sql-tests/inputs/ansi/group-analytics.sql

@@ -1 +0,0 @@
--IMPORT group-analytics.sql


Do we need to remove the result file?

good catch!

Thanks, removed

SparkQA · 2021-08-17T05:39:13Z

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47034/

SparkQA · 2021-08-17T06:16:47Z

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47034/

SparkQA · 2021-08-17T09:29:02Z

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47049/

SparkQA · 2021-08-17T09:36:27Z

Test build #142533 has finished for PR 33758 at commit 76c697e.

This patch fails SparkR unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2021-08-17T10:08:03Z

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47049/

gengliangwang · 2021-08-17T12:22:40Z

Merging to master/3.2

### What changes were proposed in this pull request? Revert [[SPARK-35028][SQL] ANSI mode: disallow group by aliases ](#32129) ### Why are the changes needed? It turns out that many users are using the group by alias feature. Spark has its precedence rule when alias names conflict with column names in Group by clause: always use the table column. This should be reasonable and acceptable. Also, external DBMS such as PostgreSQL and MySQL allow grouping by alias, too. As we are going to announce ANSI mode GA in Spark 3.2, I suggest allowing the group by alias in ANSI mode. ### Does this PR introduce _any_ user-facing change? No, the feature is not released yet. ### How was this patch tested? Unit tests Closes #33758 from gengliangwang/revertGroupByAlias. Authored-by: Gengliang Wang <[email protected]> Signed-off-by: Gengliang Wang <[email protected]> (cherry picked from commit 8bfb4f1) Signed-off-by: Gengliang Wang <[email protected]>

SparkQA · 2021-08-17T13:50:32Z

Test build #142547 has finished for PR 33758 at commit 8e0aec3.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2021-08-17T15:38:00Z

+1, LGTM.

### What changes were proposed in this pull request? Revert [[SPARK-35028][SQL] ANSI mode: disallow group by aliases ](apache#32129) ### Why are the changes needed? It turns out that many users are using the group by alias feature. Spark has its precedence rule when alias names conflict with column names in Group by clause: always use the table column. This should be reasonable and acceptable. Also, external DBMS such as PostgreSQL and MySQL allow grouping by alias, too. As we are going to announce ANSI mode GA in Spark 3.2, I suggest allowing the group by alias in ANSI mode. ### Does this PR introduce _any_ user-facing change? No, the feature is not released yet. ### How was this patch tested? Unit tests Closes apache#33758 from gengliangwang/revertGroupByAlias. Authored-by: Gengliang Wang <[email protected]> Signed-off-by: Gengliang Wang <[email protected]> (cherry picked from commit 8bfb4f1) Signed-off-by: Gengliang Wang <[email protected]>

revert

76c697e

github-actions bot added DOCS SQL labels Aug 17, 2021

gengliangwang requested review from HyukjinKwon, cloud-fan and maropu August 17, 2021 04:47

HyukjinKwon approved these changes Aug 17, 2021

View reviewed changes

cloud-fan approved these changes Aug 17, 2021

View reviewed changes

gatorsmile reviewed Aug 17, 2021

View reviewed changes

remove group-analytics.sql.out

8e0aec3

gengliangwang closed this in 8bfb4f1 Aug 17, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Revert "[SPARK-35028][SQL] ANSI mode: disallow group by aliases" #33758

Revert "[SPARK-35028][SQL] ANSI mode: disallow group by aliases" #33758

Uh oh!

gengliangwang commented Aug 17, 2021 •

edited

Loading

Uh oh!

gatorsmile Aug 17, 2021

Uh oh!

cloud-fan Aug 17, 2021

Uh oh!

gengliangwang Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

gengliangwang commented Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

dongjoon-hyun commented Aug 17, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

		@@ -1 +0,0 @@
		--IMPORT group-analytics.sql No newline at end of file

Revert "[SPARK-35028][SQL] ANSI mode: disallow group by aliases" #33758

Revert "[SPARK-35028][SQL] ANSI mode: disallow group by aliases" #33758

Uh oh!

Conversation

gengliangwang commented Aug 17, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

gatorsmile Aug 17, 2021

Choose a reason for hiding this comment

Uh oh!

cloud-fan Aug 17, 2021

Choose a reason for hiding this comment

Uh oh!

gengliangwang Aug 17, 2021

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

gengliangwang commented Aug 17, 2021

Uh oh!

SparkQA commented Aug 17, 2021

Uh oh!

dongjoon-hyun commented Aug 17, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

gengliangwang commented Aug 17, 2021 •

edited

Loading