[SPARK-12025] [SparkR] Rename some window rank function names for SparkR #10016

yanboliang · 2015-11-27T11:13:18Z

Change cumeDist -> cume_dist, denseRank -> dense_rank, percentRank -> percent_rank, rowNumber -> row_number at SparkR side.
There are two reasons that we should make this change:

We should follow the naming convention rule of R
Spark DataFrame has deprecated the old convention (such as cumeDist) and will remove it in Spark 2.0.

It's better to fix this issue before 1.6 release, otherwise we will make breaking API change.
cc @shivaram @sun-rui

SparkQA · 2015-11-27T11:41:45Z

Test build #46815 has finished for PR 10016 at commit c56a7bc.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

sun-rui · 2015-11-27T12:28:23Z

code change looks good. New names have no conflict with R base package.

But I don't see they are deprecated now? Could you point to me the discussion or JIRA that they will be removed in Spark 2.0?
If they are deprecated, I am fine with the name change.
If they are not deprecated, could we add new names as aliases, while keeping the old names at the same time?

yanboliang · 2015-11-27T13:27:08Z

@sun-rui Thanks for your comments. You can get the issue from the source code and this PR(#9930).

sun-rui · 2015-11-27T14:33:44Z

@yanboliang, thanks for your info. I didn't look at the latest code:)

LGTM

shivaram · 2015-11-27T19:47:22Z

LGTM. Thanks @yanboliang -- BTW were these APIs were present in SparkR 1.5 ? If so we'll need to update the release docs about this change.

Change ```cumeDist -> cume_dist, denseRank -> dense_rank, percentRank -> percent_rank, rowNumber -> row_number``` at SparkR side. There are two reasons that we should make this change: * We should follow the [naming convention rule of R](http://www.inside-r.org/node/230645) * Spark DataFrame has deprecated the old convention (such as ```cumeDist```) and will remove it in Spark 2.0. It's better to fix this issue before 1.6 release, otherwise we will make breaking API change. cc shivaram sun-rui Author: Yanbo Liang <[email protected]> Closes #10016 from yanboliang/SPARK-12025. (cherry picked from commit ba02f6c) Signed-off-by: Shivaram Venkataraman <[email protected]>

felixcheung · 2015-11-27T23:28:37Z

@shivaram they were added only for Spark 1.6, so no need to update release doc on breaking changes
dc3220c
40c77fb

But the new names seem to mask methods from dplyr.

Rename window rank function names for SparkR

c56a7bc

asfgit closed this in ba02f6c Nov 27, 2015

yanboliang deleted the SPARK-12025 branch November 29, 2015 12:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-12025] [SparkR] Rename some window rank function names for SparkR #10016

[SPARK-12025] [SparkR] Rename some window rank function names for SparkR #10016

Uh oh!

yanboliang commented Nov 27, 2015

Uh oh!

SparkQA commented Nov 27, 2015

Uh oh!

sun-rui commented Nov 27, 2015

Uh oh!

yanboliang commented Nov 27, 2015

Uh oh!

sun-rui commented Nov 27, 2015

Uh oh!

shivaram commented Nov 27, 2015

Uh oh!

felixcheung commented Nov 27, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[SPARK-12025] [SparkR] Rename some window rank function names for SparkR #10016

[SPARK-12025] [SparkR] Rename some window rank function names for SparkR #10016

Uh oh!

Conversation

yanboliang commented Nov 27, 2015

Uh oh!

SparkQA commented Nov 27, 2015

Uh oh!

sun-rui commented Nov 27, 2015

Uh oh!

yanboliang commented Nov 27, 2015

Uh oh!

sun-rui commented Nov 27, 2015

Uh oh!

shivaram commented Nov 27, 2015

Uh oh!

felixcheung commented Nov 27, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants