[SPARKR][SPARK-14831]Make the SparkR MLlib API more consistent with Spark #12789

thunterdb · 2016-04-29T22:14:58Z

What changes were proposed in this pull request?

This PR splits the MLlib algorithms into two flavors:

the R flavor, which tries to mimic the existing R API for these algorithms (and works as an S4 specialization for Spark dataframes)
the Spark flavor, which follows the same API and naming conventions as the rest of the MLlib algorithms in the other languages

In practice, the former calls the latter.
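The relationship between the two flavors can be sketched in plain R, without a Spark installation. This is only an illustration under stated assumptions: `ToyDataFrame` is a hypothetical stand-in for a Spark DataFrame, and the `spark.glm` defined here is a toy local function that fits with `stats::glm`, not the SparkR implementation from this PR.

```r
library(methods)

# Hypothetical stand-in for a Spark DataFrame (assumption, not a SparkR class).
setClass("ToyDataFrame", representation(data = "data.frame"))

# "Spark flavor": data first, MLlib-style name.
# Toy version: fits locally with stats::glm instead of calling Spark.
spark.glm <- function(data, formula, family = gaussian) {
  stopifnot(is(data, "ToyDataFrame"))
  stats::glm(formula, family = family, data = data@data)
}

# "R flavor": promote the existing stats::glm to an S4 generic and add a
# specialization for the toy class that simply delegates to the Spark flavor.
setGeneric("glm")
setMethod("glm",
          signature(formula = "formula", family = "ANY", data = "ToyDataFrame"),
          function(formula, family = gaussian, data, ...) {
            spark.glm(data, formula, family = family)
          })

tdf <- new("ToyDataFrame", data = mtcars)
rFlavor <- glm(mpg ~ wt, data = tdf)      # dispatches to the S4 method
sparkFlavor <- spark.glm(tdf, mpg ~ wt)   # direct call
```

Both calls go through the same fitting code, so the R-style entry point stays a thin wrapper while the `spark.`-prefixed function carries the logic.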

How was this patch tested?

The tests for the various algorithms were adapted to be run against both interfaces.

SparkQA · 2016-04-30T00:12:00Z

Test build #57371 has finished for PR 12789 at commit 46d9d68.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-04-30T03:17:36Z

Test build #57386 has finished for PR 12789 at commit 3b176d9.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-04-30T04:32:47Z

Test build #57407 has finished for PR 12789 at commit f57db82.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

mengxr · 2016-04-30T06:14:25Z

I merged this into master. Thanks! There are some minor issues that I will address in a follow-up PR:

ml.save/ml.load should be renamed to write.ml/read.ml, to be consistent with write.df/read.df
the param `data` should be called `df`

## What changes were proposed in this pull request?

Continue the work of #12789 to rename ml.save/ml.load to write.ml/read.ml, which are more consistent with read.df/write.df and other methods in SparkR. I didn't rename `data` to `df` because we still use `predict` for prediction, which uses `newData` to match the signature in R.

## How was this patch tested?

Existing unit tests.

cc: yanboliang thunterdb

Author: Xiangrui Meng <[email protected]>

Closes #12807 from mengxr/SPARK-14831.

thunterdb added 5 commits April 29, 2016 12:18

started with glm

93d4b56

trying to add a test

88b3f22

trying this

46d9d68

more work

1aadc72

adding the rest of the tests

822b6ab

thunterdb added 3 commits April 29, 2016 17:59

adding k-means

ea0bba0

adding k-means test

ea80f0c

adding spark.kmeans and spark.survreg

3b176d9

thunterdb changed the title ~~[WIP][SPARKR][SPARK-14831]Make the SparkR MLlib API more consistent with Spark~~ [SPARKR][SPARK-14831]Make the SparkR MLlib API more consistent with Spark Apr 30, 2016

thunterdb added 4 commits April 29, 2016 21:08

misunderstanding

8789cf1

mistake

82a75c1

doc fixes

f284faa

remove some changes

f57db82

asfgit closed this in bc36fe6 Apr 30, 2016

mengxr mentioned this pull request Apr 30, 2016

[SPARK-14831.2] [ML] [R] rename ml.save/ml.load to write.ml/read.ml #12807

Closed
