[SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark #4831

yanboliang · 2015-02-28T08:30:17Z

Currently LogisticRegressionWithLBFGS in python/pyspark/mllib/classification.py will invoke callMLlibFunc with a wrong "regType" parameter.
It was assigned to "str(regType)" which translate None(Python) to "None"(Java/Scala). The right way should be translate None(Python) to null(Java/Scala) just as what we did at LogisticRegressionWithSGD.

AmplabJenkins · 2015-02-28T08:32:09Z

Can one of the admins verify this patch?

mengxr · 2015-03-01T18:41:36Z

add to whitelist

mengxr · 2015-03-01T18:41:41Z

ok to test

SparkQA · 2015-03-01T18:42:40Z

Test build #28150 has started for PR 4831 at commit 12db65a.

This patch merges cleanly.

mengxr · 2015-03-01T18:44:48Z

python/pyspark/mllib/classification.py

Shall we use str(regType) if regType else None?

I think directly use regType is enough, because py4j can translate "l1","l2",None in Python to "l1","l2",null in Java/Scala smoothly.
I found the peer functions LogisticRegressionWithSGD and SVMWithSGD also directly use regType and it can work well.

The difference is the error message. If we use str(...), the Java function call would be successful and inside the Java function, we say the input regType is not recognized.

If we don't use str(..), if users input something like 1, 2. Py4j will throw an error saying there is no Java function matching the input signature, which usually confuses users.

Anyway, this is a minor issue for regType.

SparkQA · 2015-03-01T20:01:32Z

Test build #28150 has finished for PR 4831 at commit 12db65a.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2015-03-01T20:01:36Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28150/
Test PASSed.

mengxr · 2015-03-02T18:18:13Z

LGTM. Merged into master and branch-1.3. Thanks!

…rameter for pyspark Currently LogisticRegressionWithLBFGS in python/pyspark/mllib/classification.py will invoke callMLlibFunc with a wrong "regType" parameter. It was assigned to "str(regType)" which translate None(Python) to "None"(Java/Scala). The right way should be translate None(Python) to null(Java/Scala) just as what we did at LogisticRegressionWithSGD. Author: Yanbo Liang <[email protected]> Closes #4831 from yanboliang/pyspark_classification and squashes the following commits: 12db65a [Yanbo Liang] correct LogisticRegressionWithLBFGS regType parameter for pyspark (cherry picked from commit af2effd) Signed-off-by: Xiangrui Meng <[email protected]>

correct LogisticRegressionWithLBFGS regType parameter for pyspark

12db65a

yanboliang changed the title ~~correct LogisticRegressionWithLBFGS regType parameter for pyspark~~ [SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark Feb 28, 2015

mengxr reviewed Mar 1, 2015
View reviewed changes

asfgit closed this in af2effd Mar 2, 2015

yanboliang deleted the pyspark_classification branch April 24, 2015 10:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark #4831

[SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark #4831

Uh oh!

yanboliang commented Feb 28, 2015

Uh oh!

AmplabJenkins commented Feb 28, 2015

Uh oh!

mengxr commented Mar 1, 2015

Uh oh!

mengxr commented Mar 1, 2015

Uh oh!

SparkQA commented Mar 1, 2015

Uh oh!

mengxr Mar 1, 2015

Uh oh!

yanboliang Mar 2, 2015

Uh oh!

mengxr Mar 2, 2015

Uh oh!

SparkQA commented Mar 1, 2015

Uh oh!

AmplabJenkins commented Mar 1, 2015

Uh oh!

mengxr commented Mar 2, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark #4831

[SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark #4831

Uh oh!

Conversation

yanboliang commented Feb 28, 2015

Uh oh!

AmplabJenkins commented Feb 28, 2015

Uh oh!

mengxr commented Mar 1, 2015

Uh oh!

mengxr commented Mar 1, 2015

Uh oh!

SparkQA commented Mar 1, 2015

Uh oh!

mengxr Mar 1, 2015

Choose a reason for hiding this comment

Uh oh!

yanboliang Mar 2, 2015

Choose a reason for hiding this comment

Uh oh!

mengxr Mar 2, 2015

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Mar 1, 2015

Uh oh!

AmplabJenkins commented Mar 1, 2015

Uh oh!

mengxr commented Mar 2, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants