[SPARK-15444][PySpark][ML][HotFix] Default value mismatch of param linkPredictionCol for GeneralizedLinearRegression #13220
|
@viirya ah thanks I should have picked up the discrepancy. cc @holdenk @yanboliang |
      "org.apache.spark.ml.regression.GeneralizedLinearRegression", self.uid)
- self._setDefault(family="gaussian", maxIter=25, tol=1e-6, regParam=0.0, solver="irls",
-                  linkPredictionCol="")
+ self._setDefault(family="gaussian", maxIter=25, tol=1e-6, regParam=0.0, solver="irls")
Perhaps we should then make the default in __init__ and setParams be None rather than ""?
I was not sure about this, so I just followed similar params in other classes.
Sounds good. Let me update it.
I think the setDefault change you have is fine - I mean the default
constructor param
On Fri, 20 May 2016 at 11:37, Liang-Chi Hsieh [email protected] wrote:
> In python/pyspark/ml/regression.py, #13220 (comment):
> @@ -1312,8 +1312,7 @@ def __init__(self, labelCol="label", featuresCol="features", predictionCol="pred
>   super(GeneralizedLinearRegression, self).__init__()
>   self._java_obj = self._new_java_obj(
>       "org.apache.spark.ml.regression.GeneralizedLinearRegression", self.uid)
> - self._setDefault(family="gaussian", maxIter=25, tol=1e-6, regParam=0.0, solver="irls",
> -                  linkPredictionCol="")
> + self._setDefault(family="gaussian", maxIter=25, tol=1e-6, regParam=0.0, solver="irls")
>
> I was not sure about this. So just following other similar params in other classes.
Yea. Updated. Thanks.
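For readers following the review thread above, here is a rough sketch of why a None constructor default reads more cleanly than "" for an optional column param. ToyEstimator, set_params and is_defined are hypothetical names made up for illustration; this is a toy model, not PySpark's Params machinery, and the thread does not say how PySpark itself treats None.

```python
class ToyEstimator:
    """Hypothetical estimator used only for illustration; not a PySpark class."""

    def __init__(self, family="gaussian", linkPredictionCol=None):
        # Defaults declared on the estimator itself (compare _setDefault in the diff).
        self._defaults = {"family": "gaussian"}
        # Values the caller passed explicitly.
        self._explicit = {}
        self.set_params(family=family, linkPredictionCol=linkPredictionCol)

    def set_params(self, **kwargs):
        # Assumption of this toy model: None means "not provided" and is skipped.
        for name, value in kwargs.items():
            if value is not None:
                self._explicit[name] = value
        return self

    def is_defined(self, name):
        # A param is defined if it was set explicitly or has a declared default.
        return name in self._explicit or name in self._defaults


print(ToyEstimator().is_defined("linkPredictionCol"))
print(ToyEstimator(linkPredictionCol="").is_defined("linkPredictionCol"))
```

With None as the signature default, constructing the toy estimator without arguments leaves linkPredictionCol undefined, whereas passing "" marks it as set to an empty string.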
|
Test build #58973 has finished for PR 13220 at commit
|
|
Fortunately it doesn't actually impact since there is a |
|
@MLnick The unit tests in ml.tests will fail because of this. I think that should be the only impact. |
|
Test build #58975 has finished for PR 13220 at commit
|
|
@MLnick Please see if this is fine for you now. Thanks. |
|
This fix looks good. Thanks! @viirya |
|
Thank you @yanboliang |
|
ping @MLnick |
…nkPredictionCol for GeneralizedLinearRegression

## What changes were proposed in this pull request?

Default value mismatch of param linkPredictionCol for GeneralizedLinearRegression between PySpark and Scala. That is because default value conflict between #13106 and #13129. This causes ml.tests failed.

## How was this patch tested?

Existing tests.

Author: Liang-Chi Hsieh <[email protected]>

Closes #13220 from viirya/hotfix-regresstion.

(cherry picked from commit 4e73933)
Signed-off-by: Nick Pentreath <[email protected]>
|
Merged to master/branch-2.0. Thanks @viirya! |
|
Thank you @MLnick |
|
Thanks for fixing this @viirya :) |
What changes were proposed in this pull request?
The default value of param linkPredictionCol for GeneralizedLinearRegression does not match between PySpark and Scala. The mismatch comes from a default value conflict between #13106 and #13129, and it causes ml.tests to fail.
How was this patch tested?
Existing tests.