
Conversation

@dbtsai (Member) commented Jul 21, 2014

(Note: This is not ready to be merged. It still needs documentation, and we need to make sure it's backward compatible with the Spark 1.0 APIs.)

The current implementation of regularization in the linear models uses Updater, and this design has a couple of issues:

  1. It penalizes all the weights, including the intercept. Typically, people don't penalize the intercept when training a model.
  2. The Updater also contains the adaptive step-size logic for gradient descent. We would like to clean this up by separating the regularization logic out of the Updater and into a Regularizer, so that the L-BFGS optimizer no longer needs the trick for recovering the loss and gradient of the objective function.
     In this work, a weighted regularizer will be implemented, and users can exclude the intercept or any other weight from regularization by giving that term a zero penalty weight. Since the regularizer will return a tuple of loss and gradient, the adaptive step-size logic and the L1 soft-thresholding in the Updater will be moved into the SGD optimizer. A rough sketch of the intended interface follows below.
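
A minimal sketch of what such a weighted regularizer could look like (the Regularizer base class name matches the QA output below; the compute signature and the WeightedL2Regularizer class are assumptions, not this PR's final API):

```scala
import org.apache.spark.mllib.linalg.{Vector, Vectors}

// Sketch only: the regularizer returns both the penalty added to the loss
// and its contribution to the gradient, so the optimizer no longer needs
// the Updater trick to recover them.
abstract class Regularizer extends Serializable {
  /** Returns (regularization loss, regularization gradient) for the given weights. */
  def compute(weights: Vector, regParam: Double): (Double, Vector)
}

// Weighted L2 penalty: each coefficient gets its own penalty weight, so the
// intercept (or any feature) can be excluded by giving it a zero weight.
class WeightedL2Regularizer(penaltyWeights: Array[Double]) extends Regularizer {
  override def compute(weights: Vector, regParam: Double): (Double, Vector) = {
    val w = weights.toArray
    val grad = new Array[Double](w.length)
    var loss = 0.0
    var i = 0
    while (i < w.length) {
      loss += 0.5 * regParam * penaltyWeights(i) * w(i) * w(i)
      grad(i) = regParam * penaltyWeights(i) * w(i)
      i += 1
    }
    (loss, Vectors.dense(grad))
  }
}
```

Passing a zero penalty weight at the intercept's index reproduces the usual convention of leaving the intercept unregularized.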

@SparkQA commented Jul 21, 2014

QA tests have started for PR 1518. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16928/consoleFull

@SparkQA commented Jul 21, 2014

QA results for PR 1518:
- This patch FAILED unit tests.
- This patch merges cleanly
- This patch adds the following public classes (experimental):
abstract class Regularizer extends Serializable {
class SimpleRegularizer extends Regularizer {
class CompositeRegularizer extends Regularizer {

For more information see test output:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16928/consoleFull

@mengxr (Contributor) commented Jul 29, 2014

@dbtsai I thought of another way to do this and want to know your opinion. We can add an optional argument to appendBias: appendBias(bias: Double = 1.0). If this is used when adding the intercept, we can pass a large bias so the corresponding weight gets less regularized.
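
A hedged sketch of the idea (a standalone helper, not the actual MLUtils.appendBias signature): with a large bias value b, the fitted weight for the appended column is roughly intercept / b, so a uniform penalty shrinks it far less.

```scala
import org.apache.spark.mllib.linalg.{Vector, Vectors}

// Hypothetical variant of appendBias with a configurable bias value.
// The default of 1.0 matches the usual intercept column; a large value
// makes the corresponding weight numerically small and hence less
// affected by a uniform L2 penalty.
def appendBias(vector: Vector, bias: Double = 1.0): Vector = {
  Vectors.dense(vector.toArray :+ bias)
}

// appendBias(Vectors.dense(1.0, 2.0), bias = 100.0) == Vectors.dense(1.0, 2.0, 100.0)
```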

@dbtsai (Member, Author) commented Jul 30, 2014

I tried making the bias really big so that the corresponding intercept weight becomes small and is effectively less regularized. The result is still quite different from R's, and it is very sensitive to the magnitude of the bias.

Users may rescale the features to improve the convergence of the optimization process, and in order to get the same coefficients as in the unscaled problem, each component has to be penalized differently. Also, users may know that a feature is less important and want to penalize it more.

As a result, I still want to implement the full weighted regularizer and decouple the adaptive learning rate from the updater. Let's talk in detail when we meet tomorrow. Thanks.
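
A rough illustration of the scaling point (the notation here is illustrative, not from the PR): if feature j is standardized as x̃_j = x_j / σ_j, the equivalent weight on the scaled problem is w̃_j = σ_j · w_j, so a uniform L2 penalty λ Σ_j w̃_j² equals λ Σ_j σ_j² w_j² in the original coordinates. Recovering the unscaled coefficients therefore requires scaling the penalty on component j by 1/σ_j², which is exactly what a per-component penalty weight expresses.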

@mengxr (Contributor) commented Jul 30, 2014

I think this is the approach LIBLINEAR uses. Yes, let's discuss tomorrow.

@MLnick (Contributor) commented Aug 5, 2014

This looks promising. FWIW, I support decoupling regularization from the raw gradient update and believe it is a good way to go - it will allow various update/learning-rate schemes (AdaGrad, normalized adaptive gradient, etc.) to be applied independently of the regularization.
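
A small sketch of what that decoupling could enable (all names here are illustrative assumptions, not part of this PR): once the regularizer reports its own gradient, a single SGD step can combine the data gradient, the penalty gradient, and any per-coordinate step-size scheme.

```scala
// Illustrative update step on plain arrays; stepSizes could come from a
// constant/decaying schedule or a per-coordinate scheme such as AdaGrad,
// without touching the regularization logic.
def sgdStep(
    weights: Array[Double],
    dataGradient: Array[Double],
    regGradient: Array[Double],
    stepSizes: Array[Double]): Array[Double] = {
  weights.indices.map { i =>
    weights(i) - stepSizes(i) * (dataGradient(i) + regGradient(i))
  }.toArray
}
```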

@dbtsai (Member, Author) commented Aug 5, 2014

It's too late to get this into 1.1, but I'll try to make it happen in 1.2. We'll use this in our company's implementation first.

A contributor commented inline on the diff:

Won't the case statement affect performance?

dbtsai (Member, Author) replied:

This PR is not finished yet. I will replace this with the newly implemented foreachActive API.
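
A rough sketch of the intent (assuming the (index, value) callback of Vector.foreachActive, which may have been Spark-internal at the time): iterate over the stored entries directly instead of pattern matching on DenseVector versus SparseVector in the inner loop.

```scala
import org.apache.spark.mllib.linalg.Vector

// Accumulate an L2 gradient contribution without a match on the vector type.
def addL2Gradient(weights: Vector, regParam: Double, cumGradient: Array[Double]): Unit = {
  weights.foreachActive { (index, value) =>
    cumGradient(index) += regParam * value
  }
}
```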

@srowen (Member) commented Mar 5, 2015

I'm looking at really old PRs -- this is obsolete now, right?

@dbtsai (Member, Author) commented Mar 5, 2015

@srowen I'm still working on this PR, but unfortunately I haven't had enough time to finish it, so it keeps getting delayed. This PR is important since it will provide a general framework for solving L1/L2-regularized problems. The current way we use Updater is very awkward in my opinion.
