[SPARK-1892][MLLIB] Adding OWL-QN optimizer for L1 regularizations. It can also handle L2 re... #840

codedeft · 2014-05-20T18:29:26Z

Adding OWL-QN optimizer for L1 regularizations. It can also handle L2 and L1 regularizations together (balanced with alpha as in elastic nets). It extends LBFGS. It uses the OWL-QN implementation from breeze (which didn't work correctly before, but it was also fixed prior to this and committed to the latest breeze). Therefore, it requires the latest version of breeze to work correctly.

codedeft · 2014-05-20T18:30:42Z

jira link :

https://issues.apache.org/jira/browse/SPARK-1892

AmplabJenkins · 2014-05-20T18:32:58Z

Can one of the admins verify this patch?

codedeft · 2014-05-20T18:39:48Z

To clarify - it requires the latest breeze. The OWL-QN in breeze had bugs, which I fixed. I'm not sure if David's published an official release yet but it's in the latest snapshot.

codedeft · 2014-05-20T18:51:19Z

I'll try to get David to publish the latest breeze and change the project file to reference the latest breeze.

codedeft · 2014-05-22T23:57:03Z

Breeze has been updated to 0.8. This should now work.

debasish83 · 2014-05-23T04:26:13Z

@codedeft make sure breeze is updated in pom.xml as well if this PR is merged in....I pulled in your code to test our internal datasets...in mllib/pom.xml it is still at 0.7..please update it to 0.8...

org.scalanlp breeze_${scala.binary.version} 0.7

codedeft · 2014-05-23T18:21:19Z

Done!

gzm55 · 2014-07-22T04:52:19Z

@codedeft could you fix the conflicts when merging into master?

mengxr · 2014-08-02T04:34:01Z

@codedeft Could you add [SPARK-1892][MLLIB] to the title of this PR? So it shows up in the result if people search for the JIRA or [MLLIB]. Thanks!

… regularizations together. It extends LBFGS. It uses the OWL-QN implementation from breeze (which didn't work correctly before, but it was also fixed prior to this). It requires the latest version of breeze to work correctly.

SparkQA · 2014-09-05T23:46:36Z

Can one of the admins verify this patch?

debasish83 · 2014-10-01T02:35:50Z

@codedeft I am trying to test OWLQN and compare with a C++ baseline on our data but it seems it runs only one iteration...I am setting up 10 iterations and convergenceTol as 1e-4...caller code is simple:

class LogisticRegressionWithBFGS private (
private var numIterations: Int,
private var alpha: Double,
private var regParam: Double)
extends GeneralizedLinearAlgorithm[LogisticRegressionModel] with Serializable {
private val gradient = new LogisticGradient()
override val optimizer = new OWLQN(gradient).setAlpha(alpha).setRegParam(regParam)
override protected val validators = List(DataValidators.binaryLabelValidator)
override protected def createModel(weights: Vector, intercept: Double) = {
new LogisticRegressionModel(weights, intercept)
}
}

Are there any issues with the way Breeze OWLQN is called ?

codedeft · 2014-10-01T03:09:32Z

@debasish83 We fixed the previously broken Breeze OWLQN in Breeze 0.8 and we know that the new Breeze OWLQN works as expected. However, this particular PR does not address a (previously) existing problem in MLLib where regularization is applied to not just weights, but also the intercept. So if you are comparing against results that do L1 regularization properly (i.e. leaving out the intercept from regularization), you'll get different results.

I think @dbtsai might be working on the intercept issue. I'm not sure if that's been merged yet.

debasish83 · 2014-10-01T03:20:44Z

I am not that much bothered with intercept right now...Say if I force intercept to 0, this should work as expected right ?

codedeft · 2014-10-01T03:46:04Z

@debasish83
Yes. Or at least back when I tested it 4 months ago ;(

dbtsai · 2014-10-01T09:28:09Z

@debasish83 and @codedeft The weighted method for OWLQN in breeze is merged scalanlp/breeze@2570911

I will submit a PR to Spark to use newer version of breeze with this feature once @dlwh publishes to this to maven. But there is still some work in mllib side to have it working properly. I'll work on this once I'm back from vacation.

dlwh · 2014-10-06T22:29:39Z

breeze 0.10 is released.

On Wed, Oct 1, 2014 at 2:28 AM, DB Tsai [email protected] wrote:

@debasish83 https://github.com/debasish83 and @codedeft
https://github.com/codedeft The weighted method for OWLQN in breeze is
merged scalanlp/breeze@2570911
scalanlp/breeze@2570911

I will submit a PR to Spark to use newer version of breeze with this
feature once @dlwh https://github.com/dlwh publishes to this to maven.
But there is still some work in mllib side to have it working properly.
I'll work on this once I'm back from vacation.

—
Reply to this email directly or view it on GitHub
#840 (comment).

srowen · 2015-03-05T17:19:57Z

Mind closing this PR?

dbtsai · 2015-03-05T21:13:10Z

This will be replaced by #1518 (comment)

Co-authored-by: Egor Krivokon <>

…pache#840)

codedeft changed the title ~~Adding OWL-QN optimizer for L1 regularizations. It can also handle L2 re...~~ [SPARK-1892][MLLIB] Adding OWL-QN optimizer for L1 regularizations. It can also handle L2 re... Aug 3, 2014

Sung Chung and others added 2 commits August 3, 2014 15:31

Updating the breeze version to 0.8.1.

6e833e0

dbtsai deleted the OWL_QN_Addition branch October 28, 2014 19:15

codedeft closed this Mar 5, 2015

agirish pushed a commit to HPEEzmeral/apache-spark that referenced this pull request May 5, 2022

MapR [SPARK-901] Spark-3.1.1 doesn't start by warden (apache#840)

a784a1b

Co-authored-by: Egor Krivokon <>

udaynpusa pushed a commit to mapr/spark that referenced this pull request Jan 30, 2024

MapR [SPARK-901] Spark-3.1.1 doesn't start by warden (apache#840)

dd7ffc4

Co-authored-by: Egor Krivokon <>

mapr-devops pushed a commit to mapr/spark that referenced this pull request May 8, 2025

MapR [SPARK-901] Spark-3.1.1 doesn't start by warden (apache#840)

cb8042e

Co-authored-by: Egor Krivokon <>

turboFei added a commit to turboFei/spark that referenced this pull request Nov 6, 2025

[HADP-59466] Fix InsertIntoHiveDirCommand staging dir deleted issue (a…

32017dd

…pache#840)

[SPARK-1892][MLLIB] Adding OWL-QN optimizer for L1 regularizations. It can also handle L2 re... #840

[SPARK-1892][MLLIB] Adding OWL-QN optimizer for L1 regularizations. It can also handle L2 re... #840

Uh oh!

Conversation

codedeft commented May 20, 2014

Uh oh!

codedeft commented May 20, 2014

Uh oh!

AmplabJenkins commented May 20, 2014

Uh oh!

codedeft commented May 20, 2014

Uh oh!

codedeft commented May 20, 2014

Uh oh!

codedeft commented May 22, 2014

Uh oh!

debasish83 commented May 23, 2014

Uh oh!

codedeft commented May 23, 2014

Uh oh!

gzm55 commented Jul 22, 2014

Uh oh!

mengxr commented Aug 2, 2014

Uh oh!

SparkQA commented Sep 5, 2014

Uh oh!

debasish83 commented Oct 1, 2014

Uh oh!

codedeft commented Oct 1, 2014

Uh oh!

debasish83 commented Oct 1, 2014

Uh oh!

codedeft commented Oct 1, 2014

Uh oh!

dbtsai commented Oct 1, 2014

Uh oh!

dlwh commented Oct 6, 2014

Uh oh!

srowen commented Mar 5, 2015

Uh oh!

dbtsai commented Mar 5, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants