[SPARK-19110][ML][MLLIB] DistributedLDAModel returns different logPrior for original and loaded model #16491
Conversation
Jenkins, retest this please.

Test build #70992 has finished for PR 16491 at commit
val trainingLogLikelihood2 =
  model2.asInstanceOf[DistributedLDAModel].trainingLogLikelihood
assert(logPrior ~== logPrior2 absTol 1e-6)
assert(trainingLogLikelihood ~== trainingLogLikelihood2 absTol 1e-6)
Should we check that trainingLogLikelihood and logPrior are not changed for LocalLDAModel?
logLikelihood and logPrior are only available for the distributed model.
Right - I mean we should check that they are not persisted and then loaded as an unexpected but valid value (!= Double.NaN).
LocalLDAModel doesn't extend DistributedLDAModel (or vice versa), so I am not clear on how to check trainingLogLikelihood and logPrior for LocalLDAModel.
OK, I guess I remembered this wrong because of the other PR.
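For context, a standalone version of the round-trip check exercised by this test might look like the sketch below. This is a minimal sketch, not the suite's actual code: `dataset` and `path` are placeholder names, and plain `math.abs` stands in for the suite's `~==` / `absTol` matcher.

```scala
import org.apache.spark.ml.clustering.{DistributedLDAModel, LDA}

// Fit an EM-based LDA model (the "em" optimizer yields a DistributedLDAModel),
// save it, and reload it; `dataset` and `path` are assumed to exist.
val model = new LDA().setK(2).setOptimizer("em").setMaxIter(2).setSeed(1L)
  .fit(dataset).asInstanceOf[DistributedLDAModel]
model.write.overwrite().save(path)
val model2 = DistributedLDAModel.load(path)

// Both training summaries should survive save/load up to floating-point tolerance.
assert(math.abs(model.logPrior - model2.logPrior) < 1e-6)
assert(math.abs(model.trainingLogLikelihood - model2.trainingLogLikelihood) < 1e-6)
```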
@jkbradley @yanboliang please have a look

Yikes, thanks for fixing this!
…or for original and loaded model
## What changes were proposed in this pull request?
While adding the DistributedLDAModel training summary for SparkR, I found that logPrior differs between the original and the loaded model.
For example, in the test("read/write DistributedLDAModel"), I added the test:
val logPrior = model.asInstanceOf[DistributedLDAModel].logPrior
val logPrior2 = model2.asInstanceOf[DistributedLDAModel].logPrior
assert(logPrior === logPrior2)
The test fails:
-4.394180878889078 did not equal -4.294290536919573
The reason is that `graph.vertices.aggregate(0.0)(seqOp, _ + _)` only returns the value of a single vertex instead of the aggregation over all vertices. Therefore, when the loaded model performs the aggregation in a different order, it returns a different `logPrior`.
Please refer to #16464 for details.
## How was this patch tested?
Add a new unit test for logPrior.
Author: [email protected] <[email protected]>
Closes #16491 from wangmiao1981/ldabug.
(cherry picked from commit 036b503)
Signed-off-by: Joseph K. Bradley <[email protected]>
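The seqOp pitfall described above is easy to reproduce in isolation. The sketch below (illustrative code, not the LDA source) shows how a seqOp that drops its accumulator makes `RDD.aggregate` return only one element per partition, so the result depends on partitioning, while a seqOp that folds into the accumulator yields the true sum:

```scala
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().master("local[2]").appName("aggregate-pitfall").getOrCreate()
val values = spark.sparkContext.parallelize(Seq(1.0, 2.0, 3.0, 4.0), numSlices = 2)

// Buggy seqOp: ignores the running accumulator `acc`, so each partition
// contributes only its last element and the total varies with partitioning.
val buggy = values.aggregate(0.0)((acc, v) => v, _ + _)

// Correct seqOp: folds every element into the accumulator, so the result
// is the true sum (10.0) regardless of how the data is split.
val fixed = values.aggregate(0.0)((acc, v) => acc + v, _ + _)

println(s"buggy = $buggy, fixed = $fixed")
spark.stop()
```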
…nd logLikelihood of DistributedLDAModel in MLLIB
## What changes were proposed in this pull request?
apache#16491 added the fix to mllib and a unit test to ml. This follow-up PR adds unit tests to the mllib suite.
## How was this patch tested?
Unit tests.
Author: [email protected] <[email protected]>
Closes apache#16524 from wangmiao1981/ldabug.
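The follow-up's mllib-side check could be sketched along the same lines as the ml test above; `sc` (a SparkContext), `path`, and the tiny corpus here are placeholders. In mllib the summaries are exposed as `logPrior` and `logLikelihood`:

```scala
import org.apache.spark.mllib.clustering.{DistributedLDAModel, LDA}
import org.apache.spark.mllib.linalg.Vectors

// mllib's LDA defaults to the EM optimizer, so run() returns a DistributedLDAModel.
val corpus = sc.parallelize(Seq(
  (0L, Vectors.dense(1.0, 2.0, 0.0)),
  (1L, Vectors.dense(0.0, 1.0, 3.0))))
val model = new LDA().setK(2).setSeed(1L).run(corpus).asInstanceOf[DistributedLDAModel]
model.save(sc, path)
val model2 = DistributedLDAModel.load(sc, path)

// Both values should be identical (up to floating-point tolerance) after reload.
assert(math.abs(model.logPrior - model2.logPrior) < 1e-6)
assert(math.abs(model.logLikelihood - model2.logLikelihood) < 1e-6)
```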