-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-11847] [ML] Model export/import for spark.ml: LDA #9894
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #46492 has finished for PR 9894 at commit
|
|
Reviewing now |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
scala style: put vocabSize on next line
|
Thanks for the PR! My only other comment is about the missing DistributedLDAModel unit test. In the future, I also want to add an object LDA which implements Readable and returns an LDAModel. But we can skip that for now. |
|
@jkbradley Thanks for review. Do you mean a LDAModel.load(path) which can load both local and distributed LDAModel? We can create a jira. |
|
Test build #46581 has finished for PR 9894 at commit
|
Yes, that's what I had in mind. That way, most users will never need to know about local vs. distributed. |
|
LGTM. @hhbyyh Thanks for the updates! Is there anything left to do? If not, can you please remove the "WIP" label? I can then merge it. |
|
@jkbradley removed. Thanks. I'll create a jira. |
Add read/write support to LDA, similar to ALS. save/load for ml.LocalLDAModel is done. For DistributedLDAModel, I'm not sure if we can invoke save on the mllib.DistributedLDAModel directly. I'll send update after some test. Author: Yuhao Yang <[email protected]> Closes #9894 from hhbyyh/ldaMLsave. (cherry picked from commit 52bc25c) Signed-off-by: Xiangrui Meng <[email protected]>
|
Merged into master and branch-1.6. Thanks! |
Add read/write support to LDA, similar to ALS.
save/load for ml.LocalLDAModel is done.
For DistributedLDAModel, I'm not sure if we can invoke save on the mllib.DistributedLDAModel directly. I'll send update after some test.