-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-12016][MLlib][PySpark] Wrap Word2VecModel when loading it in pyspark #10100
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #47056 has finished for PR 10100 at commit
|
|
Test build #47061 has finished for PR 10100 at commit
|
|
This is only a suggestion, but looking at the other Java loadable models which are also wrapped in python it seems the standard approach is to instead add a load[X] method to the PythonMLLibAPI instead of doing the wrapper in Python. Might be worth doing that way just for consistencies sake - what are your thoughts? |
|
It makes sense. Thanks. I will update this later. |
|
Test build #47122 has finished for PR 10100 at commit
|
|
ping @davies Can you take a look of this? It should be straightforward one. |
|
LGTM, merging this into master, thanks! |
… pyspark JIRA: https://issues.apache.org/jira/browse/SPARK-12016 We should not directly use Word2VecModel in pyspark. We need to wrap it in a Word2VecModelWrapper when loading it in pyspark. Author: Liang-Chi Hsieh <[email protected]> Closes #10100 from viirya/fix-load-py-wordvecmodel. (cherry picked from commit b51a4cd) Signed-off-by: Joseph K. Bradley <[email protected]>
… pyspark JIRA: https://issues.apache.org/jira/browse/SPARK-12016 We should not directly use Word2VecModel in pyspark. We need to wrap it in a Word2VecModelWrapper when loading it in pyspark. Author: Liang-Chi Hsieh <[email protected]> Closes #10100 from viirya/fix-load-py-wordvecmodel. (cherry picked from commit b51a4cd) Signed-off-by: Joseph K. Bradley <[email protected]>
|
I just backported this to 1.6 and 1.5 |
JIRA: https://issues.apache.org/jira/browse/SPARK-12016
We should not directly use Word2VecModel in pyspark. We need to wrap it in a Word2VecModelWrapper when loading it in pyspark.