-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[Spark-14564] [ML] [MLlib] [PySpark] Python Word2Vec missing setWindowSize method #12428
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
python/pyspark/ml/feature.py
Outdated
| "the minimum number of times a token must appear to be included in the " + | ||
| "word2vec model's vocabulary", typeConverter=TypeConverters.toInt) | ||
| windowSize = Param(Params._dummy(), "windowSize", | ||
| "the window size (context words from [-window, window])", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should mention the default value.
|
Thanks for taking the initiative to do this. A few minor comments from the first pass through, but in the meantime maybe one the admins (possibly @jkbradley) could either say ok to jenkins to test or add to the whitelist? |
|
ok to test |
python/pyspark/ml/tests.py
Outdated
|
|
||
| def test_word2vec_param(self): | ||
| model = Word2Vec() \ | ||
| .setWindowSize(6) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
minor, but this can be one line
|
One minor comment. Pending that, the tests and @holdenk's comments, LGTM. |
|
LGTM pending tests :) |
|
Test build #56080 has finished for PR 12428 at commit
|
|
LGTM |
…ze method ## What changes were proposed in this pull request? Added windowSize getter/setter to ML/MLlib ## How was this patch tested? Added test cases in tests.py under both ML and MLlib Author: Jason Lee <[email protected]> Closes apache#12428 from jasoncl/SPARK-14564.
What changes were proposed in this pull request?
Added windowSize getter/setter to ML/MLlib
How was this patch tested?
Added test cases in tests.py under both ML and MLlib