-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper convenience function to create Py4J JavaArrays #14725
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
BryanCutler
wants to merge
15
commits into
apache:master
from
BryanCutler:pyspark-new_java_array-CountVectorizer-SPARK-17161
Closed
Changes from all commits
Commits
Show all changes
15 commits
Select commit
Hold shift + click to select a range
2a8de60
testing out _new_java_array
BryanCutler 97bff07
Merge remote-tracking branch 'upstream/master' into wip-pyspark-new_j…
BryanCutler 4766cdc
undo changes to CountVectorizerModel used for testing
BryanCutler 1c0ddb9
added convienience functions to JavaWrapper to create py4j JavaArray
BryanCutler f9672bf
fixed style checks and tests
BryanCutler bbf7f58
added python3 compatibility for xrange and basestring
BryanCutler b9da983
added usage example for OneVsRestModel constructor
BryanCutler 32fd5bd
fixed python style error
BryanCutler 4957e0c
can remove OneVsRest Java/Python friendly constructor now
BryanCutler ba63530
Merge remote-tracking branch 'upstream/master' into pyspark-new_java_…
BryanCutler 834c641
added MiMa exclusion for OneVsRest py-friendly constructor
BryanCutler 9321af8
Merge remote-tracking branch 'upstream/master' into pyspark-new_java_…
BryanCutler 65dcfb6
removed function that tried to infer type, better to just specify exp…
BryanCutler 869981f
added test for empty lists and little cleanup
BryanCutler 8b401ec
removed unused def
BryanCutler File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
are these imports needed anymore?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh, good catch thanks! I wasn't using the def for
basestringbut I still am iterating the array/list withxrange. I was thinking it would be possible for someone to use a large list, like when using a vocabulary. I suppose I could useenumerateif you think that's better?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does this look ok now @holdenk?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That looks fine, we use xrange similarly elsewhere.