-
Notifications
You must be signed in to change notification settings - Fork 28.9k
SPARK-554. Add aggregateByKey. #705
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Merged build triggered. |
|
Merged build started. |
|
Merged build finished. |
|
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14833/ |
|
Hey Sandy, this needs Java and Python APIs to be merged as well. We're going to stop adding Scala-only APIs at first and then fixing the other languages later. |
|
Makes sense. Updated patch adds Java and Python APIs. |
|
Merged build triggered. |
|
Merged build started. |
|
Merged build finished. |
|
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15244/ |
python/pyspark/rdd.py
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You seem to confuse T and V here (same in the other doc comments)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
My bad. Uploaded a patch that fixes this.
|
Merged build triggered. |
|
Merged build started. |
|
Merged build finished. |
|
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15280/ |
|
@sryza Looks like you have a style error, see https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15280/console. You can run |
|
My bad - fixed these. |
|
Build triggered. |
|
Build started. |
|
Build finished. |
|
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15367/ |
|
Jenkins, retest this please |
|
Build triggered. |
|
Build started. |
|
Build finished. |
|
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15401/ |
|
Jenkins, retest this please |
|
Build triggered. |
|
Build started. |
|
Build finished. |
|
@sryza looks like the python tests are failing |
|
Ok, the test should be fixed for real this time. |
|
Merged build triggered. |
|
Merged build started. |
|
Merged build finished. |
|
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15696/ |
|
I'm noticing a MIMA error. I think this must be a false positive? As the patch only adds APIs. |
|
@ScrapCodes - do you have any idea on this one? This doesn't make much sense to me at all! @sryza for now you can add an exclude in |
|
This happened because in the byte code there was a method as follows Which got changed to After this patch, not very sure if we could have avoided it. Since this method was an inner method. |
|
Merged build triggered. |
|
Merged build started. |
|
I just tried extracting createZero out and making it private, since the method is used in two places now. But making it private did not help, the mima warning is still there. |
|
Merged build finished. All automated tests passed. |
|
All automated tests passed. |
|
@ScrapCodes. Ah I see - there was a name collision with an inner function that was used elsewhere. |
|
Thanks Sandy - I'm merging this! |
Author: Sandy Ryza <[email protected]> Closes apache#705 from sryza/sandy-spark-554 and squashes the following commits: 2302b8f [Sandy Ryza] Add MIMA exclude f52e0ad [Sandy Ryza] Fix Python tests for real 2f3afa3 [Sandy Ryza] Fix Python test 0b735e9 [Sandy Ryza] Fix line lengths ae56746 [Sandy Ryza] Fix doc (replace T with V) c2be415 [Sandy Ryza] Java and Python aggregateByKey 23bf400 [Sandy Ryza] SPARK-554. Add aggregateByKey.
Author: Sandy Ryza <[email protected]> Closes apache#705 from sryza/sandy-spark-554 and squashes the following commits: 2302b8f [Sandy Ryza] Add MIMA exclude f52e0ad [Sandy Ryza] Fix Python tests for real 2f3afa3 [Sandy Ryza] Fix Python test 0b735e9 [Sandy Ryza] Fix line lengths ae56746 [Sandy Ryza] Fix doc (replace T with V) c2be415 [Sandy Ryza] Java and Python aggregateByKey 23bf400 [Sandy Ryza] SPARK-554. Add aggregateByKey.
…ache#705) Co-authored-by: Jeremy Liu <[email protected]>
…ath option (apache#705) * [SPARK-51281][SQL] DataFrameWriterV2 should respect the path option Unlike `DataFrameWriter.saveAsTable` where we explicitly get the "path" option and treat it as table location, `DataFrameWriterV2` doesn't do it and treats the "path" option as a normal option which doesn't have any real impact. This PR fixes it, and adds a legacy config to restore the old behavior. bug fix Yes, now `DataFrameWriterV2` can correctly write data to the specified path for file source tables. new test no Closes apache#50040 from cloud-fan/prop. Lead-authored-by: Wenchen Fan <[email protected]> Co-authored-by: Wenchen Fan <[email protected]> Signed-off-by: Wenchen Fan <[email protected]> (cherry picked from commit a3671e5) Signed-off-by: Wenchen Fan <[email protected]>
No description provided.