@kanzhang
Contributor

@kanzhang kanzhang commented Jun 2, 2014

RDD.zip() will throw an exception if it finds partition sizes are not the same.

@AmplabJenkins

Merged build triggered.

@AmplabJenkins

Merged build started.

@kanzhang
Contributor Author

kanzhang commented Jun 2, 2014

@mateiz This is to address the remaining issue of #760 (after #776). Since now zip() calls zipPartitions(), I removed ZippedRDD.scala. Pls take a look.
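The idea of having zip() delegate to zipPartitions() while checking sizes per partition can be sketched without Spark. The following is a minimal model, not the code from this patch: `ToyRdd` is a hypothetical in-memory stand-in for an RDD, and `IllegalStateException` is an assumed placeholder for whatever exception type the real implementation throws.

```scala
// Toy stand-in for an RDD: an ordered list of in-memory partitions.
// The names here are hypothetical and do not come from Spark.
final case class ToyRdd[T](partitions: Vector[Vector[T]]) {

  // Pairs up partitions by index and lets `f` combine each pair of iterators.
  def zipPartitions[U, V](other: ToyRdd[U])(
      f: (Iterator[T], Iterator[U]) => Iterator[V]): ToyRdd[V] = {
    require(partitions.length == other.partitions.length,
      "Can only zip collections with the same number of partitions")
    ToyRdd(partitions.zip(other.partitions).map {
      case (mine, theirs) => f(mine.iterator, theirs.iterator).toVector
    })
  }

  // zip() delegates to zipPartitions() and verifies, while iterating,
  // that both sides of every partition run out at the same time.
  def zip[U](other: ToyRdd[U]): ToyRdd[(T, U)] =
    zipPartitions[U, (T, U)](other) { (thisIter, otherIter) =>
      new Iterator[(T, U)] {
        def hasNext: Boolean = (thisIter.hasNext, otherIter.hasNext) match {
          case (true, true)   => true
          case (false, false) => false
          case _ => throw new IllegalStateException(
            "Can only zip collections with the same number of elements in each partition")
        }
        def next(): (T, U) = (thisIter.next(), otherIter.next())
      }
    }
}
```

In this model, two collections with the same total number of elements but different per-partition sizes fail instead of being silently truncated, which is the behavior the PR description asks for.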

@AmplabJenkins

Merged build finished.

@AmplabJenkins

Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15357/

@AmplabJenkins

Merged build triggered.

@AmplabJenkins

Merged build started.

@AmplabJenkins

Merged build finished.

@AmplabJenkins

Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15360/

@AmplabJenkins

Build triggered.

@AmplabJenkins

Build started.

@AmplabJenkins

Build finished.

@AmplabJenkins

Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15362/

@kanzhang
Contributor Author

kanzhang commented Jun 3, 2014

@pwendell I got the following test error, not sure what I can do. Can you help?

Could not find Apache license headers in the following files:
 !????? /root/workspace/SparkPullRequestBuilder/.generated-mima-excludes
Build step 'Execute shell' marked build as failure

@marmbrus
Contributor

marmbrus commented Jun 3, 2014

I think you might need to rebase your change as it probably is not merging cleanly.

Contributor

Make this a SparkException instead

@mateiz
Contributor

mateiz commented Jun 3, 2014

Weird, might be a temporary issue in Jenkins.

@AmplabJenkins

Build triggered.

@kanzhang
Contributor Author

kanzhang commented Jun 3, 2014

@mateiz updated
@marmbrus thanks for your note. I'll rebase if it still fails.

@AmplabJenkins

Build started.

@AmplabJenkins

Build finished.

@AmplabJenkins

Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15369/

@AmplabJenkins

Merged build triggered.

@AmplabJenkins

Merged build started.

@AmplabJenkins

Merged build finished. All automated tests passed.

@AmplabJenkins

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15371/

@kanzhang
Contributor Author

kanzhang commented Jun 3, 2014

Are we going to include this patch for 1.0.x? If so, I'll need to update MimaExcludes for it.

@mateiz
Contributor

mateiz commented Jun 3, 2014

Wait, why did you need to add this to MimaExcludes? You're not changing the API.

@kanzhang
Contributor Author

kanzhang commented Jun 3, 2014

MiMa complains as follows because I deleted ZippedRDD.scala. However, both classes are marked private[spark], so I suppose it is OK to exclude them?

[error]  * class org.apache.spark.rdd.ZippedRDD does not have a correspondent in new version
[error]    filter with: ProblemFilters.exclude[MissingClassProblem]("org.apache.spark.rdd.ZippedRDD")
[error]  * class org.apache.spark.rdd.ZippedPartition does not have a correspondent in new version
[error]    filter with: ProblemFilters.exclude[MissingClassProblem]("org.apache.spark.rdd.ZippedPartition")
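
The two filters suggested in the MiMa output above could be collected as in the following sketch. Only the `ProblemFilters.exclude[MissingClassProblem](...)` expressions are taken from the error messages; the object name `ZipExcludes` and the `excludes` value are hypothetical, and this is not the project's actual excludes file. It assumes the MiMa core library is on the build classpath, so it is a build-configuration fragment rather than a standalone program.

```scala
import com.typesafe.tools.mima.core._

// Hypothetical holder for the two filters. Only the filter expressions
// themselves are copied from the MiMa error output.
object ZipExcludes {
  val excludes = Seq(
    // ZippedRDD.scala was deleted, so both of its classes are now missing.
    ProblemFilters.exclude[MissingClassProblem]("org.apache.spark.rdd.ZippedRDD"),
    ProblemFilters.exclude[MissingClassProblem]("org.apache.spark.rdd.ZippedPartition")
  )
}
```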

@mateiz
Contributor

mateiz commented Jun 3, 2014

Ah, got it. @pwendell what do you think? I think we'd just add them to the excludes.

@mateiz
Contributor

mateiz commented Jun 3, 2014

(This is probably an oversight in the way we set up MIMA).

@rxin
Contributor

rxin commented Jun 4, 2014

Yea we should just add the rules. I had to work around the same problem when I removed SerializableHyperLogLog.

@rxin
Contributor

rxin commented Jun 4, 2014

BTW due to two recent PR merges, I think this no longer merges cleanly. Do you mind updating the PR to rebase it to master? Thanks.

@pwendell
Contributor

pwendell commented Jun 4, 2014

Yeah, this is one case where we need to manually add excludes. The reason is that MiMa doesn't have proper support for package privacy, and our workaround can't handle the case where we delete a previously package-private thing.

@AmplabJenkins

Merged build triggered.

@AmplabJenkins

Merged build started.

@kanzhang
Contributor Author

kanzhang commented Jun 4, 2014

@rxin rebased just now. Thanks!

@AmplabJenkins

Merged build finished. All automated tests passed.

@AmplabJenkins

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15431/

@rxin
Contributor

rxin commented Jun 4, 2014

Thanks. I've merged this into master branch.

@asfgit asfgit closed this in c402a4a Jun 4, 2014
@kanzhang kanzhang deleted the SPARK-1817 branch June 5, 2014 03:29
pdeyhim pushed a commit to pdeyhim/spark-1 that referenced this pull request Jun 25, 2014
RDD.zip() will throw an exception if it finds partition sizes are not the same.

Author: Kan Zhang <[email protected]>

Closes apache#944 from kanzhang/SPARK-1817 and squashes the following commits:

c073848 [Kan Zhang] [SPARK-1817] Cosmetic updates
524c670 [Kan Zhang] [SPARK-1817] RDD.zip() should verify partition sizes for each partition
xiliu82 pushed a commit to xiliu82/spark that referenced this pull request Sep 4, 2014