[SPARK-3580][CORE] Add Consistent Method To Get Number of RDD Partitions Across Different Languages #9767
Conversation
This is such a noisy change for really little gain that I don't think it's worth it. At best, just ensure there is a getNumPartitions in each language.
Although it would be nice to fix the inconsistencies in the usage of partitions.length vs partitions.size, I agree that it's better to stick to just adding getNumPartitions. I will remove the second commit from the PR. Should we also add a getNumPartitions method to JavaRDDLike for the Java API?
Yes, ideally. @JoshRosen am I right that we have to add the new method to the abstract class as well as the JavaRDDLike trait?
If I understand the discussion in SPARK-3266 correctly, the method should only be added to the JavaRDDLike trait and not the abstract class. I have updated the PR with this change.
Yeah, AFAIK you only need to add it to the trait. |
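For reference, a minimal sketch of what the two additions could look like, assuming the Scala method simply delegates to `partitions.length` and the Java wrapper delegates to the underlying RDD (doc comments are illustrative):

```scala
// core/src/main/scala/org/apache/spark/rdd/RDD.scala
/** Returns the number of partitions of this RDD. */
def getNumPartitions: Int = partitions.length

// core/src/main/scala/org/apache/spark/api/java/JavaRDDLike.scala
// (added to the trait only, per the discussion above)
/** Returns the number of partitions of this RDD. */
def getNumPartitions: Int = rdd.getNumPartitions
```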
Mind adding a @since tag here to say that this is new in Spark 1.6?
Not at all. Should I use @since or @Since, and also add it to RDD.scala? (I could not find any occurrence of this tag in spark-core).
I think you should use Spark's own @Since tag, since it's used in MLlib and SQL: https://github.com/apache/spark/blob/31921e0f0bd559d042148d1ea32f865fb3068f38/core/src/main/scala/org/apache/spark/annotation/Since.scala
Thanks, I added the @Since tags and updated this PR.
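For readers unfamiliar with the annotation, this is roughly how it ends up being applied; a sketch only, with the Scaladoc wording being illustrative:

```scala
import org.apache.spark.annotation.Since

/** Returns the number of partitions of this RDD. */
@Since("1.6.0")
def getNumPartitions: Int = partitions.length
```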
I think this looks good.
Test build #2094 has finished for PR 9767 at commit
@schot I think you will have to add the MiMa exclude that the error message in the output mentions. It's a 'false positive' but MiMa needs to be reassured.
@srowen Yes, I added a MiMa exclude.
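For context, a MiMa exclude for a new method on JavaRDDLike is a one-line entry in project/MimaExcludes.scala; the sketch below shows the general shape, though the exact problem class reported by MiMa is an assumption here:

```scala
// project/MimaExcludes.scala
// JavaRDDLike is not meant to be implemented outside of Spark, so the
// "missing method" report for the new getNumPartitions is a false positive.
ProblemFilters.exclude[MissingMethodProblem](
  "org.apache.spark.api.java.JavaRDDLike.getNumPartitions")
```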
Because I merged the fix into the previous commit, does the Jenkins retest not happen automatically?
Test build #2111 has finished for PR 9767 at commit
@schot this'll need a rebase. @JoshRosen are you OK with this for 1.6?
This patch adds a new method getNumPartitions to the Scala RDD and JavaRDDLike APIs as proposed in [SPARK-3580]. It brings the Scala and Java APIs in line with the Python API. For the Java API, a MiMa exclude was added.
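As a quick illustration of the end result, the Scala call now mirrors what PySpark already offered (a sketch assuming an existing SparkContext `sc`, e.g. in spark-shell; the sample RDD is just for demonstration):

```scala
val rdd = sc.parallelize(1 to 100, numSlices = 4)
rdd.getNumPartitions  // 4 -- same name as Python's rdd.getNumPartitions()
```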
@srowen @JoshRosen PR has been rebased to resolve the conflict on MimaExcludes.
Pinging @JoshRosen @pwendell for an opinion on slipping this into 1.6. I'm still inclined to put it in, but I'm aware there's a more conservative stance on 1.6 this week, so I wanted to wait a day before doing this.
Yeah, I think it's fine to pull in, but do it quickly because an RC will go out very soon!
[SPARK-3580][CORE] Add Consistent Method To Get Number of RDD Partitions Across Different Languages

I have tried to address all the comments in pull request #2447. Note that the second commit (using the new method in all internal code of all components) is quite intrusive and could be omitted.

Author: Jeroen Schot <[email protected]>

Closes #9767 from schot/master.

(cherry picked from commit 128c290)
Signed-off-by: Sean Owen <[email protected]>
merged to master / 1.6 |
I have tried to address all the comments in pull request #2447.
Note that the second commit (using the new method in all internal code of all components) is quite intrusive and could be omitted.