SPARK[1784]: Adding a balancedPartitioner #876
Conversation
This change adds a new partitioner which allows users to specify the number of keys per partition.
https://issues.apache.org/jira/browse/SPARK-1686

Moved from the original JIRA (by @markhamstra): In deploy.master.Master, the completeRecovery method is the last thing to be called when a standalone Master is recovering from failure. It is responsible for resetting some state, relaunching drivers, and eventually resuming its scheduling duties. There are currently four places in Master.scala where completeRecovery is called. Three of them are from within the actor's receive method, and aren't problems. The last starts from within receive when the ElectedLeader message is received, but the actual completeRecovery() call is made from the Akka scheduler. That means that it will execute on a different scheduler thread, and Master itself will end up running (i.e., schedule()) from that Akka scheduler thread.

In this PR, I added a new master message TriggerSchedule to trigger the "local" call of schedule() in the scheduler thread.

Author: CodingCat <[email protected]>

Closes #639 from CodingCat/SPARK-1686 and squashes the following commits:

81bb4ca [CodingCat] rename variable
69e0a2a [CodingCat] style fix
36a2ac0 [CodingCat] address Aaron's comments
ec9b7bb [CodingCat] address the comments
02b37ca [CodingCat] keep schedule() calling in the main thread
This partitioner uses a round-robin allocation strategy for keys so that we end up with balanced partitions for an RDD.
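For context only, here is a minimal sketch of what a round-robin partitioner along these lines might look like (the class name and the counter-based implementation are illustrative, not necessarily the code in this patch):

```scala
import java.util.concurrent.atomic.AtomicLong
import org.apache.spark.Partitioner

// Illustrative only: assigns records to partitions in round-robin order,
// ignoring the key itself, so every partition receives roughly the same
// number of records. Note that the internal counter makes getPartition
// non-deterministic per key -- the contract issue raised in the review below.
class BalancedPartitioner(override val numPartitions: Int) extends Partitioner {
  require(numPartitions > 0, "numPartitions must be positive")

  private val counter = new AtomicLong(0L)

  override def getPartition(key: Any): Int =
    (counter.getAndIncrement() % numPartitions).toInt

  override def equals(other: Any): Boolean = other match {
    case bp: BalancedPartitioner => bp.numPartitions == numPartitions
    case _                       => false
  }

  override def hashCode: Int = numPartitions
}
```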
This reverts commit 6668015.
|
Can one of the admins verify this patch? |
|
The current contract of a Partitioner requires it to return the same partition for a given key every time, so this doesn't fit that contract. It turns out this sort of balanced partitioning is useful, however, and we have encoded it explicitly within RDD#coalesce(). The semantics there match Spark's assumptions about partitioners -- i.e., the resultant RDD has no Partitioner, so no assumption can be made about the colocation of keys in order to do efficient lookups/groupBys/reduceByKeys. Would this sort of manual repartitioning suit your use case? Otherwise it would require a rather significant overhaul of Spark's Partitioner semantics. |
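For reference, the shuffle-based coalesce path mentioned above can be used directly today; a minimal usage sketch (`skewedRdd` and the partition count are placeholders):

```scala
// Redistribute records roughly evenly across 100 partitions via a shuffle.
// The resulting RDD carries no Partitioner, so nothing downstream may assume
// anything about which partition a given key lives in.
val balanced = skewedRdd.coalesce(100, shuffle = true)

// repartition(n) is shorthand for the same shuffle-based coalesce.
val alsoBalanced = skewedRdd.repartition(100)
```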
|
You are right, there are routines which make this assumption, but this is becoming a pain point for users: they end up with lopsided partitions and, especially if their dataset is huge, the larger partitions become a bottleneck and extend the tail of processing time. This partitioner explicitly targets such scenarios. If we agree on the general idea of the partitioner itself, I can add checks to the functions that assume hash or range partitioning behavior so that the balanced partitioner is treated as the general case. The user ends up with exactly balanced partitions and sacrifices a bit on lookup-type routines. |
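If that route were taken, the guard might look roughly like this (purely a sketch; the helper name is hypothetical and not part of Spark):

```scala
import org.apache.spark.{HashPartitioner, Partitioner, RangePartitioner}

// Hypothetical check: only partitioners with a deterministic key -> partition
// mapping can be used to narrow a lookup or join to a single partition; a
// round-robin "balanced" partitioner would fall back to the general case
// (full scan or shuffle).
def isLocationAware(p: Partitioner): Boolean = p match {
  case _: HashPartitioner        => true
  case _: RangePartitioner[_, _] => true
  case _                         => false
}
```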
|
Yes, I had a similar question. This would calculate different partitions for the same key when called from different places and at different times, and I imagine that causes several methods to fail. For example, what about joining two RDDs that both use this partitioner (and have multiple partitions) -- anything that creates a shuffle dependency between two pair RDDs. Surely the different instances of the depended-upon RDD's partitioner will return different partitions for keys and get the answer wrong (see the sketch after this comment)? I'm thinking of any time the partitioner instance is copied around -- it copies state, but then its state, which is essential to its answers, varies. Maybe someone more knowledgeable than I am can confirm an easy way to test this, or that I really misunderstand and this never happens.
I had thought the problem was more often in pair RDDs where one key has a lot of values, and operations that group by key create imbalanced partitions? That's not the question here, right? That wouldn't be helped by this. |
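A rough sketch of the failure mode described above, reusing the hypothetical `BalancedPartitioner` from the earlier sketch (`sc` is a SparkContext; this is illustrative, not a test from the patch):

```scala
val p = new BalancedPartitioner(4)

val left  = sc.parallelize(Seq("a" -> 1, "b" -> 2)).partitionBy(p)
val right = sc.parallelize(Seq("a" -> 10, "b" -> 20)).partitionBy(p)

// Because left.partitioner == right.partitioner, the join is planned as a
// narrow (no extra shuffle) dependency and each output partition only pairs
// co-numbered input partitions. With a stateful getPartition, key "a" may sit
// in partition 0 on one side and partition 3 on the other, so matching rows
// never meet and the join silently loses results.
val joined = left.join(right)
```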
|
@syedhashmi If you just want to shuffle stuff around randomly (i.e., you lose affinity of keys to specific partitions), then isn't it sufficient to just call the shuffle-based coalesce mentioned above? |
|
Here is the specific code: |
|
@pwendell: You are right ... your patch addresses this scenario. Does it make sense to expose this functionality through a partitioner, as that is the intuitive way for most folks, or do you think that would be a duplication of logic? |
|
This functionality doesn't fit the definition of a Partitioner as used in Spark (which requires it to consistently return the same partition for each key), so it would be confusing to expose it as such. In particular, Partitioners are also used to decide whether you can optimize joins and lookups based on a key's partition. This would break that behavior. |
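For reference, the contract being cited is small; a sketch of why HashPartitioner satisfies it while a round-robin scheme cannot:

```scala
import org.apache.spark.{HashPartitioner, Partitioner}

// org.apache.spark.Partitioner exposes just two methods:
//   def numPartitions: Int
//   def getPartition(key: Any): Int
// and Spark assumes getPartition is a pure function of the key.

// HashPartitioner meets that assumption, so repeated calls always agree:
val hp: Partitioner = new HashPartitioner(8)
assert(hp.getPartition("a") == hp.getPartition("a"))

// A round-robin partitioner cannot make this guarantee, which is why joins
// and lookups that rely on partitioner equality would break.
```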
This change adds a balanced partitioner to the existing partitioners. The new partitioner uses a round-robin strategy to allocate keys to partitions so that we end up with balanced partitions for an RDD.