-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-30074][SQL] The maxNumPostShufflePartitions config should obey reducePostShufflePartitions enabled #26664
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
| buildConf("spark.sql.adaptive.shuffle.targetPostShuffleInputSize") | ||
| .doc("The target post-shuffle input size in bytes of a task.") | ||
| .doc("The target post-shuffle input size in bytes of a task. This configuration only has " + | ||
| s"an effect when '${ADAPTIVE_EXECUTION_ENABLED.key}' is enabled.") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
and also when spark.sql.adaptive.shuffle.reducePostShufflePartitions.enabled is true?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, fix the doc of SHUFFLE_MIN_NUM_POSTSHUFFLE_PARTITIONS and SHUFFLE_TARGET_POSTSHUFFLE_INPUT_SIZE.
|
Test build #114407 has finished for PR 26664 at commit
|
|
Test build #114423 has finished for PR 26664 at commit
|
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala
Outdated
Show resolved
Hide resolved
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala
Outdated
Show resolved
Hide resolved
|
Test build #114602 has finished for PR 26664 at commit
|
|
retest this please. |
|
Test build #114617 has finished for PR 26664 at commit
|
|
thanks, merging to master! |
|
Thanks! |
| case class EnsureRequirements(conf: SQLConf) extends Rule[SparkPlan] { | ||
| private def defaultNumPreShufflePartitions: Int = | ||
| if (conf.adaptiveExecutionEnabled) { | ||
| if (conf.adaptiveExecutionEnabled && conf.reducePostShufflePartitionsEnabled) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@xuanyuanking is there any UT verifying the number of shuffle partitions dependent on values of these two configurations ?
… reducePostShufflePartitions enabled ### What changes were proposed in this pull request? 1. Make maxNumPostShufflePartitions config obey reducePostShfflePartitions config. 2. Update the description for all the SQLConf affected by `spark.sql.adaptive.enabled`. ### Why are the changes needed? Make the relation between these confs clearer. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Existing UT. Closes apache#26664 from xuanyuanking/SPARK-9853-follow. Authored-by: Yuanjian Li <[email protected]> Signed-off-by: Wenchen Fan <[email protected]>
What changes were proposed in this pull request?
spark.sql.adaptive.enabled.Why are the changes needed?
Make the relation between these confs clearer.
Does this PR introduce any user-facing change?
No
How was this patch tested?
Existing UT.