
Conversation

@woshilaiceshide
Contributor

Make Spark's "local[N]" better.
In our company, we use "local[N]" in production. It works excellently. It's our best choice.

In "local[N]", free cores of the only executor should be touched by "spark.task.cpus" for every finish/start-up of tasks.
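
For context, here is a minimal sketch of the free-core accounting this title describes, assuming a LocalBackend-style local scheduler; the class and method names (LocalFreeCoreAccounting, onTaskLaunched, onTaskFinished) are placeholders for illustration, not the actual patch:

```scala
// Sketch only: free cores of the single local executor are adjusted by
// "spark.task.cpus" whenever a task starts or finishes.
import org.apache.spark.SparkConf

class LocalFreeCoreAccounting(conf: SparkConf, totalCores: Int) {
  // Cores each task is assumed to occupy (defaults to 1).
  private val cpusPerTask = conf.getInt("spark.task.cpus", 1)

  // Cores currently free on the only local executor.
  private var freeCores = totalCores

  // Reserve the task's cores when it is launched.
  def onTaskLaunched(): Unit = synchronized { freeCores -= cpusPerTask }

  // Return the task's cores when it finishes or fails.
  def onTaskFinished(): Unit = synchronized { freeCores += cpusPerTask }

  // How many additional tasks could start right now.
  def availableSlots: Int = synchronized { freeCores / cpusPerTask }
}
```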
@AmplabJenkins

Can one of the admins verify this patch?

@rxin
Contributor

rxin commented Jul 23, 2014

Jenkins, test this please.

@rxin
Contributor

rxin commented Jul 23, 2014

Do you mind creating a JIRA ticket and adding the ticket title to the pull request, like other PRs do? Thanks!

issues.apache.org/jira/browse/SPARK

@SparkQA

SparkQA commented Jul 23, 2014

QA tests have started for PR 1544. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/17024/consoleFull

@SparkQA

SparkQA commented Jul 23, 2014

QA results for PR 1544:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test output:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/17024/consoleFull

@mateiz
Contributor

mateiz commented Jul 23, 2014

This makes sense, but I'm slightly confused by it: why not just launch local[N] with a smaller N if you want fewer threads? This setting is the same for each task, after all.

@asfgit closed this in f776bc9 on Jul 23, 2014
@mateiz
Contributor

mateiz commented Jul 23, 2014

BTW I've merged this, thanks for the patch.

@woshilaiceshide
Contributor Author

@mateiz, because in spark-v1.0.1 "spark.default.parallelism" is not considered in class LocalBackend: the default parallelism there is simply totalCores, which is derived from "local[N]". So in spark-v1.0.1, if I want to increase the default parallelism (p), I have to increase N in "local[N]", which also increases the number (t) of tasks that can be launched in the only local executor; that is where "spark.task.cpus" (c) comes in. In the end I work with the equation p - (t - 1) + 1 = c * 2, which holds when the split factor is big enough. See https://github.com/apache/spark/blob/v1.0.1/core/src/main/scala/org/apache/spark/scheduler/local/LocalBackend.scala
This has been fixed in the current master: https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/local/LocalBackend.scala

We use "local[N]" in production, so we pay closer attention to "local[N]".
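
To make the workaround concrete, here is a hedged sketch of the kind of configuration being described; the numbers are illustrative only, and it assumes the behavior this patch adds (i.e. "spark.task.cpus" being honored in local mode):

```scala
import org.apache.spark.{SparkConf, SparkContext}

object LocalNWorkaround {
  def main(args: Array[String]): Unit = {
    // Illustrative values only. In Spark 1.0.1, LocalBackend's default
    // parallelism is totalCores, i.e. the N in "local[16]" below.
    val conf = new SparkConf()
      .setAppName("local-n-workaround")
      .setMaster("local[16]")       // raises totalCores and thus the default parallelism
      .set("spark.task.cpus", "4")  // with this patch, at most 16 / 4 = 4 tasks run at once
    val sc = new SparkContext(conf)
    // On current master, "spark.default.parallelism" is read by LocalBackend,
    // so this coupling between N and "spark.task.cpus" is no longer needed.
    sc.stop()
  }
}
```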

@junnyxi

junnyxi commented Jul 24, 2014

Waiting for the result.

xiliu82 pushed a commit to xiliu82/spark that referenced this pull request Sep 4, 2014
In "local[N]", free cores of the only executor should be touched by "spark.task.cpus" for every finish/start-up of tasks.

Make Spark's "local[N]" better.
In our company, we use "local[N]" in production. It works excellently. It's our best choice.

Author: woshilaiceshide <[email protected]>

Closes apache#1544 from woshilaiceshide/localX and squashes the following commits:

6c85154 [woshilaiceshide] [CORE] SPARK-2640: In "local[N]", free cores of the only executor should be touched by "spark.task.cpus" for every finish/start-up of tasks.