
Conversation

@squito (Contributor) commented Aug 25, 2015

This eliminates "skipped" stages for jobs that share shuffle dependencies, and instead reuses the same stage. This is done by not removing ShuffleMapStages when a job finishes, but instead waiting until the shuffle is cleaned up by the context cleaner. It does increase memory usage for long-running jobs with lots of stages (though it's the same order of magnitude as before, since we already hold on to shuffle data in the MapOutputTracker).

The advantage is simplified code and a clearer experience for the end user: jobs that reference an already completed stage link to that completed stage, rather than to a new stage that gets "skipped", which is always confusing. (Perhaps it could still use a better UI treatment to make it clear that the stage had already completed as part of a previous job.)
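For illustration only, here is a rough, self-contained sketch of the reuse idea (the types and method names are stand-ins, not the actual DAGScheduler code in this patch): a ShuffleMapStage is registered by shuffle id, handed back to any later job that references the same shuffle, and dropped only when the context cleaner cleans that shuffle rather than when a job finishes.

```scala
import scala.collection.mutable

// Stand-ins for the real scheduler types, only to keep the sketch runnable.
case class ShuffleDependency(shuffleId: Int)
case class ShuffleMapStage(id: Int, shuffleId: Int)

class StageRegistry {
  private var nextStageId = 0
  private val shuffleIdToMapStage = mutable.HashMap.empty[Int, ShuffleMapStage]

  // A later job that references the same shuffle gets the existing stage back
  // (with its completed map outputs) instead of a new stage shown as "skipped".
  def getOrCreateShuffleMapStage(dep: ShuffleDependency): ShuffleMapStage =
    shuffleIdToMapStage.getOrElseUpdate(dep.shuffleId, {
      val stage = ShuffleMapStage(nextStageId, dep.shuffleId)
      nextStageId += 1
      stage
    })

  // Invoked only when the ContextCleaner cleans the shuffle, not at job completion.
  def handleShuffleCleaned(shuffleId: Int): Unit =
    shuffleIdToMapStage.remove(shuffleId)
}
```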

@squito (Contributor, Author) commented Aug 25, 2015

Jenkins, retest this please

@markhamstra (Contributor)

This is going to increase memory pressure. The very early code never cleaned up the Stage-tracking data structures at all, which was clearly unacceptable for long-running Applications. What we have now cleans up as soon as possible, and thus has minimal memory pressure. What you have in this PR lands somewhere in between, and could cause problems if a lot of Stages stick around for a long time.

@SparkQA commented Aug 25, 2015

Test build #41556 has finished for PR 8427 at commit 830e0c8.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class SetOperation(left: LogicalPlan, right: LogicalPlan) extends BinaryNode
    • case class Union(left: LogicalPlan, right: LogicalPlan) extends SetOperation(left, right)
    • case class Intersect(left: LogicalPlan, right: LogicalPlan) extends SetOperation(left, right)
    • case class Except(left: LogicalPlan, right: LogicalPlan) extends SetOperation(left, right)

@squito changed the title from "[SPARK-10193] [core] [wip]" to "[SPARK-10193] [core] [wip] Eliminate Skipped Stages by reusing ShuffleMapStages" on Aug 25, 2015
@squito (Contributor, Author) commented Aug 25, 2015

@markhamstra yup, no question this will increase memory usage. The question is, should we consider it anyway? Maybe you were implicitly answering "no", but I'm gonna make my case again in any case :)

Clearly, if you have long-running jobs with lots of stages and you never do anything to clean them up, then stageIdToStage is going to eat up all your memory. But that will happen anyway: you'll already run out of memory because of the MapOutputTracker storing shuffle output (and most likely the huge number of RDDs you've created that can't be gc'ed either). We add a few more hashmap entries and more Stage objects, which shouldn't contain anything huge -- no bigger than what we're already tracking. Certainly it'll have an effect, though.

I think it's a pretty big usability improvement, so worth considering, but that is totally subjective. I realize this is a bit hand-wavy for now -- I'll try to quantify the memory usage effect so we can make a more informed decision (if others are still interested).
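One possible way to quantify it (a sketch, not part of this patch; the listener class and its output format are made up for illustration): attach a SparkListener and approximate the retained per-stage metadata with SizeEstimator, which only gives a rough, JVM-dependent figure but is enough for an order-of-magnitude check.

```scala
import org.apache.spark.scheduler.{SparkListener, SparkListenerStageCompleted}
import org.apache.spark.util.SizeEstimator

// Tracks an approximate running total of completed-stage metadata. StageInfo is
// only a proxy for the scheduler's internal Stage objects, so treat the number
// as a ballpark, not an exact measurement.
class StageFootprintListener extends SparkListener {
  private var totalBytes = 0L

  override def onStageCompleted(event: SparkListenerStageCompleted): Unit = {
    totalBytes += SizeEstimator.estimate(event.stageInfo)
    println(s"~${totalBytes / 1024} KB of stage metadata retained so far")
  }
}

// sc.addSparkListener(new StageFootprintListener())
```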

@markhamstra (Contributor)

I wasn't intending to answer "no", but rather just wanting to make sure that we think through the implications of this change. It will increase memory pressure some, but I agree that it shouldn't be a lot because of the already present references via the MapOutputTracker. On balance, I'm inclined to agree with you that this is worth doing.

@SparkQA commented Aug 26, 2015

Test build #41625 timed out for PR 8427 at commit 4931ccc after a configured wait of 175m.

@mridulm (Contributor) commented Jan 24, 2016

Just a note about the MapOutputTracker: it is fairly trivial to make it use a bare minimum of memory even if it does not get cleaned up for "old" stages, by using a disk-backed map (MapDB, for example) as an LRU cache.
That keeps at most the current and previous map outputs in memory and everything else on disk (until a node failure requires recomputation, which brings portions of it back into memory).

This is what we used to do for production jobs in some earlier projects.

I am not sure what the impact of the current proposal is from a memory-overhead point of view -- map output was (obviously) expensive enough to attempt this, and the effect was not pervasive/diffuse across the codebase for shuffle output tracking.
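For reference, a minimal sketch of the disk-backed LRU idea described above (illustrative only; the class and its spill/reload hooks are hypothetical, and the real MapOutputTracker is structured differently). MapDB, or any other key-value store, could sit behind the writeToDisk/readFromDisk functions:

```scala
import java.util.{LinkedHashMap => JLinkedHashMap, Map => JMap}

// Keeps up to `maxInMemory` shuffle statuses in memory (access-ordered LRU);
// anything evicted is written to a disk-backed store and reloaded on demand,
// e.g. when a node failure forces recomputation.
class SpillingShuffleStatusMap[V](
    maxInMemory: Int,
    writeToDisk: (Int, V) => Unit,
    readFromDisk: Int => Option[V]) {

  private val inMemory = new JLinkedHashMap[Int, V](16, 0.75f, true) {
    override def removeEldestEntry(eldest: JMap.Entry[Int, V]): Boolean = {
      if (size() > maxInMemory) {
        writeToDisk(eldest.getKey, eldest.getValue)  // spill instead of dropping
        true
      } else {
        false
      }
    }
  }

  def put(shuffleId: Int, status: V): Unit = synchronized {
    inMemory.put(shuffleId, status)
  }

  def get(shuffleId: Int): Option[V] = synchronized {
    Option(inMemory.get(shuffleId)).orElse {
      val restored = readFromDisk(shuffleId)
      restored.foreach(v => inMemory.put(shuffleId, v))
      restored
    }
  }
}
```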

@rxin (Contributor) commented Jun 15, 2016

Thanks for the pull request. I'm going through a list of pull requests to cut them down since the sheer number is breaking some of the tooling we have. Due to lack of activity on this pull request, I'm going to push a commit to close it. Feel free to reopen it or create a new one. We can also continue the discussion on the JIRA ticket.

@asfgit closed this in 1a33f2e on Jun 15, 2016
