[SPARK-7698] Cache and reuse buffers in ExecutorMemoryAllocator when using heap allocation #6227

JoshRosen · 2015-05-18T03:08:33Z

When on-heap memory allocation is used, ExecutorMemoryManager should maintain a cache / pool of buffers for re-use by tasks. This will significantly improve the performance of the new Tungsten's sort-shuffle for jobs with many short-lived tasks by eliminating a major source of GC.

This pull request is a minimum-viable-implementation of this idea. In its current form, this patch significantly improves performance on a stress test which launches huge numbers of short-lived shuffle map tasks back-to-back in the same JVM.

AmplabJenkins · 2015-05-18T03:12:10Z

Merged build triggered.

AmplabJenkins · 2015-05-18T03:12:18Z

Merged build started.

SparkQA · 2015-05-18T03:14:12Z

Test build #32963 has started for PR 6227 at commit b154e86.

JoshRosen · 2015-05-18T04:35:20Z

For https://gist.github.com/680ee530655941defcb2, this patch gives a roughly 3x speedup.

SparkQA · 2015-05-18T04:59:17Z

Test build #32963 has finished for PR 6227 at commit b154e86.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2015-05-18T04:59:22Z

Merged build finished. Test PASSed.

AmplabJenkins · 2015-05-18T04:59:22Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32963/
Test PASSed.

JoshRosen · 2015-05-18T05:01:17Z

Leaving the [WIP] tag on for now while we discuss a few different design decisions.

JoshRosen · 2015-05-18T21:02:20Z

@rxin @zsxwing In the long term, I think that we should consider a more complete design for buffer pooling in our allocator, including thinking through how / whether we want to support pooling for off-heap modes, how we want to match up allocation requests with things in the pages, whether we want to have more manual control over purging pages from the pool, etc. For the immediate 1.4 term, though, I think a super-simple approach like the one in this patch offers a nice improvement. Because the number of different allocation sizes is relatively small (one or two sizes, tops), I think the simple approach is fine for starters.

zsxwing · 2015-05-18T23:47:40Z

unsafe/src/main/java/org/apache/spark/unsafe/memory/ExecutorMemoryManager.java

Just a minor question: I think you want to use LinkedList as Stack since you use pop. Right? If so, here you should use push. push calls addFirst, pop calls removeFirst, while add calls addLast.

Java's linked list is doubly-linked, so I don't think that this makes a perf. difference or anything, which is why I was a little sloppy here.

I thought there was a special reason to use pop :)

JoshRosen · 2015-05-20T01:06:55Z

On closer inspection, I'm thinking that we should probably prefer WeakReferences to SoftReferences, since it's probably better to allow this memory to be released sooner in response to memory demand rather than having the heap grow in order to try to keep empty pages in the pool.

zsxwing · 2015-05-20T01:09:47Z

@JoshRosen why not round up size to a power of 2? I think it's more possible to reuse the objects.

zsxwing · 2015-05-20T01:12:43Z

+1. WeakReferences is better for cache usage.

JoshRosen · 2015-05-20T01:20:46Z

@zsxwing I think that all of our internal requests are already power-of-2 sized, so I don't think that's a concern yet. Right now, I think we'll only end up allocating pages whose sizes are drawn from a very small set (maybe 4 or fewer standard page sizes, tops). We might consider adding the rounding later, though.

zsxwing · 2015-05-20T01:22:35Z

I see. LGTM except the WeakReferences thing.

JoshRosen · 2015-05-20T01:31:21Z

Just pushed a commit to fix the WeakReference thing :)

AmplabJenkins · 2015-05-20T01:32:10Z

Merged build triggered.

AmplabJenkins · 2015-05-20T01:32:16Z

Merged build started.

SparkQA · 2015-05-20T01:33:09Z

Test build #33114 has started for PR 6227 at commit fd6cb55.

zsxwing · 2015-05-20T01:34:56Z

LGTM

SparkQA · 2015-05-20T03:18:18Z

Test build #33114 has finished for PR 6227 at commit fd6cb55.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2015-05-20T03:18:23Z

Merged build finished. Test PASSed.

AmplabJenkins · 2015-05-20T03:18:24Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/33114/
Test PASSed.

rxin · 2015-05-20T23:00:03Z

LGTM

JoshRosen · 2015-05-20T23:15:33Z

Thanks for the review. I'm going to merge this into master and branch-1.4 (1.4.0).

…using heap allocation When on-heap memory allocation is used, ExecutorMemoryManager should maintain a cache / pool of buffers for re-use by tasks. This will significantly improve the performance of the new Tungsten's sort-shuffle for jobs with many short-lived tasks by eliminating a major source of GC. This pull request is a minimum-viable-implementation of this idea. In its current form, this patch significantly improves performance on a stress test which launches huge numbers of short-lived shuffle map tasks back-to-back in the same JVM. Author: Josh Rosen <[email protected]> Closes #6227 from JoshRosen/SPARK-7698 and squashes the following commits: fd6cb55 [Josh Rosen] SoftReference -> WeakReference b154e86 [Josh Rosen] WIP sketch of pooling in ExecutorMemoryManager (cherry picked from commit 7956dd7) Signed-off-by: Josh Rosen <[email protected]>

…using heap allocation When on-heap memory allocation is used, ExecutorMemoryManager should maintain a cache / pool of buffers for re-use by tasks. This will significantly improve the performance of the new Tungsten's sort-shuffle for jobs with many short-lived tasks by eliminating a major source of GC. This pull request is a minimum-viable-implementation of this idea. In its current form, this patch significantly improves performance on a stress test which launches huge numbers of short-lived shuffle map tasks back-to-back in the same JVM. Author: Josh Rosen <[email protected]> Closes apache#6227 from JoshRosen/SPARK-7698 and squashes the following commits: fd6cb55 [Josh Rosen] SoftReference -> WeakReference b154e86 [Josh Rosen] WIP sketch of pooling in ExecutorMemoryManager

WIP sketch of pooling in ExecutorMemoryManager

b154e86

zsxwing reviewed May 18, 2015
View reviewed changes

SoftReference -> WeakReference

fd6cb55

JoshRosen changed the title ~~[SPARK-7698] [WIP] Cache and reuse buffers in ExecutorMemoryAllocator when using heap allocation~~ [SPARK-7698] Cache and reuse buffers in ExecutorMemoryAllocator when using heap allocation May 20, 2015

asfgit closed this in 7956dd7 May 20, 2015

JoshRosen deleted the SPARK-7698 branch May 20, 2015 23:41

[SPARK-7698] Cache and reuse buffers in ExecutorMemoryAllocator when using heap allocation #6227

[SPARK-7698] Cache and reuse buffers in ExecutorMemoryAllocator when using heap allocation #6227

Uh oh!

Conversation

JoshRosen commented May 18, 2015

Uh oh!

AmplabJenkins commented May 18, 2015

Uh oh!

AmplabJenkins commented May 18, 2015

Uh oh!

SparkQA commented May 18, 2015

Uh oh!

JoshRosen commented May 18, 2015

Uh oh!

SparkQA commented May 18, 2015

Uh oh!

AmplabJenkins commented May 18, 2015

Uh oh!

AmplabJenkins commented May 18, 2015

Uh oh!

JoshRosen commented May 18, 2015

Uh oh!

JoshRosen commented May 18, 2015

Uh oh!

zsxwing May 18, 2015

Choose a reason for hiding this comment

Uh oh!

JoshRosen May 20, 2015

Choose a reason for hiding this comment

Uh oh!

zsxwing May 20, 2015

Choose a reason for hiding this comment

Uh oh!

JoshRosen commented May 20, 2015

Uh oh!

zsxwing commented May 20, 2015

Uh oh!

zsxwing commented May 20, 2015

Uh oh!

JoshRosen commented May 20, 2015

Uh oh!

zsxwing commented May 20, 2015

Uh oh!

JoshRosen commented May 20, 2015

Uh oh!

AmplabJenkins commented May 20, 2015

Uh oh!

AmplabJenkins commented May 20, 2015

Uh oh!

SparkQA commented May 20, 2015

Uh oh!

zsxwing commented May 20, 2015

Uh oh!

SparkQA commented May 20, 2015

Uh oh!

AmplabJenkins commented May 20, 2015

Uh oh!

AmplabJenkins commented May 20, 2015

Uh oh!

rxin commented May 20, 2015

Uh oh!

JoshRosen commented May 20, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants