[SPARK-3889] Attempt to avoid SIGBUS by not mmapping files in ConnectionManager #2742

aarondav · 2014-10-10T01:03:58Z

In general, individual shuffle blocks are frequently small, so mmapping them often creates a lot of waste. It may not be bad to mmap the larger ones, but it is pretty inconvenient to get configuration into ManagedBuffer, and besides it is unlikely to help all that much.

…ionManager In general, individual shuffle blocks are frequently small, so mmapping them often creates a lot of waste. It may not be bad to mmap the larger ones, but it is pretty inconvenient to get configuration into ManagedBuffer, and besides it is unlikely to help all that much. Note that user of ManagedBuffer#nioByteBuffer() seems generally bad practice, and would ideally never be used for data that may be large. Users of such data would ideally stream the data instead.

aarondav · 2014-10-10T01:04:50Z

@rxin thoughts on this? You probably have a better idea on whether this would be too much of a perf hit, and whether we should switch into "map" mode for larger blocks.

AmplabJenkins · 2014-10-10T01:17:19Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/21562/Test FAILed.

rxin · 2014-10-10T05:34:40Z

core/src/main/scala/org/apache/spark/network/ManagedBuffer.scala

i think the old code had a config parameter for using mem map or this ..

aarondav · 2014-10-10T07:34:48Z

Added a non-configurable version of the memory map pathway, with the threshold you suggested (2MB, the size of a hugepage). Note that this fix will also be included in #2753.

SparkQA · 2014-10-10T07:39:41Z

QA tests have started for PR 2742 at commit a152065.

This patch merges cleanly.

SparkQA · 2014-10-10T08:40:31Z

QA tests have finished for PR 2742 at commit a152065.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2014-10-10T08:40:35Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/21583/Test PASSed.

rxin · 2014-10-10T08:44:34Z

LGTM. Merged. Thanks!

mridulm · 2014-10-10T22:30:43Z

This needs to be configurable ... IIRC 1.1 had this customizable - see spark.storage.memoryMapThreshold
Different limits exist for vm vs heap memory in yarn (for example) and so deployments will need to customize this.

aarondav · 2014-10-10T23:12:20Z

@mridulm Could you give an example of which way you would want to shift it via config? Map more or less often?

mridulm · 2014-10-10T23:26:25Z

With 1.1, in expts, we have done both : depending on whether our user code
is mmap'ing too much data (and so we pull things into heap .. using
libraries not in our control :-) ); decreasing it when heap is at premium.
On 11-Oct-2014 4:42 am, "Aaron Davidson" [email protected] wrote:

@mridulm https://github.com/mridulm Could you give an example of which
way you would want to shift it via config? Map more or less often?

—
Reply to this email directly or view it on GitHub
#2742 (comment).

mridulm · 2014-10-10T23:27:46Z

Note: this is reqd since there are heap and vm limits enforced, so we
juggle available memory around so that jobs can run to completion!
On 11-Oct-2014 4:56 am, "Mridul Muralidharan" [email protected] wrote:

With 1.1, in expts, we have done both : depending on whether our user code
is mmap'ing too much data (and so we pull things into heap .. using
libraries not in our control :-) ); decreasing it when heap is at premium.
On 11-Oct-2014 4:42 am, "Aaron Davidson" [email protected] wrote:

@mridulm https://github.com/mridulm Could you give an example of which
way you would want to shift it via config? Map more or less often?

—
Reply to this email directly or view it on GitHub
#2742 (comment).

rxin reviewed Oct 10, 2014
View reviewed changes

Add other pathway back

a152065

asfgit closed this in 90f73fc Oct 10, 2014

[SPARK-3889] Attempt to avoid SIGBUS by not mmapping files in ConnectionManager #2742

[SPARK-3889] Attempt to avoid SIGBUS by not mmapping files in ConnectionManager #2742

Uh oh!

Conversation

aarondav commented Oct 10, 2014

Uh oh!

aarondav commented Oct 10, 2014

Uh oh!

AmplabJenkins commented Oct 10, 2014

Uh oh!

rxin Oct 10, 2014

Choose a reason for hiding this comment

Uh oh!

aarondav commented Oct 10, 2014

Uh oh!

SparkQA commented Oct 10, 2014

Uh oh!

SparkQA commented Oct 10, 2014

Uh oh!

AmplabJenkins commented Oct 10, 2014

Uh oh!

rxin commented Oct 10, 2014

Uh oh!

mridulm commented Oct 10, 2014

Uh oh!

aarondav commented Oct 10, 2014

Uh oh!

mridulm commented Oct 10, 2014

Uh oh!

mridulm commented Oct 10, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants