[SPARK-49217][CORE] Support separate buffer size configuration in UnsafeShuffleWriter #47733

cxzl25 · 2024-08-13T04:48:28Z

What changes were proposed in this pull request?

This PR aims to support separate buffer size configuration in UnsafeShuffleWriter.

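It introduces the spark.shuffle.file.merge.buffer configuration, which sets the size of the buffer used to read spill files during the merge, independently of spark.shuffle.file.buffer.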

Why are the changes needed?

UnsafeShuffleWriter#mergeSpillsWithFileStream uses spark.shuffle.file.buffer as the size of the buffer for reading each spill file, and this buffer is allocated off-heap.

While spilling, we want this buffer to be large. However, when a task produces many spill files, UnsafeShuffleWriter#mergeSpillsWithFileStream allocates one such buffer per spill file, so the total off-heap memory grows with the number of spills and the executor can easily be killed by YARN.

spark/core/src/main/java/org/apache/spark/shuffle/sort/UnsafeShuffleWriter.java

Lines 372 to 375 in e72d21c

    for (int i = 0; i < spills.length; i++) {
      spillInputStreams[i] = new NioBufferedFileInputStream(
        spills[i].file,
        inputBufferSizeInBytes);
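
To make the scaling concrete, here is a minimal arithmetic sketch, assuming one read buffer is allocated per spill file as in the loop quoted from UnsafeShuffleWriter. The class name, the spill count, and the two buffer sizes are hypothetical values chosen for illustration; they are not taken from this PR.

```java
public final class MergeBufferEstimate {
    private MergeBufferEstimate() {}

    /**
     * Total off-heap bytes held open during the merge, assuming one
     * read buffer of bufferSizeBytes per spill file.
     */
    public static long totalMergeBufferBytes(int numSpills, long bufferSizeBytes) {
        return Math.multiplyExact((long) numSpills, bufferSizeBytes);
    }

    public static void main(String[] args) {
        int numSpills = 2000;               // hypothetical spill count
        long largeBuffer = 1024L * 1024L;   // hypothetical 1 MiB buffer shared with the write path
        long smallBuffer = 32L * 1024L;     // hypothetical 32 KiB buffer used only for merging

        System.out.println("shared buffer size:   "
            + totalMergeBufferBytes(numSpills, largeBuffer) + " bytes");
        System.out.println("separate merge size:  "
            + totalMergeBufferBytes(numSpills, smallBuffer) + " bytes");
    }
}
```

With a separate setting, the merge buffer can stay small while the write-side buffer stays large, which is the trade-off this PR is after.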

Does this PR introduce any user-facing change?

No

How was this patch tested?

Production environment verification

Was this patch authored or co-authored using generative AI tooling?

No

mridulm

The change looks reasonable.
+CC @JoshRosen as well.

As a side note, should we be using Platform.allocateDirectBuffer for NioBufferedFileInputStream as well @JoshRosen ?

JoshRosen

LGTM, as this looks reasonable to me as well: this is a bit of a low-level configuration but it seems fine to allow it to be tuned.

> As a side note, should we be using Platform.allocateDirectBuffer for NioBufferedFileInputStream as well @JoshRosen ?

The difference between ByteBuffer.allocateDirect and Platform.allocateDirectBuffer is that the latter bypasses / ignores the JVM's -XX:MaxDirectMemorySize limit.

Given that almost all other Spark-initiated allocations use the Platform version, we probably should make that change.

That said, I also spot another ByteBuffer.allocateDirect usage at

spark/core/src/main/scala/org/apache/spark/storage/DiskStore.scala

Lines 327 to 328 in 9b9a7a7

    private val buffer = ByteBuffer.allocateDirect(64 * 1024)
    buffer.flip()

plus a potential need for additional StorageUtils.dispose() calls for that other call in ReadableChannelFileRegion (which is only used by EncryptedBlockData, as far as I know, though), so perhaps it would be better to update both of those in a separate PR instead of doing it here.
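
As a rough illustration of the JVM-side accounting discussed above, the sketch below uses only JDK APIs to show that a buffer obtained from ByteBuffer.allocateDirect is counted in the JVM's "direct" buffer pool, which is the pool that -XX:MaxDirectMemorySize caps. It does not call Spark's Platform.allocateDirectBuffer, so it demonstrates only the JDK half of the comparison. The class and method names are hypothetical.

```java
import java.lang.management.BufferPoolMXBean;
import java.lang.management.ManagementFactory;
import java.nio.ByteBuffer;

public final class DirectPoolAccounting {
    private DirectPoolAccounting() {}

    /** Bytes currently tracked by the JVM's "direct" buffer pool, or -1 if the pool is not found. */
    public static long directPoolBytes() {
        for (BufferPoolMXBean pool : ManagementFactory.getPlatformMXBeans(BufferPoolMXBean.class)) {
            if ("direct".equals(pool.getName())) {
                return pool.getMemoryUsed();
            }
        }
        return -1L;
    }

    /** Allocates a direct buffer through the JDK and returns how much the pool counter grew. */
    public static long poolGrowthAfterAllocating(int capacity) {
        long before = directPoolBytes();
        ByteBuffer buffer = ByteBuffer.allocateDirect(capacity);
        long after = directPoolBytes();
        // Touch the buffer after reading the counter so it stays reachable until here.
        buffer.put(0, (byte) 1);
        return after - before;
    }

    public static void main(String[] args) {
        int capacity = 64 * 1024;
        long growth = poolGrowthAfterAllocating(capacity);
        System.out.println("direct pool grew by " + growth + " bytes for a " + capacity + "-byte buffer");
    }
}
```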

cxzl25 · 2024-08-22T03:49:05Z

I also found another one, which uses Platform.allocateDirectBuffer when initializing, but uses ByteBuffer.allocateDirect when growing.

We can do this in another PR.

spark/core/src/main/scala/org/apache/spark/util/DirectByteBufferOutputStream.scala

Lines 31 to 32 in 4ceacbe

    private[spark] class DirectByteBufferOutputStream(capacity: Int) extends OutputStream {
      private var buffer = Platform.allocateDirectBuffer(capacity)

spark/core/src/main/scala/org/apache/spark/util/DirectByteBufferOutputStream.scala

Lines 60 to 63 in 4ceacbe

    val newBuffer = ByteBuffer.allocateDirect(newCapacity)
    newBuffer.put(oldBuffer)
    StorageUtils.dispose(oldBuffer)
    buffer = newBuffer
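
One way to avoid this kind of mismatch is to route both the initial allocation and every later growth through a single allocator. The following is a minimal sketch of that idea in plain Java, not Spark's DirectByteBufferOutputStream: the class GrowableDirectBuffer and its methods are hypothetical, the allocator is passed in as a function, and the old buffer is simply left to the garbage collector because StorageUtils.dispose is Spark-specific.

```java
import java.nio.ByteBuffer;
import java.util.function.IntFunction;

public final class GrowableDirectBuffer {
    // Single allocation strategy used both at construction and when growing.
    private final IntFunction<ByteBuffer> allocator;
    private ByteBuffer buffer;

    public GrowableDirectBuffer(int capacity, IntFunction<ByteBuffer> allocator) {
        this.allocator = allocator;
        this.buffer = allocator.apply(capacity);
    }

    public void write(byte[] bytes) {
        ensureCapacity(buffer.position() + bytes.length);
        buffer.put(bytes);
    }

    private void ensureCapacity(int minCapacity) {
        if (minCapacity <= buffer.capacity()) {
            return;
        }
        int newCapacity = Math.max(minCapacity, buffer.capacity() * 2);
        ByteBuffer oldBuffer = buffer;
        oldBuffer.flip();                              // switch to reading what was written so far
        ByteBuffer newBuffer = allocator.apply(newCapacity);
        newBuffer.put(oldBuffer);                      // copy existing contents
        buffer = newBuffer;                            // old buffer is not explicitly freed in this sketch
    }

    public int size() {
        return buffer.position();
    }

    public int capacity() {
        return buffer.capacity();
    }

    public static void main(String[] args) {
        GrowableDirectBuffer out = new GrowableDirectBuffer(4, ByteBuffer::allocateDirect);
        out.write(new byte[10]);
        System.out.println("size=" + out.size() + ", capacity=" + out.capacity());
    }
}
```

In Spark itself the allocator would presumably be Platform.allocateDirectBuffer in both places, which is what the follow-up PR proposes.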

mridulm · 2024-08-23T07:36:39Z

Merged to master.
Thanks for fixing this @cxzl25 !
Thanks for the review @JoshRosen :-)

mridulm · 2024-08-23T07:36:56Z

@cxzl25, please do verify if the jira has been updated correctly - thanks !

…yteBuffer.allocateDirect`

### What changes were proposed in this pull request?

This PR aims to use `Platform.allocateDirectBuffer` instead of `ByteBuffer.allocateDirect`.

### Why are the changes needed?

#47733 (review)

Allocating off-heap memory should use the `allocateDirectBuffer` API provided by `Platform`.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

GA

### Was this patch authored or co-authored using generative AI tooling?

No

Closes #47987 from cxzl25/SPARK-49509.

Authored-by: sychen <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>

…yteBuffer.allocateDirect` This PR aims to use `Platform.allocateDirectBuffer` instead of `ByteBuffer.allocateDirect`. #47733 (review) Allocating off-heap memory should use the `allocateDirectBuffer` API provided `by Platform`. No GA No Closes #47987 from cxzl25/SPARK-49509. Authored-by: sychen <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]> (cherry picked from commit 2ed6c3e) Signed-off-by: Dongjoon Hyun <[email protected]>

…afeShuffleWriter

### What changes were proposed in this pull request?

This PR aims to support separate buffer size configuration in UnsafeShuffleWriter.

Introduce `spark.shuffle.file.merge.buffer` configuration.

### Why are the changes needed?

`UnsafeShuffleWriter#mergeSpillsWithFileStream` uses `spark.shuffle.file.buffer` as the buffer for reading spill files, and this buffer is an off-heap buffer.

In the spill process, we hope that the buffer size is larger, but once there are too many files in the spill, `UnsafeShuffleWriter#mergeSpillsWithFileStream` needs to create a lot of off-heap memory, which makes the executor easily killed by YARN.

https://github.com/apache/spark/blob/e72d21c299a450e48b3cf6e5d36b8f3e9a568088/core/src/main/java/org/apache/spark/shuffle/sort/UnsafeShuffleWriter.java#L372-L375

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

Production environment verification

### Was this patch authored or co-authored using generative AI tooling?

No

Closes apache#47733 from cxzl25/SPARK-49217.

Authored-by: sychen <[email protected]>
Signed-off-by: Mridul Muralidharan <mridul<at>gmail.com>

…yteBuffer.allocateDirect` (apache#557) This PR aims to use `Platform.allocateDirectBuffer` instead of `ByteBuffer.allocateDirect`. apache#47733 (review) Allocating off-heap memory should use the `allocateDirectBuffer` API provided `by Platform`. No GA No Closes apache#47987 from cxzl25/SPARK-49509. Authored-by: sychen <[email protected]> (cherry picked from commit 2ed6c3e) Signed-off-by: Dongjoon Hyun <[email protected]> Co-authored-by: sychen <[email protected]>

buffer

484acb3

github-actions bot added DOCS CORE labels Aug 13, 2024

yaooqinn requested a review from mridulm August 14, 2024 07:04

mridulm reviewed Aug 14, 2024

JoshRosen approved these changes Aug 21, 2024

mridulm approved these changes Aug 23, 2024

mridulm closed this in d84f1a3 Aug 23, 2024

cxzl25 mentioned this pull request Sep 4, 2024

[SPARK-49509][CORE] Use Platform.allocateDirectBuffer instead of ByteBuffer.allocateDirect #47987

Closed

SteNicholas mentioned this pull request Sep 6, 2024

[CELEBORN-1588] Use Platform.allocateDirectBuffer instead of ByteBuffer.allocateDirect apache/celeborn#2729

Closed
