[SPARK-14290][SPARK-13352][CORE][backport-1.6] avoid significant memory copy in Netty's tran… #12296

liyezhang556520 · 2016-04-11T08:10:22Z

What changes were proposed in this pull request?

When netty transfer data that is not FileRegion, data will be in format of ByteBuf, If the data is large, there will occur significant performance issue because there is memory copy underlying in sun.nio.ch.IOUtil.write, the CPU is 100% used, and network is very low.

In this PR, if data size is large, we will split it into small chunks to call WritableByteChannel.write(), so that avoid wasting of memory copy. Because the data can't be written within a single write, and it will call transferTo multiple times.

How was this patch tested?

Spark unit test and manual test.
Manual test:
sc.parallelize(Array(1,2,3),3).mapPartitions(a=>Array(new Array[Double](1024 * 1024 * 50)).iterator).reduce((a,b)=> a).length

For more details, please refer to SPARK-14290

…sferTo

liyezhang556520 · 2016-04-11T08:11:22Z

cc @davies

SparkQA · 2016-04-11T10:18:25Z

Test build #55516 has finished for PR 12296 at commit 9e37e7c.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

davies · 2016-04-11T17:06:41Z

LGTM, merging this into branch-1.6, thanks!

…emory copy in Netty's tran… ## What changes were proposed in this pull request? When netty transfer data that is not `FileRegion`, data will be in format of `ByteBuf`, If the data is large, there will occur significant performance issue because there is memory copy underlying in `sun.nio.ch.IOUtil.write`, the CPU is 100% used, and network is very low. In this PR, if data size is large, we will split it into small chunks to call `WritableByteChannel.write()`, so that avoid wasting of memory copy. Because the data can't be written within a single write, and it will call `transferTo` multiple times. ## How was this patch tested? Spark unit test and manual test. Manual test: `sc.parallelize(Array(1,2,3),3).mapPartitions(a=>Array(new Array[Double](1024 * 1024 * 50)).iterator).reduce((a,b)=> a).length` For more details, please refer to [SPARK-14290](https://issues.apache.org/jira/browse/SPARK-14290) Author: Zhang, Liye <[email protected]> Closes #12296 from liyezhang556520/apache-branch-1.6-spark-14290.

…emory copy in Netty's tran… When netty transfer data that is not `FileRegion`, data will be in format of `ByteBuf`, If the data is large, there will occur significant performance issue because there is memory copy underlying in `sun.nio.ch.IOUtil.write`, the CPU is 100% used, and network is very low. In this PR, if data size is large, we will split it into small chunks to call `WritableByteChannel.write()`, so that avoid wasting of memory copy. Because the data can't be written within a single write, and it will call `transferTo` multiple times. Spark unit test and manual test. Manual test: `sc.parallelize(Array(1,2,3),3).mapPartitions(a=>Array(new Array[Double](1024 * 1024 * 50)).iterator).reduce((a,b)=> a).length` For more details, please refer to [SPARK-14290](https://issues.apache.org/jira/browse/SPARK-14290) Author: Zhang, Liye <[email protected]> Closes apache#12296 from liyezhang556520/apache-branch-1.6-spark-14290. (cherry picked from commit baf2985) Conflicts: network/common/src/main/java/org/apache/spark/network/protocol/MessageWithHeader.java

SPARK-14290/SPARK-13352 avoid significant memory copy in Netty's tran…

9e37e7c

…sferTo

liyezhang556520 changed the title ~~[SPARK-14290][CORE][backport-1.6] avoid significant memory copy in Netty's tran…~~ [SPARK-14290][SPARK-13352][CORE][backport-1.6] avoid significant memory copy in Netty's tran… Apr 11, 2016

liyezhang556520 mentioned this pull request Apr 11, 2016

[SPARK-14290][CORE][Network] avoid significant memory copy in netty's transferTo #12083

Closed

liyezhang556520 closed this Apr 12, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-14290][SPARK-13352][CORE][backport-1.6] avoid significant memory copy in Netty's tran… #12296

[SPARK-14290][SPARK-13352][CORE][backport-1.6] avoid significant memory copy in Netty's tran… #12296

Uh oh!

liyezhang556520 commented Apr 11, 2016

Uh oh!

liyezhang556520 commented Apr 11, 2016

Uh oh!

SparkQA commented Apr 11, 2016

Uh oh!

davies commented Apr 11, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-14290][SPARK-13352][CORE][backport-1.6] avoid significant memory copy in Netty's tran… #12296

[SPARK-14290][SPARK-13352][CORE][backport-1.6] avoid significant memory copy in Netty's tran… #12296

Uh oh!

Conversation

liyezhang556520 commented Apr 11, 2016

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

liyezhang556520 commented Apr 11, 2016

Uh oh!

SparkQA commented Apr 11, 2016

Uh oh!

davies commented Apr 11, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants