
Conversation

@original-brownbear
Contributor

@original-brownbear original-brownbear commented Jul 18, 2019

  • Copying the request is not necessary for requests that don't escape their content BytesReference in a way that keeps it referenced after the request has been responded to. We can simply release the content once the response has been generated and thereby avoid a lot of Unpooled allocations for many, if not most, requests (see the sketch after this list). For now, this PR makes it so that search and bulk requests are not copied to Unpooled buffers.
  • Relates Reduce garbage for requests with unpooled buffers #32228
    • I think the issue that prevented that PR from being merged was solved by Optimize Bulk Message Parsing and Message Length Parsing #39634, which moved the bulk index marker search to ByteBuf bulk access, so the composite buffer shouldn't require many additional bounds checks (I'd argue the bounds checks we add are offset by the ones we'd otherwise pay when copying the composite buffer).
  • I couldn't necessarily reproduce much of a speedup from this change (though no slowdown either), but I could reproduce a very measurable reduction in GC time with e.g. Rally's PMC (a 4g-heap node with bulk requests of size 5k saw a ~10% reduction in young GC time and a ~50% reduction in old-gen GC time for me).
  • I think this also improves the efficacy of the real-memory circuit breaker quite a bit, since it strengthens the correlation between request size and the actual memory used by the request (we no longer have a bunch of dangling, to-be-GCed request copies hanging around).
  • Also somewhat relates Throttle Network Reads on Memory Pressure #44484
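
A minimal sketch of the idea in plain Netty terms (illustrative code, not the actual Elasticsearch implementation; the method names are made up): instead of copying the pooled request content into an `Unpooled` heap buffer up front, the pooled buffer is kept and released back to the pool once the response has been generated.

```java
import io.netty.buffer.ByteBuf;
import io.netty.buffer.Unpooled;

public final class RequestContentLifecycle {

    // Old approach: copy the pooled content into an unpooled heap buffer so the
    // pooled buffer can be released immediately; the copy then lingers as garbage
    // for the GC after the request has been handled.
    static ByteBuf copyAndRelease(ByteBuf pooledContent) {
        ByteBuf unpooledCopy = Unpooled.copiedBuffer(pooledContent);
        pooledContent.release();
        return unpooledCopy;
    }

    // Rough idea of this PR: keep the pooled buffer and release it only once the
    // response for this request has been generated, avoiding the extra allocation.
    static void releaseOnResponse(ByteBuf pooledContent, Runnable generateResponse) {
        try {
            generateResponse.run(); // the handler reads pooledContent here
        } finally {
            pooledContent.release(); // buffer returns to the pool
        }
    }
}
```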

@elasticmachine
Collaborator

Pinging @elastic/es-distributed

@Tim-Brooks
Contributor

This PR relies on the fact that all REST actions will be done with the request content by the time they send a response. I'm not sure that this is a safe assumption.

@original-brownbear
Contributor Author

@tbrooks8 it appears there's only a single spot where this doesn't hold: https://github.com/elastic/elasticsearch/pull/44564/files#diff-05216fd3df413bda2df9486bce7a4e29R51

I would also argue that holding on to these bytes without copying is something that shouldn't be happening implicitly anywhere. Otherwise the whole memory behavior of the REST layer becomes pretty unpredictable. In the case of storing the pipeline source it's ok to use the request body for that as a special case, but if we generally allow request contents to be referenced beyond responding to the request the whole notion of the circuit breaker becomes wrong imo.
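
To illustrate the special case mentioned above: if a handler genuinely needs the body after the response has been sent (as with the pipeline source), it should make its own explicit copy rather than silently retaining the pooled request buffer. A hypothetical sketch (the method name is illustrative, not from the codebase):

```java
import io.netty.buffer.ByteBuf;

final class RetainBeyondResponse {

    // Explicitly copy the readable bytes so the result stays valid after the
    // pooled request buffer has been released back to the pool.
    static byte[] copyForLaterUse(ByteBuf requestContent) {
        byte[] ownCopy = new byte[requestContent.readableBytes()];
        requestContent.getBytes(requestContent.readerIndex(), ownCopy);
        return ownCopy;
    }
}
```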

@original-brownbear
Contributor Author

Jenkins run elasticsearch-ci/packaging-sample

@original-brownbear original-brownbear marked this pull request as ready for review July 19, 2019 14:16
@Tim-Brooks
Contributor

This probably requires a team discuss.

@original-brownbear
Contributor Author

Sure, added team-discuss :)

@original-brownbear
Contributor Author

Putting this back into WIP, thanks @tbrooks8 for pointing out that org.elasticsearch.transport.netty4.ByteBufBytesReference#toBytesRef is not safe here.

@original-brownbear
Contributor Author

Thanks Yannick. I simplified the handling of content in 1812be4, which also simplified the concurrency of this object. As explained above, it was kind of pointless to optimize the unpooled case by setting the content to null this way.

Not sure about a test for the unpooled buffer case, given the effort required for that. Maybe we could add those (tricky) tests in a follow-up and use the logic from #44881, which allows for more holistically optimizing the Unpooled case across all message sizes?

Contributor

@ywelsch ywelsch left a comment


LGTM. Let's have @tbrooks8 ok this again as well.

@original-brownbear
Contributor Author

Linking #49699, which most likely shows an OOM caused by the copies avoided here.

@original-brownbear
Contributor Author

Ping @tbrooks8: there shouldn't be much left to do here since the changes from the last time you looked at it are minor :) Thanks!

Contributor

@Tim-Brooks Tim-Brooks left a comment


I guess I missed this in the last review cycle, but I don't understand why we completely reverted this for NIO?

I thought you were going the direction of always copying instead of doing the cast optimization.

I think we should still use the existing buffer when allowsUnsafeBuffers returns true. It's just that we should not attempt the non-copy when allowsUnsafeBuffers returns false and we think the buffer is unpooled (since we do not reliably know that).

I guess we can bring NIO back in a follow-up so I'll approve this.
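
A sketch of the rule described here, assuming a boolean flag equivalent to the handler's allowsUnsafeBuffers setting (illustrative code, not the actual implementation):

```java
import io.netty.buffer.ByteBuf;
import io.netty.buffer.Unpooled;

final class BufferHandoff {

    // Hand the existing (possibly pooled) buffer to the handler only when it
    // opts in via allowsUnsafeBuffers; otherwise take a defensive copy, since
    // we cannot reliably tell whether the buffer is unpooled.
    static ByteBuf contentFor(boolean allowsUnsafeBuffers, ByteBuf existing) {
        if (allowsUnsafeBuffers) {
            return existing; // zero-copy hand-off, released once the response is sent
        }
        ByteBuf copy = Unpooled.copiedBuffer(existing); // safe, heap-backed copy
        existing.release();
        return copy;
    }
}
```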

@original-brownbear
Contributor Author

@tbrooks8 thanks!

I guess I missed this in the last review cycle, but I don't understand why we completely reverted this for NIO?

🤦‍♂️ ... I completely misread your comment on the NIO situation back in the initial review ... will open the follow-up tomorrow :)

@original-brownbear original-brownbear merged commit 5ddf920 into elastic:master Dec 3, 2019
@original-brownbear original-brownbear deleted the never-copy-http-buffer branch December 3, 2019 21:44
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Dec 3, 2019
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Dec 4, 2019
@original-brownbear
Contributor Author

NIO version in #49819 :)

original-brownbear added a commit that referenced this pull request Dec 4, 2019
SivagurunathanV pushed a commit to SivagurunathanV/elasticsearch that referenced this pull request Jan 23, 2020
original-brownbear added a commit that referenced this pull request Jan 24, 2020
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Jan 24, 2020
original-brownbear added a commit that referenced this pull request Jan 24, 2020
@original-brownbear original-brownbear restored the never-copy-http-buffer branch August 6, 2020 18:33