gh-101260: Return to a simpler output buffer for zlib, bz2 and lzma for better performance. #101279

rhpvorderman · 2023-01-24T09:27:59Z

See the accompanying issue for a detailed explanation. BlocksOutputBuffer was introduced in Python 3.10, and the benchmarks at the time showed improved performance for larger output buffers. Smaller output buffers were not benchmarked, however. In the accompanying issue I show why the previous buffer scheme works better for smaller output sizes. Since small data packets and streaming are more common than large in-memory compression, it is beneficial to switch back to the Python 3.9 style of arranging the buffers.

This also leads to a lot less code.
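For readers who want to check the small-output case on their own interpreter, here is a rough timing sketch. It is not the benchmark from the accompanying issue. The helper name `time_roundtrip`, the payload sizes and the repeat counts are assumptions chosen for illustration, and the results only become meaningful when the same script is run on two builds (for example 3.9 and 3.10+).

```python
import timeit
import zlib


def time_roundtrip(payload_size: int, repeats: int = 2000) -> float:
    """Return the average seconds per zlib compress+decompress round trip.

    `time_roundtrip` is a hypothetical helper, not part of this PR.
    """
    # Mildly compressible payload of exactly `payload_size` bytes.
    payload = (bytes(range(256)) * (payload_size // 256 + 1))[:payload_size]

    def roundtrip() -> bytes:
        return zlib.decompress(zlib.compress(payload))

    # Sanity check before timing: the round trip must be lossless.
    assert roundtrip() == payload
    return timeit.timeit(roundtrip, number=repeats) / repeats


if __name__ == "__main__":
    # Small sizes model packets and streaming chunks; the last one models
    # large in-memory compression.
    for size in (128, 1024, 16 * 1024, 1024 * 1024):
        repeats = 2000 if size <= 16 * 1024 else 20
        per_call = time_roundtrip(size, repeats)
        print(f"{size:>8} bytes: {per_call * 1e6:10.2f} us per round trip")
```

The same loop can be pointed at `bz2` or `lzma` by swapping the module-level `compress` and `decompress` calls.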

I changed one thing from the Python 3.9 code: the lzma and bz2 buffer sizes no longer follow a complex growth pattern but simply double the buffer (just like the zlib module). This only really hurts large in-memory compression, but that is an anti-pattern anyway.
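The doubling strategy can be modelled at the Python level as follows. This is an illustration only and not the C code in this PR: the function names `doubling_schedule` and `decompress_with_doubling` and the 16 KiB starting size are assumptions, and a `bytearray` stands in for the output buffer that the C modules manage.

```python
import zlib


def doubling_schedule(initial_size: int, total_output: int) -> list[int]:
    """List the buffer capacities visited when doubling until the output fits."""
    sizes = [initial_size]
    while sizes[-1] < total_output:
        sizes.append(sizes[-1] * 2)
    return sizes


def decompress_with_doubling(data: bytes, initial_size: int = 16 * 1024) -> bytes:
    """Decompress a zlib stream, doubling a notional capacity whenever it fills up.

    Hypothetical model: `capacity` only limits how much output is requested
    per step via the `max_length` argument of `Decompress.decompress`.
    """
    if initial_size <= 0:
        raise ValueError("initial_size must be positive")
    decomp = zlib.decompressobj()
    out = bytearray()
    capacity = initial_size
    pending = data
    while True:
        # Ask for at most the free space left in the notional buffer.
        # The value is always > 0 here; max_length=0 would mean "unlimited".
        out += decomp.decompress(pending, capacity - len(out))
        if decomp.eof:
            break
        pending = decomp.unconsumed_tail
        if len(out) == capacity:
            capacity *= 2  # buffer is full: double it, as zlib does
        elif not pending:
            raise ValueError("compressed stream is truncated")
    return bytes(out)


if __name__ == "__main__":
    original = b"streaming packet " * 10_000
    restored = decompress_with_doubling(zlib.compress(original), initial_size=1024)
    print(restored == original)
    print(doubling_schedule(1024, len(original)))
```

With doubling, the number of resize steps grows only logarithmically with the output size. The cost is that the final capacity can be up to twice what is needed, which is why the trade-off mostly affects large in-memory outputs.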

Issue: BlocksOutputBuffer causes a performance regression in bz2, lzma and zlib modules #101260

# Conflicts: # Modules/_bz2module.c

rhpvorderman added 12 commits January 24, 2023 06:38

Remove blocksoutputbuffer

78271bc

Fix error regarding maximum length

f838c1d

Add missing braces

8793f1a

Set return value properly to NULL in case of error

cf7c244

Remove blocksbuffer from bz2module

723ca68

Remove blocks output buffer

66199e4

More consistent bracing style

852bae0

Grow bz2 buffer more aggresively

a1eff84

Start on lzma module

b98450f

Finish _lzmamodule

6162715

Code style changes

38bef68

Add blurb

52945c6

bedevere-bot added the awaiting review label Jan 24, 2023

bedevere-bot mentioned this pull request Jan 24, 2023

BlocksOutputBuffer causes a performance regression in bz2, lzma and zlib modules #101260

Closed

Merge remote-tracking branch 'upstream/main' into pythongh-101260

b12be9b

# Conflicts: # Modules/_bz2module.c

rhpvorderman force-pushed the gh-101260 branch from 741bb89 to b12be9b Compare May 11, 2023 06:03

gpshead marked this pull request as draft May 11, 2023 06:25

bedevere-bot removed the awaiting review label May 11, 2023

gpshead added the DO-NOT-MERGE label May 11, 2023

rhpvorderman closed this Sep 16, 2024
