bpo-37295: Use constant-time comb() for larger n depending on k #30305

serhiy-storchaka · 2021-12-30T19:22:08Z

https://bugs.python.org/issue37295

Modules/mathmodule.c

mdickinson · 2021-12-31T16:54:18Z

Code changes look good to me (modulo the compiler warnings on Windows). I'm running some timings.

mdickinson · 2021-12-31T17:14:43Z

> I'm running some timings.

Running the same timings scripts that are posted to the issue, I see roughly a 5% slowdown for small comb(n, k) computations (0 <= k <= n <= 67) on this branch, compared with master. But that's compensated for by the wider applicability of the fast path, so I'm fine with that. (EDIT: Fixed the upper limit for n.)

mdickinson

LGTM

Modules/mathmodule.c

tim-one · 2021-12-31T17:35:17Z

> Running the same timings scripts that are posted to the issue, I see roughly a 5% slowdown for small comb(n, k) computations (0 <= k <= n <= 67)

That's really hard to swallow, Mark. At, e.g., comb(50, 15), the pre-existing fast path did 14 64-bit multiplies and, more significantly, also 14 64-bit divides. How on Earth could that be faster than doing instead 2 multiplies and a shift, pretty much no matter how slow popcount is? A similar argument applies to almost all cases taken over by the newer fast path, and a slowdown is the more astonishing the larger k is. For k == 1 the new fast path is clearly slower, and k == 2 is unclear without timing, but the new path seems a clear win for k > 2.

Related: it may or may not be a timing win to set fast_comb_limits1[1] to 0, so k==1 takes the second fast path, which just returns n without any arithmetic. Possibly also for fast_comb_limits1[2], depending on how slow division by 2 is.
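To make the operation counts in the comment above concrete, here is a minimal sketch of the two strategies being compared. It is not the code from Modules/mathmodule.c: the names `comb_mul_div`, `comb_mod_shift`, `odd_part` and `inv_odd_part` are hypothetical, the tables are built on first use rather than being precomputed constants, and the range is restricted to n <= 67 (the range quoted in the timings), where C(n, k) always fits in 64 bits.

```c
#include <assert.h>
#include <stdint.h>

#define MAX_N 67  /* C(n, k) fits in 64 bits for every k when n <= 67 */

/* Hypothetical tables, filled lazily; a real implementation would
 * more likely use precomputed constant arrays. */
static uint64_t odd_part[MAX_N + 1];      /* odd part of i!, mod 2**64 */
static uint64_t inv_odd_part[MAX_N + 1];  /* inverse of odd_part[i] mod 2**64 */
static int tables_ready = 0;

static unsigned popcount64(uint64_t x)
{
    unsigned count = 0;
    for (; x != 0; x &= x - 1) {  /* clear the lowest set bit each round */
        count++;
    }
    return count;
}

/* Inverse of an odd number modulo 2**64 by Newton iteration.
 * x = a is correct to 3 bits (a*a == 1 mod 8); each step doubles that. */
static uint64_t inverse_mod_2_64(uint64_t a)
{
    uint64_t x = a;
    for (int i = 0; i < 5; i++) {
        x *= 2 - a * x;
    }
    return x;
}

static void init_tables(void)
{
    odd_part[0] = 1;
    inv_odd_part[0] = 1;
    for (unsigned i = 1; i <= MAX_N; i++) {
        uint64_t m = i;
        while ((m & 1) == 0) {
            m >>= 1;  /* strip factors of two from i */
        }
        odd_part[i] = odd_part[i - 1] * m;  /* wraps mod 2**64 */
        inv_odd_part[i] = inverse_mod_2_64(odd_part[i]);
    }
    tables_ready = 1;
}

/* Iterative strategy: k-1 multiplies and k-1 divides.
 * The caller must ensure C(n, k) * k fits in 64 bits. */
static uint64_t comb_mul_div(uint64_t n, uint64_t k)
{
    if (k == 0) {
        return 1;
    }
    uint64_t result = n;
    for (uint64_t i = 1; i < k; i++) {
        result *= n - i;  /* now C(n, i) * (n - i) */
        result /= i + 1;  /* exact division: now C(n, i + 1) */
    }
    return result;
}

/* Table strategy: two multiplies mod 2**64, then a shift.
 * The power of two in C(n, k) is popcount(k) + popcount(n-k) - popcount(n). */
static uint64_t comb_mod_shift(unsigned n, unsigned k)
{
    assert(k <= n && n <= MAX_N);
    if (!tables_ready) {
        init_tables();
    }
    uint64_t odd = odd_part[n] * inv_odd_part[k] * inv_odd_part[n - k];
    unsigned shift = popcount64(k) + popcount64(n - k) - popcount64(n);
    return odd << shift;
}
```

For comb(50, 15), `comb_mul_div` runs its loop 14 times (14 multiplies and 14 divides), while `comb_mod_shift` does two multiplies, three popcounts and one shift regardless of k. For k == 1 the loop in `comb_mul_div` does not run at all and n is returned unchanged, which is the case the "Related" remark is about.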

mdickinson · 2021-12-31T18:23:16Z

> That's really hard to swallow, Mark.

I'm comparing this branch with the main branch. (Sorry, I said "master" above, but I meant "main".) The main branch already has the fast mod-2**64 + popcount code in it. This branch has the same, but with some extra up-front indirection, so it's a bit slower.

tim-one · 2021-12-31T18:29:55Z

> I'm comparing this branch with the main branch.

Ah, that explains it - sorry for the noise! 😃

Modules/mathmodule.c

arhadthedev · 2022-01-01T11:49:15Z

Modules/mathmodule.c

+        /* Maps k to the maximal n so that 2*k-1 <= n <= 127 and C(n, k)*k
+         * fits into a long long (which is at least 64 bit).  Only contains
+         * items larger than in fast_comb_limits1. */
        static const unsigned long long fast_comb_limits2[] = {


> which is at least 64 bit

C99 provides types like uint_least64_t that express such guarantees explicitly. So I think it would be better to #include <stdint.h> and use one:

Suggested change

-        /* Maps k to the maximal n so that 2*k-1 <= n <= 127 and C(n, k)*k
-         * fits into a long long (which is at least 64 bit).  Only contains
-         * items larger than in fast_comb_limits1. */
-        static const unsigned long long fast_comb_limits2[] = {
+        /* Maps k to the maximal n so that 2*k-1 <= n <= 127 and C(n, k)*k
+         * fits into a long long. Only contains
+         * items larger than in fast_comb_limits1. */
+        static const uint_least64_t fast_comb_limits2[] = {

serhiy-storchaka · 2022-01-01

The element type should be able to hold at least LLONG_MAX. There is no guarantee that uint_least64_t is at least as large as the positive range of long long.
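The distinction drawn in this reply can be checked at compile time. The sketch below is only an illustration, assuming a C11 compiler for `static_assert`; the helper name `least64_covers_llong` is hypothetical. The first two assertions hold on every conforming implementation, while the last property is the one the standard does not promise (it would fail if long long were wider than uint_least64_t).

```c
#include <assert.h>
#include <limits.h>
#include <stdint.h>

/* Guaranteed: the non-negative range of a signed integer type is a
 * subrange of the corresponding unsigned type. */
static_assert(ULLONG_MAX >= LLONG_MAX,
              "unsigned long long can hold LLONG_MAX");

/* Guaranteed: uint_least64_t has at least 64 value bits. */
static_assert(UINT_LEAST64_MAX >= 0xFFFFFFFFFFFFFFFFULL,
              "uint_least64_t has at least 64 bits");

/* Not guaranteed by the standard: uint_least64_t covering LLONG_MAX.
 * Returns 1 if it happens to hold on the current platform. */
static int least64_covers_llong(void)
{
    return UINT_LEAST64_MAX >= (unsigned long long)LLONG_MAX;
}
```

On common platforms, where long long is exactly 64 bits wide, `least64_covers_llong()` returns 1, so the difference is a matter of what the standard guarantees rather than of observed behavior.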

mdickinson

Still LGTM

bpo-37295: Use constant-time comb() for larger n depending on k

f9804ac

serhiy-storchaka added the performance and DO-NOT-MERGE labels Dec 30, 2021

serhiy-storchaka requested a review from mdickinson December 30, 2021 19:22

the-knights-who-say-ni added the CLA signed label Dec 30, 2021

bedevere-bot added the awaiting core review label Dec 30, 2021

serhiy-storchaka added 2 commits December 30, 2021 23:25

Optimize also perm().

0029e7e

Silence complier warnings.

4cbeee6

serhiy-storchaka removed the DO-NOT-MERGE label Dec 31, 2021

serhiy-storchaka marked this pull request as ready for review December 31, 2021 10:09

mdickinson reviewed Dec 31, 2021

mdickinson approved these changes Dec 31, 2021

bedevere-bot added awaiting merge and removed awaiting core review labels Dec 31, 2021

tim-one reviewed Dec 31, 2021

serhiy-storchaka mentioned this pull request Dec 31, 2021

bpo-37295: More direct computation of power-of-two factor in math.comb #30313

Merged

serhiy-storchaka commented Dec 31, 2021

serhiy-storchaka added 4 commits December 31, 2021 23:47

Merge branch 'main' into fast_comb

f9b0f28

Optimize limit tables and tiny refactoring.

d57c36b

Refactor tests.

0d1ed7c

Improve comments.

78efc24

arhadthedev reviewed Jan 1, 2022

mdickinson approved these changes Jan 9, 2022

serhiy-storchaka added the skip news label Jan 9, 2022

serhiy-storchaka merged commit 2d78797 into python:main Jan 9, 2022

bedevere-bot removed the awaiting merge label Jan 9, 2022

serhiy-storchaka deleted the fast_comb branch January 9, 2022 13:32

rhettinger mentioned this pull request Oct 6, 2023

Possible optimizations for math.comb() #81476

Closed
