bpo-45045: Optimize mapping patterns of structural pattern matching #28043

corona10 · 2021-08-29T15:58:52Z

https://bugs.python.org/issue45045

corona10 · 2021-08-29T16:00:29Z


+---------------+--------+----------------------+
| Benchmark     | base   | opt                  |
+===============+========+======================+
| bench pattern | 482 ns | 417 ns: 1.15x faster |
+---------------+--------+----------------------+

corona10 · 2021-08-29T16:16:24Z

Python/ceval.c

        goto fail;
    }
-    values = PyList_New(0);
+    values = PyTuple_New(nkeys);


The size of the tuple is predictable.

corona10 · 2021-08-29T16:17:24Z

Python/ceval.c

        }
-        PyObject *value = PyObject_CallFunctionObjArgs(get, key, dummy, NULL);
+        PyObject *args[] = { key, dummy };
+        PyObject *value = PyObject_Vectorcall(get, args, 2, NULL);


Just replacing PyObject_CallFunctionObjArgs shows a 2% performance enhancement on the micro benchmark.

Fidget-Spinner · 2021-08-29T16:40:46Z

The changes LGTM. Tested locally on Win64:

python -m test test_patma -R 3:3
0:00:00 Run tests sequentially
0:00:00 [1/1] test_patma
beginning 6 repetitions
123456
......

== Tests result: SUCCESS ==

1 test OK.

BTW, I was thinking if using _PyObject_GetMethod instead of _PyObject_GetAttrId will make your benchmark faster? The diff from your current is not too large:

@@ -846,7 +846,9 @@ match_keys(PyThreadState *tstate, PyObject *map, PyObject *keys)
     // - Don't cause key creation or resizing in dict subclasses like
     //   collections.defaultdict that define __missing__ (or similar).
     _Py_IDENTIFIER(get);
-    PyObject *get = _PyObject_GetAttrId(map, &PyId_get);
+    PyObject *get_name = _PyUnicode_FromId(&PyId_get); // borrowed
+    PyObject *get = NULL;
+    int meth_found = _PyObject_GetMethod(map, get_name, &get);
     if (get == NULL) {
         goto fail;
     }
@@ -873,8 +875,14 @@ match_keys(PyThreadState *tstate, PyObject *map, PyObject *keys)
             }
             goto fail;
         }
-        PyObject *args[] = { key, dummy };
-        PyObject *value = PyObject_Vectorcall(get, args, 2, NULL);
+        PyObject *args[] = { map, key, dummy };
+        PyObject *value = NULL;
+        if (meth_found) {
+            value = PyObject_Vectorcall(get, args, 3, NULL);
+        }
+        else {
+            value = PyObject_Vectorcall(get, &args[1], 2, NULL);
+        }
         if (value == NULL) {
             goto fail;
         }

corona10 · 2021-08-29T16:50:39Z

@Fidget-Spinner
Yeah it's better!


➜  cpython git:([bpo-45045](https://bugs.python.org/issue45045)) ✗ ./python.exe -m pyperf compare_to --table base.json suggestion.json
+---------------+--------+----------------------+
| Benchmark     | base   | suggestion           |
+===============+========+======================+
| bench pattern | 482 ns | 373 ns: 1.29x faster |
+---------------+--------+----------------------+
➜  cpython git:([bpo-45045](https://bugs.python.org/issue45045)) ✗ ./python.exe -m pyperf compare_to --table opt.json suggestion.json
+---------------+--------+----------------------+
| Benchmark     | opt    | suggestion           |
+===============+========+======================+
| bench pattern | 417 ns | 373 ns: 1.12x faster |
+---------------+--------+----------------------+

corona10 · 2021-08-29T16:53:51Z

With new commit

0:00:00 load avg: 5.05 Run tests sequentially
0:00:00 load avg: 5.05 [1/1] test_patma
beginning 6 repetitions
123456
......

== Tests result: SUCCESS ==

1 test OK.

Total duration: 1.1 sec
Tests result: SUCCESS

Fidget-Spinner

LGTM. Thanks!

corona10 · 2021-08-30T10:03:40Z

@Fidget-Spinner Thanks for the review.

Here is the final benchmark with optimization build with thin LTO :)

+---------------+---------------+----------------------+
| Benchmark     | thin_lto_base | thin_lto_opt         |
+===============+===============+======================+
| bench pattern | 357 ns        | 287 ns: 1.24x faster |
+---------------+---------------+----------------------+

corona10 requested a review from markshannon as a code owner August 29, 2021 15:58

the-knights-who-say-ni added the CLA signed label Aug 29, 2021

bedevere-bot added the awaiting core review label Aug 29, 2021

corona10 added the skip news label Aug 29, 2021

corona10 requested a review from brandtbucher August 29, 2021 15:59

bpo-45045: Optimize mapping patterns of structural pattern matching

696d0bd

corona10 force-pushed the bpo-45045 branch from 183ba01 to 696d0bd Compare August 29, 2021 16:00

corona10 commented Aug 29, 2021

View reviewed changes

Python/ceval.c

goto fail;

}

values = PyList_New(0);

values = PyTuple_New(nkeys);

Copy link

Member Author

corona10 Aug 29, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The size of the tuple is predictable.

corona10 commented Aug 29, 2021

View reviewed changes

bpo-45045: Address code review

c95a7ea

bpo-45045 Add unbound method test case

7a30496

corona10 force-pushed the bpo-45045 branch from fd627dc to 7a30496 Compare August 29, 2021 17:43

corona10 requested a review from Fidget-Spinner August 29, 2021 17:47

bpo-45045: nit

71fe76d

Fidget-Spinner approved these changes Aug 30, 2021

View reviewed changes

bedevere-bot added awaiting merge and removed awaiting core review labels Aug 30, 2021

corona10 merged commit e6497fe into python:main Aug 30, 2021

bedevere-bot removed the awaiting merge label Aug 30, 2021

corona10 deleted the bpo-45045 branch August 30, 2021 10:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

bpo-45045: Optimize mapping patterns of structural pattern matching #28043

bpo-45045: Optimize mapping patterns of structural pattern matching #28043

Uh oh!

corona10 commented Aug 29, 2021 •

edited by bedevere-bot

Loading

Uh oh!

corona10 commented Aug 29, 2021

Uh oh!

corona10 Aug 29, 2021

Uh oh!

corona10 Aug 29, 2021

Uh oh!

Fidget-Spinner commented Aug 29, 2021 •

edited

Loading

Uh oh!

corona10 commented Aug 29, 2021 •

edited by bedevere-bot

Loading

Uh oh!

corona10 commented Aug 29, 2021

Uh oh!

Fidget-Spinner left a comment

Uh oh!

corona10 commented Aug 30, 2021 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

bpo-45045: Optimize mapping patterns of structural pattern matching #28043

bpo-45045: Optimize mapping patterns of structural pattern matching #28043

Uh oh!

Conversation

corona10 commented Aug 29, 2021 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

corona10 commented Aug 29, 2021

Uh oh!

corona10 Aug 29, 2021

Choose a reason for hiding this comment

Uh oh!

corona10 Aug 29, 2021

Choose a reason for hiding this comment

Uh oh!

Fidget-Spinner commented Aug 29, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

corona10 commented Aug 29, 2021 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

corona10 commented Aug 29, 2021

Uh oh!

Fidget-Spinner left a comment

Choose a reason for hiding this comment

Uh oh!

corona10 commented Aug 30, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

corona10 commented Aug 29, 2021 •

edited by bedevere-bot

Loading

Fidget-Spinner commented Aug 29, 2021 •

edited

Loading

corona10 commented Aug 29, 2021 •

edited by bedevere-bot

Loading

corona10 commented Aug 30, 2021 •

edited

Loading