Python JSONPath Version 2 #98

jg-rp · 2025-08-09T08:44:58Z

Looking ahead to Python JSONPath version 2, this PR includes breaking changes to the Python API and some subtle changes to the default JSONPath syntax. We have:

Changed the lexer so it emits more punctuation and whitespace tokens. Previously we broadly skipped some punctuation and whitespace. Now the parser can make better choices about when to accept whitespace and do a better job of enforcing dots.
Rewritten the parser and its token stream. It should now be more correct and easier to read.
Changed the internal representation of JSONPath segments and selectors. We now model segments explicitly.
Renamed "fake root" to "pseudo-root".
Dropped support for unquoted property names in bracketed segments.
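
To illustrate the first point, here is a minimal sketch of a lexer that emits whitespace and punctuation tokens instead of skipping them. It is not the lexer from this PR: the `Token` shape, the token kind names, and the `tokenize` function are all hypothetical, and the grammar covers only a tiny subset of JSONPath.

```python
import re
from typing import Iterator, NamedTuple


class Token(NamedTuple):
    """Hypothetical token shape, for illustration only."""

    kind: str
    value: str
    index: int


# Hypothetical token kinds covering a tiny subset of JSONPath.
RULES = [
    ("ROOT", r"\$"),
    ("DOT", r"\."),
    ("LBRACKET", r"\["),
    ("RBRACKET", r"\]"),
    ("NAME", r"[A-Za-z_][A-Za-z0-9_]*"),
    ("STRING", r"'[^']*'"),
    ("WHITESPACE", r"[ \t\r\n]+"),
]

PATTERN = re.compile("|".join(f"(?P<{kind}>{pattern})" for kind, pattern in RULES))


def tokenize(query: str) -> Iterator[Token]:
    """Yield every token, including whitespace, so the parser decides what to accept."""
    pos = 0
    while pos < len(query):
        match = PATTERN.match(query, pos)
        if match is None:
            raise SyntaxError(f"unexpected character {query[pos]!r} at index {pos}")
        yield Token(match.lastgroup, match.group(), pos)
        pos = match.end()


kinds = [token.kind for token in tokenize("$. foo")]
print(kinds)
```

Because the whitespace between the dot and the name survives as its own token, a parser built on this stream can choose to reject it, or accept it in a more lenient mode. The same applies to `$[foo]`, where the parser sees a bare `NAME` between brackets rather than a quoted `STRING`.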

More changes to follow before release:

Implement the singular path selector.
Implement the keys filter selector.
Remove shorthand arguments to some selector classes. We no longer need them.
Improve leading and trailing whitespace handling options so users can choose how strict to be.
If available, use the regex package instead of re for match and search function extensions.
Document the singular path selector and the keys filter selector.
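
The "use regex if available" item is commonly done with an optional import. The sketch below shows that pattern only. The alias `re_engine`, the `USING_REGEX` flag, and the `match`/`search` helpers are illustrative names, not the library's actual code, and the sketch ignores any translation between I-Regexp and Python pattern syntax.

```python
try:
    # Third-party package, used only if it happens to be installed.
    import regex as re_engine

    USING_REGEX = True
except ImportError:
    # Fall back to the standard library.
    import re as re_engine

    USING_REGEX = False


def match(value: str, pattern: str) -> bool:
    """Return True if the whole of `value` matches `pattern`."""
    return re_engine.fullmatch(pattern, value) is not None


def search(value: str, pattern: str) -> bool:
    """Return True if any substring of `value` matches `pattern`."""
    return re_engine.search(pattern, value) is not None


print(USING_REGEX, match("abc", "a.c"), search("hello world", "wor"))
```

Both modules expose `fullmatch` and `search` with compatible signatures, so the helpers do not need to know which one was imported.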

jg-rp · 2025-08-10T06:42:32Z

Some JSONPath performance notes, before attempting any new optimizations.

This benchmark is run on lots of small JSONPath queries with small data.

Main branch (89c0e7e)

(python-jsonpath) james@Jamess-Mac-mini python-jsonpath % python scripts/benchmark.py 
repeating 436 queries 100 times, best of 3 rounds
compile and find               1.392
compile and find (values)      1.400
just compile                   0.917
just find                      0.392
just find (values)             0.395

v2 branch (e41ec29)

(python-jsonpath) james@Jamess-Mac-mini python-jsonpath % python scripts/benchmark.py
repeating 436 queries 100 times, best of 3 rounds
compile and find               1.461
compile and find (values)      1.471
just compile                   0.949
just find                      0.413
just find (values)             0.418
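
To compare the two runs row by row, the relative change can be computed from the figures above. This is a small sketch; the dictionary names and the `percent_change` helper are illustrative and not part of scripts/benchmark.py.

```python
# Timings copied from the two benchmark runs above.
main = {
    "compile and find": 1.392,
    "compile and find (values)": 1.400,
    "just compile": 0.917,
    "just find": 0.392,
    "just find (values)": 0.395,
}

v2 = {
    "compile and find": 1.461,
    "compile and find (values)": 1.471,
    "just compile": 0.949,
    "just find": 0.413,
    "just find (values)": 0.418,
}


def percent_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, as a percentage."""
    return (after - before) / before * 100


changes = {name: percent_change(main[name], v2[name]) for name in main}

for name, change in changes.items():
    print(f"{name:<28}{main[name]:>7.3f}{v2[name]:>7.3f}{change:>+7.1f}%")
```

On these numbers, every row on the v2 branch is between roughly 3% and 6% slower than main.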

rob-ross · 2025-08-12T19:29:00Z

I am testing my Lexer against your test_lex.py code. It's still a work in progress. But I have converted your test data into a json file. You can get it here.

The only changes I made are:

I changed fake root to pseudo root.
I wrapped each test case in a dict/object with a single member "Token". I think this makes the json file a little clearer, although it introduces a slight wrinkle in your deserialization.

I'll probably be converting more of your tests like this as I proceed. It would make a little more work for you on your end to use them, as you'd have to write a load() method to deserialize them. But it would help us both out in the long run as we could each capture new bugs in the same file without having to modify any python code. And it would help me as you add new features, as I could use test-driven development with updated versions of the file after you introduce new features.
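
A `load()` function for a file like this could look roughly like the sketch below. The JSON layout is a guess made for illustration: the `tests`, `description`, `path` and `want` keys and the `kind`/`value`/`index` token fields are assumptions, and only the single-member "Token" wrapper comes from the description above.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    """Hypothetical token fields; the real file may differ."""

    kind: str
    value: str
    index: int


@dataclass(frozen=True)
class Case:
    description: str
    path: str
    want: list[Token]


def load(text: str) -> list[Case]:
    """Deserialize test cases, unwrapping each {"Token": {...}} object."""
    data = json.loads(text)
    return [
        Case(
            description=case["description"],
            path=case["path"],
            want=[Token(**item["Token"]) for item in case["want"]],
        )
        for case in data["tests"]
    ]


# Assumed file layout, inlined here so the example is self-contained.
EXAMPLE = """
{
  "tests": [
    {
      "description": "shorthand name",
      "path": "$.foo",
      "want": [
        {"Token": {"kind": "ROOT", "value": "$", "index": 0}},
        {"Token": {"kind": "DOT", "value": ".", "index": 1}},
        {"Token": {"kind": "NAME", "value": "foo", "index": 2}}
      ]
    }
  ]
}
"""

cases = load(EXAMPLE)
print(cases[0].path, [token.kind for token in cases[0].want])
```

A test module could then parametrize over `cases` and compare each `want` list with the lexer's output for `path`, so new cases are added by editing the JSON file alone.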

I hope this is useful!

Rob

jg-rp · 2025-08-13T06:27:45Z

> I have converted your test data into a json file.

Looks good 👍 I do like "golden files", especially when they apply to multiple projects.

Notice that this pull request - on the v2 branch - has changed tokens produced by the lexer quite a bit. Don't feel obliged to follow v2 instead of main, but it does fix some of the inconsistencies you pointed out in our previous discussions. And, with these changes, we will be able to configure JSONPath to strictly follow RFC 9535 without exception.

rob-ross · 2025-08-13T07:27:20Z

Well, it didn't take me long to sour on the idea of wrapping the tokens in a Map. It literally doubles the amount of code I have to write in Java to deserialize it, lol. It also adds extra characters, and thus file size, to the JSON file. So I'm redoing it in a simpler JSON format, which will also make it easier to load in Python. I can migrate test_lex.json to use the JSON file. I'll probably work on it tomorrow. For me.

jg-rp mentioned this pull request Aug 10, 2025

Non-standard syntax documentation and test coverage #87

Closed

jg-rp added 10 commits August 15, 2025 12:15

Version 2 WIP [skip ci]

4bfcb7c

Rewrite parser WIP [skip ci]

33fe76d

Fix canonical paths, compound paths and list literals

c7a10af

Remove shorthand arguments to Property, Wild and Keys selectors

e338a0c

Add "key" and "keys filter" JSONPath selectors

b4cb9c2

Test "extra" JSONPath syntax

7a55c02

Singular query selector stub [skip ci]

ea84ed9

Implement the singular query selector

9a1886e

Rename some selector classes and tidy.

616f438

Introduce strict mode and use regex if available

a5605a1

jg-rp force-pushed the v2 branch from 00f1905 to a5605a1 on August 15, 2025 11:28

jg-rp added 11 commits August 16, 2025 09:44

Add strict lexer rules

e98453d

Test for compliance in strict and lax mode

7e09eaf

Separate test cases for non-standard syntax into JSON files WIP

6e1f3b7

Assert that non-standard syntax fails in strict mode

225f686

Remember to test async path too

e087f70

More tidy of test cases

f6909ca

Enforce recursion limit and more tidying

9a73434

More tests and refactor parser.parse_query

71a43ba

Pretty exception messages

f384b63

Update docs WIP [skip ci]

efb0f7d

Syntax docs WIP [skip ci]

dd37e3d
