Skip to content

Conversation

@allisonwang-db
Copy link
Contributor

@allisonwang-db allisonwang-db commented Nov 10, 2023

What changes were proposed in this pull request?

This PR makes dynamic Python data source registration session-scoped. Previously, registered data sources were stored in the sharedState and can be referenced by other sessions, which won't work with Spark Connect.

Why are the changes needed?

To make Python data source support Spark Connect in the future.

Does this PR introduce any user-facing change?

No

How was this patch tested?

New unit test

Was this patch authored or co-authored using generative AI tooling?

No

@allisonwang-db
Copy link
Contributor Author

cc @HyukjinKwon

@HyukjinKwon
Copy link
Member

Let's hold off for now if this doesn't block sth - I will followup and make some changes for #43630. If we make this session-based, it will give more work to do for SQL support, etc.

@allisonwang-db
Copy link
Contributor Author

@HyukjinKwon Sounds good! Thanks for looking into it.

@HyukjinKwon HyukjinKwon marked this pull request as draft November 14, 2023 04:15
@HyukjinKwon HyukjinKwon marked this pull request as ready for review November 16, 2023 02:07
@allisonwang-db allisonwang-db force-pushed the spark-45600-session-level branch 2 times, most recently from 0c41406 to 8281347 Compare November 20, 2023 19:34
@allisonwang-db allisonwang-db force-pushed the spark-45600-session-level branch from 8281347 to f6ff4cc Compare November 21, 2023 19:43
@allisonwang-db
Copy link
Contributor Author

cc @HyukjinKwon just fixed the merge conflicts! PTAL thanks

@HyukjinKwon
Copy link
Member

Merged to master.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants