Use ArrowScan.to_table to replace project_table #1180
Use ArrowScan.to_table to replace project_table in these files:
* pyiceberg\table\__init__.py
* pyiceberg\io\pyarrow.py
* pyiceberg\test_pyarrow.py
Replace all remaining uses of project_table with ArrowScan.to_table
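For readers following the thread, the substitution these commits make can be sketched roughly as below. This is a minimal sketch, not the PR's actual diff. It assumes the `ArrowScan` constructor accepts `table_metadata`, `io`, `projected_schema`, `row_filter`, `case_sensitive`, and `limit` as keyword arguments, and that the deprecated `project_table` took its arguments in the order shown in the comment. The helper name `scan_to_arrow` is hypothetical and exists only for illustration.

```python
from typing import Any, Iterable, Optional


def scan_to_arrow(
    tasks: Iterable[Any],
    table_metadata: Any,
    io: Any,
    row_filter: Any,
    projected_schema: Any,
    case_sensitive: bool = True,
    limit: Optional[int] = None,
) -> Any:
    """Hypothetical helper that takes the same inputs as the old project_table call."""
    # Imported lazily so this module still loads where pyiceberg is not installed.
    from pyiceberg.io.pyarrow import ArrowScan

    # Before (deprecated, assumed argument order):
    #   project_table(tasks, table_metadata, io, row_filter, projected_schema,
    #                 case_sensitive=case_sensitive, limit=limit)
    # After: configure the scan once, then materialize the tasks into a table.
    scan = ArrowScan(
        table_metadata=table_metadata,
        io=io,
        projected_schema=projected_schema,
        row_filter=row_filter,
        case_sensitive=case_sensitive,
        limit=limit,
    )
    return scan.to_table(tasks)
```

Keyword arguments are used on purpose here: the scan configuration moves into the constructor and only the tasks are passed to `to_table`, so naming each argument avoids depending on positional order.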
Hi @JE-Chen, thank you for putting together this PR! It looks like these were missed in the initial refactoring. Running the CI now.
Fix format
Modify by ruff
Hi @sungwy, I have already fixed the problem that ruff found.
kevinjqliu
left a comment
Thanks for the contribution!
Looks like this captures all the project_table usage https://github.com/search?q=repo%3Aapache%2Ficeberg-python%20project_table&type=code
Let's double-check the logs for make test and make test-integration to make sure there are no deprecation warnings related to project_table
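One way to make this kind of check mechanical, rather than reading logs by eye, is to record warnings around a call and filter for the deprecated name. The sketch below uses only the standard `warnings` module. The function `legacy_call` is a made-up stand-in for any deprecated function, and its warning text is invented for the example; it is not pyiceberg's actual message.

```python
import warnings
from typing import Callable, List


def legacy_call() -> None:
    """Hypothetical stand-in for a deprecated function (not a pyiceberg API)."""
    warnings.warn("legacy_call is deprecated", DeprecationWarning, stacklevel=2)


def deprecation_messages(func: Callable[[], object], needle: str) -> List[str]:
    """Run func and return the DeprecationWarning messages that mention needle."""
    with warnings.catch_warnings(record=True) as caught:
        # Record every warning, even ones the default filters would hide.
        warnings.simplefilter("always")
        func()
    return [
        str(w.message)
        for w in caught
        if issubclass(w.category, DeprecationWarning) and needle in str(w.message)
    ]


# An empty list means the call path no longer triggers the deprecated function.
print(deprecation_messages(legacy_call, "legacy_call"))
print(deprecation_messages(lambda: None, "legacy_call"))
```

For a whole test suite, pytest's `-W error::DeprecationWarning` option offers a similar guarantee by turning such warnings into failures, though it would also flag deprecations unrelated to project_table.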
Here are the make test-integration logs that the CI run generated.
The make test-integration log on my VM shows many errors. Any ideas? (It succeeds in the CI run.)
What kind of errors are you running into, @JE-Chen? Could you provide us with an example?
steps
platform linux -- Python 3.11.9, pytest-7.4.4, pluggy-1.5.0 -- /home/jeffrey/Desktop/GItHub_Project/iceberg-python/venv/bin/python
Thank you for sending over the reproducible steps @JE-Chen - I do not use PyCharm for my development (I'm a VSCode user) so it might take a while for me to reproduce your setup.
iceberg-python/pyiceberg/io/__init__.py Lines 320 to 322 in de47590
This is a warning message, and I'm used to seeing it in my integration tests as well, but it didn't cause any of the tests to fail in my case. Are you able to share the verbose error trace for a failing test case, so we can investigate what it's failing on?
Here is the full error log.
What VM are you running the integration tests on? I have a theory that a previous change is causing memory leaks (see #1167). Can you try to revert it and run the integration test?
The memory leak still occurred after reverting. I am running VMware Workstation 17 Pro (Ubuntu 24.04.1) with these settings:
Gotcha, thanks! I think the CI runs on Ubuntu, so it should mirror your own setup.
Seems like it's an issue with your Docker installation.
Here's what I found for "VMware Workstation 17 Pro":
@JE-Chen I was able to run the integration tests for this PR locally. Given that CI also passes, let's merge this and debug your VM environment issue separately.
Sounds great.
kevinjqliu
left a comment
LGTM, confirmed no more warnings
* Use ArrowScan.to_table to replace project_table
* Use ArrowScan.to_table to replace project_table in these files:
  ** pyiceberg\table\__init__.py
  ** pyiceberg\io\pyarrow.py
  ** pyiceberg\test_pyarrow.py
* Replace all remaining uses of project_table with ArrowScan.to_table
* Fix format
* Modify by ruff
PR #1119