Skip to content

Conversation

@manuelsh
Copy link
Contributor

Pandarallel allows to execute pandas apply method in parallel, which allows to do data preprocessing faster and easier. This is very helpful in kernel only competitions.

For more information, see:

https://github.com/nalepae/pandarallel

Pandarallel allows to execute pandas apply method in parallel, which allows to do data preprocessing faster and easier. This is very he;pful in kernel only competiotions.

See:

https://github.com/nalepae/pandarallel
@manuelsh
Copy link
Contributor Author

@rosbo kindly review and comment if any, please

@rosbo
Copy link
Contributor

rosbo commented Apr 27, 2021

Hi,

Thanks @manuelsh for adding this package.

Could you also add a simple test_pandarralel.py test file similar to https://github.com/Kaggle/docker-python/blob/master/tests/test_pandas.py to prevent regression of this package?

You will be able to run your test locally using the following command:

./test -i gcr.io/kaggle-images/python:PR-1005-staging -p test_pandarralel.py

This will run your new test against the image built from this PR with the pandarralel package installed.

Thank you

@rosbo rosbo self-assigned this Apr 27, 2021
@rosbo rosbo self-requested a review April 27, 2021 23:47
@rosbo rosbo added the new-package Requests for installing new packages label Apr 27, 2021
Adding a simple test
@manuelsh
Copy link
Contributor Author

I built and run the test, all good! Thanks

test_pandarallel (test_pandarralel.TestPandarallel) ... ok

----------------------------------------------------------------------
Ran 1 test in 0.089s

OK

@rosbo rosbo merged commit 8d00db0 into Kaggle:master Apr 28, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

new-package Requests for installing new packages

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants