-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-31578][R] Vectorize schema validation for arrow in types.R #28372
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #121913 has finished for PR 28372 at commit
|
|
Test build #121915 has finished for PR 28372 at commit
|
|
Test build #121918 has finished for PR 28372 at commit
|
|
I am going to merge. The tests are already passed, and I don't believe removing the return causes any test or build to break. |
|
linter failure looks transient to me:
|
### What changes were proposed in this pull request? Repeated `sapply` avoided in internal `checkSchemaInArrow` ### Why are the changes needed? Current implementation is doubly inefficient: 1. Repeatedly doing the same (95%) `sapply` loop 2. Doing scalar `==` on a vector (`==` should be done over the whole vector for efficiency) ### Does this PR introduce any user-facing change? No ### How was this patch tested? By my trusty friend the CI bots Closes #28372 from MichaelChirico/vectorize-types. Authored-by: Michael Chirico <[email protected]> Signed-off-by: HyukjinKwon <[email protected]> (cherry picked from commit 410fa91) Signed-off-by: HyukjinKwon <[email protected]>
|
Test build #121937 has finished for PR 28372 at commit
|
|
Merged to master and branch-3.0. |
What changes were proposed in this pull request?
Repeated
sapplyavoided in internalcheckSchemaInArrowWhy are the changes needed?
Current implementation is doubly inefficient:
sapplyloop==on a vector (==should be done over the whole vector for efficiency)Does this PR introduce any user-facing change?
No
How was this patch tested?
By my trusty friend the CI bots