-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-20356][SQL] Pruned InMemoryTableScanExec should have correct output partitioning and ordering #17679
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #75927 has finished for PR 17679 at commit
|
|
Test build #75926 has finished for PR 17679 at commit
|
|
@viirya Thank you for a quick fix. The change looks good to me. I have a question. Before the fix, we changed the output partitioning of relation's child. But how come it was not reflected on the plan ? If it was reflected on the plan then we could quickly figure out whats wrong ? Here is the plan before this fix. Is there any indication on the plan that partitioning info got changed ? Just want to learn :-) |
|
@dilipbiswal As outputPartitioning/Ordering is not one of arguments of query plan, it won't be shown in the string representation. |
|
@viirya Ok.. thank you. |
|
Btw, you can sense there might be a problem since the difference of output between |
|
@viirya Isn't that a normal thing simon due to column pruning ? Is that stuff tied to partitioning somehow ? |
|
Oh, as the outputPartitioning/Ordering is strongly related to output, so when the output is changed, it quite indicates the partitioning/ordering can be wrong. |
|
@viirya i see. Thanks :-) |
|
will we return invalid |
|
@cloud-fan I've raised similar question before in a PR. I remember I got an answer that an invalid |
|
A similar example is |
…utput partitioning and ordering ## What changes were proposed in this pull request? The output of `InMemoryTableScanExec` can be pruned and mismatch with `InMemoryRelation` and its child plan's output. This causes wrong output partitioning and ordering. ## How was this patch tested? Jenkins tests. Please review http://spark.apache.org/contributing.html before opening a pull request. Author: Liang-Chi Hsieh <[email protected]> Closes #17679 from viirya/SPARK-20356. (cherry picked from commit 773754b) Signed-off-by: Wenchen Fan <[email protected]>
|
thanks, merging to master/2.2! |
|
Thanks! @cloud-fan |
…utput partitioning and ordering ## What changes were proposed in this pull request? The output of `InMemoryTableScanExec` can be pruned and mismatch with `InMemoryRelation` and its child plan's output. This causes wrong output partitioning and ordering. ## How was this patch tested? Jenkins tests. Please review http://spark.apache.org/contributing.html before opening a pull request. Author: Liang-Chi Hsieh <[email protected]> Closes apache#17679 from viirya/SPARK-20356.
What changes were proposed in this pull request?
The output of
InMemoryTableScanExeccan be pruned and mismatch withInMemoryRelationand its child plan's output. This causes wrong output partitioning and ordering.How was this patch tested?
Jenkins tests.
Please review http://spark.apache.org/contributing.html before opening a pull request.