-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-10304][SQL] Partition discovery should throw an exception if the dir structure is invalid #8840
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #42708 has finished for PR 8840 at commit
|
|
retest this please. |
|
Test build #42718 has finished for PR 8840 at commit
|
|
Jenkins, retest this please. |
|
@chenghao-intel, it looks like this PR's error message improvements are similar to the ones that you added as part of #8026. @viirya, could you take a look at @chenghao-intel's PR to see which approach you like better? |
|
Test build #44001 has finished for PR 8840 at commit
|
|
@JoshRosen ok, I will take a look at it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Move this assert out of the resolvePartitions, since it's not so tight with the original purpose of this function, and we can avoid the code change for this function.
|
Test build #44295 has finished for PR 8840 at commit
|
|
Test build #44316 has finished for PR 8840 at commit
|
|
@chenghao-intel @JoshRosen any comments? Is this patch ready? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we writhe these like this?
val (basePaths, pathsWithPartitionValues) = paths.flatMap {
}.unzip
|
LGTM |
|
Test build #44888 has finished for PR 8840 at commit
|
|
retest this please. |
|
Test build #44906 has finished for PR 8840 at commit
|
|
Should this one be backported to branch-1.5? cc @liancheng @yhuai |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you give a case that we will hit this branch? What will basePaths be at here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Which case is for the one I mentioned in the jira?
|
@rxin I think it is fine to not backport it. Without this, a user will get the DataFrame and when he/she queries it, it will fail (and the error message does not say what's the cause). Since, it does not really let users make any real progress, I think we do not really need to backport it. |
…ition discovery This patch follows up #8840. Author: Liang-Chi Hsieh <[email protected]> Closes #9459 from viirya/detect_invalid_part_dir_following.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why do we need !columns.isEmpty?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
oh i see. It is for something like table/a=1/_temporary/something, right?
JIRA: https://issues.apache.org/jira/browse/SPARK-10304
This patch detects if the structure of partition directories is not valid.
The test cases are from #8547. Thanks @zhzhan.
cc @liancheng