-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-8578] [SQL] Should ignore user defined output committer when appending data #6964
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Will create a PR for 1.4 branch. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"save" => "safe"
|
LGTM except for several minor stuff. |
|
test this please. |
|
Test build #35607 has finished for PR 6964 at commit
|
|
Test build #35617 has finished for PR 6964 at commit
|
|
test this please |
|
Typo in the PR title, should be [SPARK-8578] |
|
Changed. Thanks! |
|
Test build #35625 has finished for PR 6964 at commit
|
|
Test build #35639 has finished for PR 6964 at commit
|
|
Thanks for reviewing it! I am merging it to master. |
…ppending data (branch 1.4) This is #6964 for branch 1.4. Author: Yin Huai <[email protected]> Closes #6966 from yhuai/SPARK-8578-branch-1.4 and squashes the following commits: 9c3947b [Yin Huai] Do not use a custom output commiter when appendiing data.
…ppending data (branch 1.4) This is apache#6964 for branch 1.4. Author: Yin Huai <[email protected]> Closes apache#6966 from yhuai/SPARK-8578-branch-1.4 and squashes the following commits: 9c3947b [Yin Huai] Do not use a custom output commiter when appendiing data.
|
@JoshRosen It's a little bit weird that this PR still appears in the PR dashboard even it's merged and closed. |
https://issues.apache.org/jira/browse/SPARK-8578
It is not very safe to use a custom output committer when append data to an existing dir. This changes adds the logic to check if we are appending data, and if so, we use the output committer associated with the file output format.