-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-17513][SQL] Make StreamExecution garbage-collect its metadata #15126
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
[SPARK-17513] [STREAMING] [SQL] Make StreamExecution garbage-collect its metadata
|
@petermaxlee why are we not making this change in #15067 ? looks like you forked someone else's PR while it was active. |
|
Test build #65528 has finished for PR 15126 at commit
|
|
Test build #3277 has finished for PR 15126 at commit
|
|
Test build #65527 has finished for PR 15126 at commit
|
|
Since @frreiss hasn't updated the pr yet, I'm going to merge this one and assign the jira ticket to Fred. |
|
Merging in master/2.0. |
## What changes were proposed in this pull request? This PR modifies StreamExecution such that it discards metadata for batches that have already been fully processed. I used the purge method that was added as part of SPARK-17235. This is based on work by frreiss in #15067, but fixed the test case along with some typos. ## How was this patch tested? A new test case in StreamingQuerySuite. The test case would fail without the changes in this pull request. Author: petermaxlee <[email protected]> Author: frreiss <[email protected]> Closes #15126 from petermaxlee/SPARK-17513. (cherry picked from commit be9d57f) Signed-off-by: Reynold Xin <[email protected]>
|
This PR breaks the master branch, I have reverted it, @petermaxlee can you resend and fix the test? thanks! |
## What changes were proposed in this pull request? This PR modifies StreamExecution such that it discards metadata for batches that have already been fully processed. I used the purge method that was added as part of SPARK-17235. This is based on work by frreiss in apache#15067, but fixed the test case along with some typos. ## How was this patch tested? A new test case in StreamingQuerySuite. The test case would fail without the changes in this pull request. Author: petermaxlee <[email protected]> Author: frreiss <[email protected]> Closes apache#15126 from petermaxlee/SPARK-17513.
What changes were proposed in this pull request?
This PR modifies StreamExecution such that it discards metadata for batches that have already been fully processed. I used the purge method that was added as part of SPARK-17235.
This is based on work by @frreiss in #15067, but fixed the test case along with some typos.
How was this patch tested?
A new test case in StreamingQuerySuite. The test case would fail without the changes in this pull request.