@HyukjinKwon
Member

What changes were proposed in this pull request?

This PR proposes to explicitly trigger GC between TPC-DS queries.
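
As a minimal sketch of the idea, and not the actual patch, a test loop could request a collection after each query. The `GcBetweenQueries` class, the `runAll` helper, and the `runQuery` function are hypothetical names used only for illustration; they are not Spark APIs. `System.gc()` is only a hint to the JVM, so a collection is not guaranteed to happen.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

// Hypothetical illustration; none of these names come from Spark.
class GcBetweenQueries {

    // Runs every query in order and asks the JVM to collect garbage
    // after each one, before the next query starts.
    static <R> List<R> runAll(List<String> queries, Function<String, R> runQuery) {
        List<R> results = new ArrayList<>();
        for (String query : queries) {
            results.add(runQuery.apply(query));
            // Only a request: the JVM is free to ignore it.
            System.gc();
        }
        return results;
    }

    public static void main(String[] args) {
        // Stand-in "queries" and a stand-in runner that returns the name length.
        List<String> queries = List.of("q1", "q2", "q3");
        List<Integer> results = runAll(queries, String::length);
        System.out.println(results);
    }
}
```

The trade-off is extra pause time per query in exchange for a lower peak heap, which is why it suits a memory-constrained CI build more than a production workload.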

Why are the changes needed?

GitHub Actions machines have limited memory, so the build easily dies in the middle of testing. This is more of a band-aid workaround than a real fix.

Does this PR introduce any user-facing change?

No, dev-only.

How was this patch tested?

It has been tested in the fork of Spark in the company I work for.

@github-actions github-actions bot added the SQL label Nov 18, 2021
@HyukjinKwon HyukjinKwon requested review from cloud-fan, dongjoon-hyun and maropu and removed request for maropu November 18, 2021 02:49
Member

@dongjoon-hyun dongjoon-hyun left a comment

+1, LGTM although GC is not guaranteed. Thank you.

@HyukjinKwon
Member Author

Yeah, this is a hacky and ugly workaround, so I haven't pushed it to OSS so far, but at least it works.

@SparkQA

SparkQA commented Nov 18, 2021

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49837/

@HyukjinKwon
Member Author

HyukjinKwon commented Nov 18, 2021

Merged to master.

Let's see how it goes.

@SparkQA

SparkQA commented Nov 18, 2021

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49837/

@SparkQA

SparkQA commented Nov 18, 2021

Test build #145365 has finished for PR 34641 at commit dad0349.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

HyukjinKwon added a commit that referenced this pull request Nov 25, 2021
### What changes were proposed in this pull request?

This is kind of a followup for #33510 and #34641. This PR proposes to split the TPC-DS build in GitHub Actions.

### Why are the changes needed?

Running these queries easily causes out-of-memory errors on GitHub Actions machines and makes the build flaky. We should deflake it.

### Does this PR introduce _any_ user-facing change?

No, dev-only.

### How was this patch tested?

GitHub Actions in this PR should test it out.

Closes #34698 from HyukjinKwon/split-tpcds.

Authored-by: Hyukjin Kwon <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
@HyukjinKwon HyukjinKwon deleted the SPARK-37368 branch January 4, 2022 00:52