@seanpmorgan
Member

We run into GPU OOM errors if we allow more than one test job to run at a time. TF claims the entire memory of a card, so running multiple jobs against a single card causes errors. See:
https://source.cloud.google.com/results/invocations/457625fa-7589-40ca-9f59-c097bac89322/targets/tensorflow_addons%2Fubuntu%2Fgpu%2Fpy2%2Fpresubmit/log
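
For context on the "TF claims the entire memory" behavior: the sketch below shows TensorFlow's opt-in memory growth setting, which asks TF to allocate GPU memory on demand instead of reserving it up front. This is not what this PR changes (the PR limits the number of jobs), and the helper name `enable_gpu_memory_growth` is hypothetical. It assumes a TensorFlow version that exposes `tf.config.experimental`.

```python
def enable_gpu_memory_growth():
    """Ask TensorFlow to grow GPU memory on demand.

    Hypothetical helper for illustration; returns the number of GPUs configured.
    """
    try:
        import tensorflow as tf
    except ImportError:
        # TensorFlow is not installed, so there is nothing to configure.
        return 0

    gpus = tf.config.experimental.list_physical_devices("GPU")
    for gpu in gpus:
        # Must run before the GPU is initialized; otherwise TF raises RuntimeError.
        tf.config.experimental.set_memory_growth(gpu, True)
    return len(gpus)


if __name__ == "__main__":
    print(enable_gpu_memory_growth())
```

Memory growth alone does not guarantee that several test processes fit on one card, so capping the job count remains the simpler fix for CI.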

@seanpmorgan requested a review from a team as a code owner July 12, 2019 13:52
@seanpmorgan changed the title from "Limit GPU testing" to "Limit GPU testing jobs" Jul 12, 2019
@WindQAQ merged commit 98ae65b into tensorflow:master Jul 12, 2019
@seanpmorgan deleted the fix-limit-gpu-jobs branch July 12, 2019 16:52