-
Notifications
You must be signed in to change notification settings - Fork 3.6k
CI: debug HPU flow #13419
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
CI: debug HPU flow #13419
Changes from all commits
Commits
Show all changes
33 commits
Select commit
Hold shift + click to select a range
90433fc
pip list
Borda 681c0e7
ver
Borda 3d625c9
ver
Borda a504900
ver
Borda e2ea4e4
prune
Borda 597f049
user
Borda 65f16a7
find
Borda 48fa044
ls
Borda fcb6021
pypi
Borda ac58035
.
Borda f6e74c1
force
Borda 2d88140
uninstall
Borda 76f6cfb
Update the hpu-tests.yml to pull docker from vault
raoakarsha 8ff238c
runner
Borda 8a9859f
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] a7968ab
prune
Borda c053c03
sudo
Borda 434bc8a
sudo
Borda a48920e
sudo
Borda 9b6205c
Merge branch 'ci/hpu2' of https://github.com/PyTorchLightning/pytorch…
Borda c528248
fire & sudo
Borda f0d719b
habana-gaudi-hpus
Borda 19c6b68
Merge branch 'master' into ci/hpu2
Borda 4fd076f
collision
Borda dfa5720
Merge branch 'master' into ci/hpu2
Borda 318ca08
Merge branch 'master' into ci/hpu2
Borda 6c731e0
ci
Borda 8d60792
Merge branch 'master' into ci/hpu2
Borda b13f6bb
Merge branch 'master' into ci/hpu2
Borda 3630c59
params
Borda 7585207
Merge branch 'master' into ci/hpu2
Borda 3e5da23
Check the driver status on gaudi server (#13718)
raoakarsha 89a408e
gk
Borda File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,24 +1,65 @@ | ||
| # Run command to build: | ||
| # gaudi_ver=$(curl -s "https://vault.habana.ai/artifactory/gaudi-docker/" | sed -n 's/.*href="\([^"]*\).*/\1/p' | tail -2 | head -1 | sed "s/\///1") | ||
| # pytorch_install_ver=$(curl -s "https://vault.habana.ai/artifactory/gaudi-docker/$gaudi_ver/ubuntu20.04/habanalabs/" | sed -n 's/.*href="\([^"]*\).*/\1/p'| sed "s/\///1" | grep pytorch-installer) | ||
| # pytorch_install_ver=${pytorch_install_ver/"pytorch-installer-"/""} | ||
| # docker build -t gaudi-docker-agent:latest \ | ||
| # --build-arg GAUDI_VERSION=$gaudi_ver \ | ||
| # --build-arg PYTORCH_INSTALLER_VERSION=$pytorch_install_ver \ | ||
| # -f Dockerfile . | ||
| # Run command: | ||
| # docker run --privileged \ | ||
| # -v /dev:/dev \ | ||
| # -e AZP_URL="https://dev.azure.com/ORGANIZATION/" \ | ||
| # -e AZP_TOKEN="XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX" \ | ||
| # -e AZP_AGENT_NAME="hpu1" \ | ||
| # -e AZP_POOL="intel-hpus" \ | ||
| # gaudi-docker-agent:latest | ||
|
|
||
| ARG DIST="latest" | ||
| ARG GAUDI_VERSION="1.5.0" | ||
| ARG PYTORCH_VERSION="1.11.0" | ||
|
|
||
| FROM vault.habana.ai/gaudi-docker/${GAUDI_VERSION}/ubuntu20.04/habanalabs/pytorch-installer-${PYTORCH_VERSION}:${DIST} | ||
| ARG PYTORCH_INSTALLER_VERSION="1.11.0" | ||
| FROM vault.habana.ai/gaudi-docker/${GAUDI_VERSION}/ubuntu20.04/habanalabs/pytorch-installer-${PYTORCH_INSTALLER_VERSION}:${DIST} | ||
|
|
||
| LABEL maintainer="https://vault.habana.ai/" | ||
| # update the base packages and add a non-sudo user | ||
| RUN \ | ||
| apt-get update -y && \ | ||
| apt-get upgrade -y && \ | ||
| useradd -m docker | ||
|
|
||
| RUN echo "ALL ALL=(ALL) NOPASSWD: ALL" >> /etc/sudoers | ||
|
|
||
| WORKDIR /azp | ||
|
|
||
| COPY ./dockers/ci-runner-hpu/start.sh /usr/local/bin/ | ||
| # To make it easier for build and release pipelines to run apt-get, | ||
| # configure apt to not require confirmation (assume the -y argument by default) | ||
| ENV DEBIAN_FRONTEND=noninteractive | ||
| RUN echo "APT::Get::Assume-Yes \"true\";" > /etc/apt/apt.conf.d/90assumeyes | ||
|
|
||
| RUN chmod +x /usr/local/bin/start.sh | ||
| RUN apt-get update --fix-missing && \ | ||
| apt-get install -y --no-install-recommends \ | ||
| ca-certificates \ | ||
| build-essential \ | ||
| curl \ | ||
| jq \ | ||
| git \ | ||
| iputils-ping \ | ||
| libcurl4 \ | ||
| libunwind8 \ | ||
| netcat \ | ||
| libssl1.0 \ | ||
| libssl-dev \ | ||
| libffi-dev \ | ||
| python3 \ | ||
| python3-venv \ | ||
| python3-dev \ | ||
| python3-pip | ||
|
|
||
| RUN curl -fsSL https://get.docker.com -o get-docker.sh && \ | ||
| sh get-docker.sh && \ | ||
| rm get-docker.sh | ||
|
|
||
| #RUN docker --help | ||
| RUN pip uninstall pytorch-lightning -y | ||
|
|
||
| WORKDIR /azp | ||
|
|
||
| COPY ./dockers/ci-runner-hpu/start.sh /usr/local/bin/ | ||
| RUN chmod +x /usr/local/bin/start.sh | ||
|
|
||
| ENTRYPOINT ["/usr/local/bin/start.sh"] | ||
| CMD ["bash"] |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.