Skip to content

cherry pick 3700 to 2.8 release: Broadcast removal #3747

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 41 commits into
base: release/2.8
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
41 commits
Select commit Hold shift + click to select a range
eeb956d
fix the broken CC0 image link (#3635)
lanluo-nvidia Jun 30, 2025
b3b5f45
upgrade torch_tensorrt version from 2.8.0.dev to 2.9.0.dev (#3639)
lanluo-nvidia Jun 30, 2025
21cfb78
Temporary fix to workaround the mutable decomposition error. (#3636)
lanluo-nvidia Jun 30, 2025
99ffe1a
Fix dynamo core test failure on Windows (#3642)
HolyWu Jul 2, 2025
fb66d21
Closed the perf gap of resnet and enabled refit (#3629)
cehongwang Jul 2, 2025
cf65477
feat: Refactor LLM model zoo and add KV cache support (#3527)
peri044 Jul 3, 2025
85637b9
docs: [Automated] Regenerating documenation for cf65477
Jul 3, 2025
1d597b3
adding rotary embedding example, with graph rewrite for complex subgr…
apbose Jul 3, 2025
001fe31
docs: [Automated] Regenerating documenation for 1d597b3
Jul 3, 2025
6cf72ab
feat: Add bf16 support to cast converter (#3643)
peri044 Jul 4, 2025
328da32
fix: replace add_identity by add_cast for type cast (#3563)
junstar92 Jul 4, 2025
bbff1fd
docs: [Automated] Regenerating documenation for 328da32
Jul 4, 2025
bb6598d
Refit debug patch (#3620)
cehongwang Jul 4, 2025
2a7121e
docs: [Automated] Regenerating documenation for bb6598d
Jul 4, 2025
0621cda
fix compiler cl not found error in windows (#3660)
lanluo-nvidia Jul 7, 2025
b65bd99
slice scatter support for dynamic cases (#3513)
apbose Jul 7, 2025
0853838
docs: [Automated] Regenerating documenation for b65bd99
Jul 7, 2025
c905cff
fix the int8 quantization failure error (#3663)
lanluo-nvidia Jul 10, 2025
a147c9b
chore(deps): bump transformers from 4.48.0 to 4.52.1 in /tests/module…
dependabot[bot] Jul 10, 2025
c6e4918
chore(deps): bump transformers from 4.50.0 to 4.51.0 in /examples/dyn…
dependabot[bot] Jul 10, 2025
48c07bc
chore(deps): bump transformers from 4.49.0 to 4.51.0 in /tests/py (#3…
dependabot[bot] Jul 10, 2025
def9f6e
docs: [Automated] Regenerating documenation for 48c07bc
Jul 10, 2025
b62a2b7
remove tensorrt as build dependency (#3681)
lanluo-nvidia Jul 14, 2025
307b3fe
docs: [Automated] Regenerating documenation for b62a2b7
Jul 14, 2025
119563b
disable jetpack build for now (#3685)
lanluo-nvidia Jul 15, 2025
e70ae8f
Fixed the CI problem (#3680)
cehongwang Jul 15, 2025
fde92a3
docs: [Automated] Regenerating documenation for e70ae8f
Jul 15, 2025
228ee4d
fix windows build failure: add /utf-8 (#3684)
lanluo-nvidia Jul 16, 2025
e788be5
upgrade tensorrt from 10.11 to 10.12 (#3686)
lanluo-nvidia Jul 16, 2025
75b7774
docs: [Automated] Regenerating documenation for e788be5
Jul 17, 2025
d45ce00
Add Flux fp4 support (#3689)
lanluo-nvidia Jul 18, 2025
63052a5
fix typo
lanluo-nvidia Jul 18, 2025
ac17aba
docs: [Automated] Regenerating documenation for 63052a5
Jul 18, 2025
92a6908
feat: revert linear converter (#3703)
zewenli98 Jul 22, 2025
89b629a
docs: [Automated] Regenerating documenation for 92a6908
Jul 22, 2025
b8c8131
skipped unnecessary broadcast
Jul 21, 2025
af46f1e
Fixed SDPA perf gap
cehongwang Jul 22, 2025
47f78ed
Added comments
cehongwang Jul 23, 2025
5937301
Added fp32 matmul around matmul node
cehongwang Jul 23, 2025
b7d599a
Added updated benchmark script
cehongwang Jul 24, 2025
2ff6387
Fixed a CI issue
cehongwang Jul 29, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
1 change: 1 addition & 0 deletions .bazelrc
Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,7 @@ build:linux --cxxopt="-fdiagnostics-color=always"

build:windows --cxxopt="/GS-" --cxxopt="/std:c++17" --cxxopt="/permissive-"
build:windows --cxxopt="/wd4244" --cxxopt="/wd4267" --cxxopt="/wd4819"
build:windows --cxxopt="/utf-8"
build:windows --features=windows_export_all_symbols

build:python --define=target_lang=python
Expand Down
8 changes: 8 additions & 0 deletions .github/scripts/generate-tensorrt-test-matrix.py
Original file line number Diff line number Diff line change
Expand Up @@ -52,6 +52,10 @@
"urls": "https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.11.0/zip/TensorRT-10.11.0.33.Windows.win10.cuda-12.9.zip",
"strip_prefix": "TensorRT-10.11.0.33",
},
"10.12.0": {
"urls": "https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.12.0/zip/TensorRT-10.12.0.36.Windows.win10.cuda-12.9.zip",
"strip_prefix": "TensorRT-10.12.0.36",
},
},
"linux": {
"10.3.0": {
Expand All @@ -78,6 +82,10 @@
"urls": "https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.11.0/tars/TensorRT-10.11.0.33.Linux.x86_64-gnu.cuda-12.9.tar.gz",
"strip_prefix": "TensorRT-10.11.0.33",
},
"10.12.0": {
"urls": "https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.12.0/tars/TensorRT-10.12.0.36.Linux.x86_64-gnu.cuda-12.9.tar.gz",
"strip_prefix": "TensorRT-10.12.0.36",
},
},
}

Expand Down
21 changes: 11 additions & 10 deletions .github/workflows/build-test-linux-aarch64-jetpack.yml
Original file line number Diff line number Diff line change
@@ -1,16 +1,17 @@
name: Build and test Linux aarch64 wheels for Jetpack

on:
pull_request:
push:
branches:
- main
- nightly
- release/*
tags:
# NOTE: Binary build pipelines should only get triggered on release candidate builds
# Release candidate tags look like: v1.11.0-rc1
- v[0-9]+.[0-9]+.[0-9]+-rc[0-9]+
# TODO: Uncomment this when we have a stable release
# pull_request:
# push:
# branches:
# - main
# - nightly
# - release/*
# tags:
# # NOTE: Binary build pipelines should only get triggered on release candidate builds
# # Release candidate tags look like: v1.11.0-rc1
# - v[0-9]+.[0-9]+.[0-9]+-rc[0-9]+
workflow_dispatch:

jobs:
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/build-test-tensorrt-windows.yml
Original file line number Diff line number Diff line change
Expand Up @@ -254,9 +254,9 @@ jobs:
pushd .
cd tests/py
cd dynamo
python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_runtime_test_results.xml --ignore runtime/test_002_cudagraphs_py.py --ignore runtime/test_002_cudagraphs_cpp.py runtime/
../../../packaging/vc_env_helper.bat python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_runtime_test_results.xml --ignore runtime/test_002_cudagraphs_py.py --ignore runtime/test_002_cudagraphs_cpp.py runtime/
python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_partitioning_test_results.xml partitioning/
python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_lowering_test_results.xml lowering/
../../../packaging/vc_env_helper.bat python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_lowering_test_results.xml lowering/
popd

tests-py-dynamo-cudagraphs:
Expand Down
6 changes: 3 additions & 3 deletions .github/workflows/build-test-windows.yml
Original file line number Diff line number Diff line change
Expand Up @@ -230,7 +230,7 @@ jobs:
cd dynamo
python -m pytest -ra -n 10 --junitxml=${RUNNER_TEST_RESULTS_DIR}/torch_compile_be_test_results.xml backend/
python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/torch_complete_be_e2e_test_results.xml --ir torch_compile models/test_models.py
python -m pytest -ra --junitxml=${RUNNER_TEST_RESULTS_DIR}/torch_compile_dyn_models_export.xml --ir torch_compile models/test_dyn_models.py
../../../packaging/vc_env_helper.bat python -m pytest -ra --junitxml=${RUNNER_TEST_RESULTS_DIR}/torch_compile_dyn_models_export.xml --ir torch_compile models/test_dyn_models.py
popd

tests-py-dynamo-core:
Expand Down Expand Up @@ -258,9 +258,9 @@ jobs:
pushd .
cd tests/py
cd dynamo
python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_runtime_test_results.xml --ignore runtime/test_002_cudagraphs_py.py --ignore runtime/test_002_cudagraphs_cpp.py runtime/
../../../packaging/vc_env_helper.bat python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_runtime_test_results.xml --ignore runtime/test_002_cudagraphs_py.py --ignore runtime/test_002_cudagraphs_cpp.py runtime/
python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_partitioning_test_results.xml partitioning/
python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_lowering_test_results.xml lowering/
../../../packaging/vc_env_helper.bat python -m pytest -ra -n 4 --junitxml=${RUNNER_TEST_RESULTS_DIR}/tests_py_dynamo_core_lowering_test_results.xml lowering/
popd

tests-py-dynamo-cudagraphs:
Expand Down
12 changes: 6 additions & 6 deletions .github/workflows/docgen.yml
Original file line number Diff line number Diff line change
Expand Up @@ -14,12 +14,12 @@ jobs:
if: ${{ ! contains(github.actor, 'pytorchbot') }}
environment: pytorchbot-env
container:
image: docker.io/pytorch/manylinux2_28-builder:cuda12.8
image: docker.io/pytorch/manylinux2_28-builder:cuda12.9
options: --gpus all
env:
CUDA_HOME: /usr/local/cuda-12.8
VERSION_SUFFIX: cu128
CU_VERSION: cu128
CUDA_HOME: /usr/local/cuda-12.9
VERSION_SUFFIX: cu129
CU_VERSION: cu129
CHANNEL: nightly
CI_BUILD: 1
steps:
Expand All @@ -35,14 +35,14 @@ jobs:
- name: Install base deps
run: |
python3 -m pip install pip --upgrade
python3 -m pip install pyyaml numpy torch --pre --extra-index-url https://download.pytorch.org/whl/nightly/cu128
python3 -m pip install pyyaml numpy torch --pre --extra-index-url https://download.pytorch.org/whl/nightly/cu129
./packaging/pre_build_script.sh
- name: Get HEAD SHA
id: vars
run: echo "sha=$(git rev-parse --short HEAD)" >> $GITHUB_OUTPUT
- name: Build Python Package
run: |
python3 -m pip install --pre . --extra-index-url https://download.pytorch.org/whl/nightly/cu128
python3 -m pip install --pre . --extra-index-url https://download.pytorch.org/whl/nightly/cu129
- name: Generate New Docs
run: |
cd docsrc
Expand Down
14 changes: 7 additions & 7 deletions MODULE.bazel
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
module(
name = "torch_tensorrt",
version = "2.8.0a0",
version = "2.9.0a0",
repo_name = "org_pytorch_tensorrt",
)

Expand Down Expand Up @@ -103,18 +103,18 @@ http_archive(
http_archive(
name = "tensorrt",
build_file = "@//third_party/tensorrt/archive:BUILD",
strip_prefix = "TensorRT-10.11.0.33",
strip_prefix = "TensorRT-10.12.0.36",
urls = [
"https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.11.0/tars/TensorRT-10.11.0.33.Linux.x86_64-gnu.cuda-12.9.tar.gz",
"https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.12.0/tars/TensorRT-10.12.0.36.Linux.x86_64-gnu.cuda-12.9.tar.gz",
],
)

http_archive(
name = "tensorrt_sbsa",
build_file = "@//third_party/tensorrt/archive:BUILD",
strip_prefix = "TensorRT-10.11.0.33",
strip_prefix = "TensorRT-10.12.0.36",
urls = [
"https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.11.0/tars/TensorRT-10.11.0.33.Linux.aarch64-gnu.cuda-12.9.tar.gz",
"https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.12.0/tars/TensorRT-10.12.0.36.Linux.aarch64-gnu.cuda-12.9.tar.gz",
],
)

Expand All @@ -130,9 +130,9 @@ http_archive(
http_archive(
name = "tensorrt_win",
build_file = "@//third_party/tensorrt/archive:BUILD",
strip_prefix = "TensorRT-10.11.0.33",
strip_prefix = "TensorRT-10.12.0.36",
urls = [
"https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.11.0/zip/TensorRT-10.11.0.33.Windows.win10.cuda-12.9.zip",
"https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.12.0/zip/TensorRT-10.12.0.36.Windows.win10.cuda-12.9.zip",
],
)

Expand Down
2 changes: 1 addition & 1 deletion cpp/include/torch_tensorrt/macros.h
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@
#define STR(x) XSTR(x)

#define TORCH_TENSORRT_MAJOR_VERSION 2
#define TORCH_TENSORRT_MINOR_VERSION 6
#define TORCH_TENSORRT_MINOR_VERSION 9
#define TORCH_TENSORRT_PATCH_VERSION 0
#define TORCH_TENSORRT_VERSION \
STR(TORCH_TENSORRT_MAJOR_VERSION) \
Expand Down
2 changes: 1 addition & 1 deletion dev_dep_versions.yml
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@
__cuda_version__: "12.8"
__tensorrt_version__: "10.11.0"
__tensorrt_version__: "10.12.0"
4 changes: 2 additions & 2 deletions docker/Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -2,9 +2,9 @@

# Base image starts with CUDA
#TODO: cuda version
ARG BASE_IMG=nvidia/cuda:12.8.0-devel-ubuntu22.04
ARG BASE_IMG=nvidia/cuda:12.9.0-devel-ubuntu22.04
FROM ${BASE_IMG} as base
ENV BASE_IMG=nvidia/cuda:12.8.0-devel-ubuntu22.04
ENV BASE_IMG=nvidia/cuda:12.9.0-devel-ubuntu22.04

ARG TENSORRT_VERSION
ENV TENSORRT_VERSION=${TENSORRT_VERSION}
Expand Down
4 changes: 2 additions & 2 deletions docker/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -15,14 +15,14 @@

### Instructions

- The example below uses TensorRT 10.11.0.33
- The example below uses TensorRT 10.12.0.36
- See <a href="https://github.com/pytorch/TensorRT#dependencies">dependencies</a> for a list of current default dependencies.

> From root of Torch-TensorRT repo

Build:
```
DOCKER_BUILDKIT=1 docker build --build-arg TENSORRT_VERSION=10.11.0 -f docker/Dockerfile -t torch_tensorrt:latest .
DOCKER_BUILDKIT=1 docker build --build-arg TENSORRT_VERSION=10.12.0 -f docker/Dockerfile -t torch_tensorrt:latest .
```

Run:
Expand Down
2 changes: 1 addition & 1 deletion docker/dist-build.sh
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ set -x

TOP_DIR=$(cd $(dirname $0); pwd)/..

BUILD_CMD="python -m pip wheel . --extra-index-url https://download.pytorch.org/whl/nightly/cu128 -w dist"
BUILD_CMD="python -m pip wheel . --extra-index-url https://download.pytorch.org/whl/nightly/cu129 -w dist"

# TensorRT restricts our pip version
cd ${TOP_DIR} \
Expand Down
9 changes: 4 additions & 5 deletions docs/_cpp_api/classtorch__tensorrt_1_1DataType.html
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@

<meta name="viewport" content="width=device-width, initial-scale=1.0">

<title>Class DataType &mdash; Torch-TensorRT v2.8.0.dev0+ee32da0 documentation</title>
<title>Class DataType &mdash; Torch-TensorRT v2.9.0.dev0+92a6908 documentation</title>



Expand Down Expand Up @@ -293,7 +293,7 @@


<div class="version">
v2.8.0.dev0+ee32da0
v2.9.0.dev0+92a6908
</div>


Expand Down Expand Up @@ -324,7 +324,7 @@
<p class="caption" role="heading"><span class="caption-text">Getting Started</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/installation.html">Installation</a></li>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/jetpack.html">Overview</a></li>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/jetpack.html">Torch-TensorRT in JetPack</a></li>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/quick_start.html">Quick Start</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">User Guide</span></p>
Expand Down Expand Up @@ -376,9 +376,8 @@
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_resnet_example.html">Compiling ResNet with dynamic shapes using the <cite>torch.compile</cite> backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_transformers_example.html">Compiling BERT using the <cite>torch.compile</cite> backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_stable_diffusion.html">Compiling Stable Diffusion model using the <cite>torch.compile</cite> backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/compile_hf_models.html">Compiling LLM models from Huggingface</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_gpt2.html">Compiling GPT2 using the Torch-TensorRT <code class="docutils literal notranslate"><span class="pre">torch.compile</span></code> frontend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_gpt2.html">Compiling GPT2 using the dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_llama2.html">Compiling Llama2 using the dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_sam2.html">Compiling SAM2 using the dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_flux_dev.html">Compiling FLUX.1-dev model using the Torch-TensorRT dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/notebooks.html">Legacy notebooks</a></li>
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@

<meta name="viewport" content="width=device-width, initial-scale=1.0">

<title>Class Device::DeviceType &mdash; Torch-TensorRT v2.8.0.dev0+ee32da0 documentation</title>
<title>Class Device::DeviceType &mdash; Torch-TensorRT v2.9.0.dev0+92a6908 documentation</title>



Expand Down Expand Up @@ -293,7 +293,7 @@


<div class="version">
v2.8.0.dev0+ee32da0
v2.9.0.dev0+92a6908
</div>


Expand Down Expand Up @@ -324,7 +324,7 @@
<p class="caption" role="heading"><span class="caption-text">Getting Started</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/installation.html">Installation</a></li>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/jetpack.html">Overview</a></li>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/jetpack.html">Torch-TensorRT in JetPack</a></li>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/quick_start.html">Quick Start</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">User Guide</span></p>
Expand Down Expand Up @@ -376,9 +376,8 @@
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_resnet_example.html">Compiling ResNet with dynamic shapes using the <cite>torch.compile</cite> backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_transformers_example.html">Compiling BERT using the <cite>torch.compile</cite> backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_stable_diffusion.html">Compiling Stable Diffusion model using the <cite>torch.compile</cite> backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/compile_hf_models.html">Compiling LLM models from Huggingface</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_gpt2.html">Compiling GPT2 using the Torch-TensorRT <code class="docutils literal notranslate"><span class="pre">torch.compile</span></code> frontend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_gpt2.html">Compiling GPT2 using the dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_llama2.html">Compiling Llama2 using the dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_sam2.html">Compiling SAM2 using the dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_flux_dev.html">Compiling FLUX.1-dev model using the Torch-TensorRT dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/notebooks.html">Legacy notebooks</a></li>
Expand Down
9 changes: 4 additions & 5 deletions docs/_cpp_api/classtorch__tensorrt_1_1TensorFormat.html
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@

<meta name="viewport" content="width=device-width, initial-scale=1.0">

<title>Class TensorFormat &mdash; Torch-TensorRT v2.8.0.dev0+ee32da0 documentation</title>
<title>Class TensorFormat &mdash; Torch-TensorRT v2.9.0.dev0+92a6908 documentation</title>



Expand Down Expand Up @@ -293,7 +293,7 @@


<div class="version">
v2.8.0.dev0+ee32da0
v2.9.0.dev0+92a6908
</div>


Expand Down Expand Up @@ -324,7 +324,7 @@
<p class="caption" role="heading"><span class="caption-text">Getting Started</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/installation.html">Installation</a></li>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/jetpack.html">Overview</a></li>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/jetpack.html">Torch-TensorRT in JetPack</a></li>
<li class="toctree-l1"><a class="reference internal" href="../getting_started/quick_start.html">Quick Start</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">User Guide</span></p>
Expand Down Expand Up @@ -376,9 +376,8 @@
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_resnet_example.html">Compiling ResNet with dynamic shapes using the <cite>torch.compile</cite> backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_transformers_example.html">Compiling BERT using the <cite>torch.compile</cite> backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_stable_diffusion.html">Compiling Stable Diffusion model using the <cite>torch.compile</cite> backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/compile_hf_models.html">Compiling LLM models from Huggingface</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_compile_gpt2.html">Compiling GPT2 using the Torch-TensorRT <code class="docutils literal notranslate"><span class="pre">torch.compile</span></code> frontend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_gpt2.html">Compiling GPT2 using the dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_llama2.html">Compiling Llama2 using the dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_sam2.html">Compiling SAM2 using the dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/_rendered_examples/dynamo/torch_export_flux_dev.html">Compiling FLUX.1-dev model using the Torch-TensorRT dynamo backend</a></li>
<li class="toctree-l1"><a class="reference internal" href="../tutorials/notebooks.html">Legacy notebooks</a></li>
Expand Down
Loading
Loading