Commit daa2205

[docs] fix table in fp16.mdx (#683)
1 parent 877bec8 commit daa2205

1 file changed: +10 -33 lines changed

docs/source/optimization/fp16.mdx

Lines changed: 10 additions & 33 deletions
@@ -14,39 +14,16 @@ specific language governing permissions and limitations under the License.
 
 We present some techniques and ideas to optimize 🤗 Diffusers _inference_ for memory or speed.
 
-<table>
-<tr>
-<td>
-<td>Latency
-<td>Speedup
-<tr>
-<tr>
-<td>original
-<td>9.50s
-<td>x1
-<tr>
-<tr>
-<td>cuDNN auto-tuner
-<td>9.37s
-<td>x1.01
-<tr>
-<td>autocast (fp16)
-<td>5.47s
-<td>x1.91
-<tr>
-<td>fp16
-<td>3.61s
-<td>x2.91
-<tr>
-<td>channels last
-<td>3.30s
-<td>x2.87
-<tr>
-<tr>
-<td>traced UNet
-<td>3.21s
-<td>x2.96
-</table>
+
+| | Latency | Speedup |
+|------------------|---------|---------|
+| original | 9.50s | x1 |
+| cuDNN auto-tuner | 9.37s | x1.01 |
+| autocast (fp16) | 5.47s | x1.91 |
+| fp16 | 3.61s | x2.91 |
+| channels last | 3.30s | x2.87 |
+| traced UNet | 3.21s | x2.96 |
+
 <em>obtained on NVIDIA TITAN RTX by generating a single image of size 512x512 from the prompt "a photo of an astronaut riding a horse on mars" with 50 DDIM steps.</em>
 
 ## Enable cuDNN auto-tuner
