Demo T5 model on sentiment classification and translation #1872

pmabbo13 · 2022-08-03T22:31:06Z

Description

Update the T5 tutorial to demonstrate how to use the model for sentiment classification on the IMDB dataset and English-to-German translation on the Multi30k dataset.

Process

The IMDB dataset was loaded and processed so that the labels neg and pos were changed to negative and positive, matching the labels the model generates. The input texts were also prepended with the prefix sst2 sentence to indicate that the task is sentiment classification (this is the prefix T5 was trained on for this task).

The Multi30k dataset was loaded and processed so that the English text was prepended with the prefix translate English to German to indicate the task to the model.
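The two preprocessing steps described above can be sketched in plain Python. This is an illustration only, not the tutorial's actual code: the function names, the in-memory sample data, and the (label, text) and (English, German) tuple layouts are assumptions, and the trailing ": " on each prefix follows the usual T5 convention rather than anything stated in this description.

```python
from typing import Iterable, Iterator, Tuple

# Assumed mapping from the dataset's labels to the words T5 generates.
LABEL_MAP = {"neg": "negative", "pos": "positive"}

# Task prefixes; the trailing ": " is an assumption based on T5 convention.
SENTIMENT_PREFIX = "sst2 sentence: "
TRANSLATION_PREFIX = "translate English to German: "


def prepare_imdb(samples: Iterable[Tuple[str, str]]) -> Iterator[Tuple[str, str]]:
    """Yield (prefixed_text, full_label) from assumed (label, text) pairs."""
    for label, text in samples:
        yield SENTIMENT_PREFIX + text, LABEL_MAP[label]


def prepare_multi30k(pairs: Iterable[Tuple[str, str]]) -> Iterator[Tuple[str, str]]:
    """Yield (prefixed_english, german) from assumed (english, german) pairs."""
    for english, german in pairs:
        yield TRANSLATION_PREFIX + english, german


# Hypothetical samples standing in for the real datasets.
imdb_ready = list(prepare_imdb([("pos", "A great film."), ("neg", "Dull and slow.")]))
multi30k_ready = list(prepare_multi30k([("A man is walking.", "Ein Mann geht.")]))
```

Keeping the German sentence unchanged in the second function leaves it available as a reference when inspecting the generated translations.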

The same logic used to generate the summarization sequences from CNNDM text was also used to generate the outputs for the IMDB and Multi30k texts. Note that for sentiment classification we use beam_size=1, since we are looking for the single most probable label, which should always be one word.
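To illustrate why beam_size=1 suits a one-word label, here is a minimal, self-contained beam search over a toy next-token distribution. It is a sketch under stated assumptions, not the tutorial's generation code: beam_search, toy_step, the token strings, and the probabilities are all hypothetical. With a beam width of 1 the search keeps only the best continuation at each step, which is greedy decoding.

```python
import math
from typing import Callable, Dict, List, Tuple

Hypothesis = Tuple[List[str], float]  # (tokens so far, summed log-probability)


def beam_search(
    step_fn: Callable[[List[str]], Dict[str, float]],
    bos: str,
    eos: str,
    beam_size: int,
    max_len: int,
) -> List[str]:
    """Return the highest-scoring token sequence found by the search."""
    beams: List[Hypothesis] = [([bos], 0.0)]
    finished: List[Hypothesis] = []
    for _ in range(max_len):
        # Expand every live hypothesis by every possible next token.
        candidates: List[Hypothesis] = []
        for tokens, score in beams:
            for tok, prob in step_fn(tokens).items():
                candidates.append((tokens + [tok], score + math.log(prob)))
        # Keep only the top `beam_size` candidates.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for tokens, score in candidates[:beam_size]:
            if tokens[-1] == eos:
                finished.append((tokens, score))
            else:
                beams.append((tokens, score))
        if not beams:
            break
    finished.extend(beams)  # include hypotheses cut off at max_len
    return max(finished, key=lambda c: c[1])[0]


def toy_step(tokens: List[str]) -> Dict[str, float]:
    """Hypothetical model: emit one label word, then end the sequence."""
    if len(tokens) == 1:
        return {"positive": 0.7, "negative": 0.3}
    return {"</s>": 1.0}


result = beam_search(toy_step, bos="<pad>", eos="</s>", beam_size=1, max_len=5)
predicted_label = result[1]
```

For summarization and translation, where the output is a longer sequence, a wider beam can keep alternatives alive that a greedy search would discard early.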

Also, given that the tutorial will now demo the T5 model on 3 tasks, I've changed the title to "T5-Base Model for Summarization, Sentiment Classification, and Translation", and the file name from cnndm_summarization.py to t5_demo.py.

Testing

Run BUILD_GALLERY=1 make 'SPHINXOPTS=-W' html in docs and review rendered document in docs/build/html/tutorials/t5_demo.html

parmeet

Thanks @pmabbo13! Overall looks great :).

examples/tutorials/t5_demo.py

Nayef211

LGTM as well. Can we just check why the sentiment output looks different from the summarization and translation output in the generated doc?

examples/tutorials/t5_demo.py

demo t5 model on sentiment classification and translation

5be85f3

facebook-github-bot added the cla signed label Aug 3, 2022

pmabbo13 added 3 commits August 3, 2022 18:32

renaming tutorial file

0bb1c1c

update source/index.rst

aa25e6c

correct title format

bb5f192

pmabbo13 mentioned this pull request Aug 4, 2022

Add T5 Model and Demo on Text Summarization using CNNDM Dataset #1800

Closed

25 tasks

pmabbo13 requested review from Nayef211, abhinavarora and parmeet August 4, 2022 13:26

parmeet approved these changes Aug 4, 2022

Nayef211 approved these changes Aug 4, 2022

pmabbo13 added 19 commits August 4, 2022 10:30

correct description for generate translations section

31696a7

specifying batch_size variable names

0b3c95e

fixing format issue with sentiment output

6538f6f

renaming tutorial and removing hard-coded outputs

9b11269

update index.rst

bd129b7

adding torchdata dependency for docs build

e50a5a0

torchdata nightly build dependency

67e2e50

torchdata nightly try again

2f43ca2

torchdata nightly try again 2

bdc73a5

try replacing extra-index-url with index-url

b431da9

add extra-index-url to be pypi

5f2cfbb

adding torchdata dependency to config.yml

b133e1f

updating config.yml.in

9e72d3e

Revert "updating config.yml.in"

015bd75

This reverts commit 9e72d3e.

Revert "adding torchdata dependency to config.yml"

0565092

This reverts commit b133e1f.

Revert "add extra-index-url to be pypi"

5c9e33c

This reverts commit 5f2cfbb.

Revert "try replacing extra-index-url with index-url"

bd7b976

This reverts commit b431da9.

Revert "torchdata nightly try again 2"

92ae288

This reverts commit bdc73a5.

Revert "torchdata nightly try again"

36a7f00

This reverts commit 2f43ca2.

pmabbo13 added 6 commits August 5, 2022 17:36

Revert "torchdata nightly build dependency"

8758958

This reverts commit 67e2e50.

Revert "adding torchdata dependency for docs build"

fd4c6d7

This reverts commit e50a5a0.

Revert "update index.rst"

669b211

This reverts commit bd129b7.

Revert "renaming tutorial and removing hard-coded outputs"

35527a4

This reverts commit 9b11269.

correcting typos

8bd65dd

Merge branch 'main' into feature/t5-demo-extended

a0d975a

pmabbo13 merged commit e1b6984 into pytorch:main Aug 8, 2022

pmabbo13 deleted the feature/t5-demo-extended branch August 8, 2022 16:16
