Testing T5Model #1848

pmabbo13 · 2022-07-19T19:54:50Z

Description

Add integration tests for both the T5 model (encoder-only and encoder-decoder)

Process

Upload expected output for example input sequence, sourced from HuggingFace after passing in same input to corresponding HF model -> test/asset/t5.base.encoder.output.pt, test/asset/t5.base.output
Add integration tests to verify that T5_BASE_ENCODER and T5_BASE outputs match the references from HF
Test that bundler methods ( build_model(), get_model(), config() ) are working as expected, and returning the correct error messages when invalid input args are provided.

Testing

pytest test/prototype/integration_tests/test_models.py
pytest test/prototype/models/test_models.py

Follow-Up

Add testing for model training after LM head has been added (need LM head otherwise cannot compute cross-entropy loss) Prepare T5 Model for Language Generation #1862
Investigate if model is torch-scriptable. This is unclear at the moment because the pytorch transformer implementation indicates that transformers are not torch-scriptable if normalization happens first (which it does for T5), and when the query, key, and values are not equal (which is the case for transformer and t5 decoders). Make T5 model torchscriptable #1876

This reverts commit 9837f4a.

This reverts commit bea35bc.

This reverts commit d8fe63e.

…bbo13/text into feature/t5-integration-tests

Nayef211

LGTM! When we decide to move the model and the tests to the main folder, we should look into parameterizing the model and integration tests since all of the tests are almost exactly the same as the ones we have here and here.

Nayef211 · 2022-07-20T14:55:55Z

test/prototype/integration_tests/test_models.py

+
+    def test_t5_base_encoder_model(self):
+        expected_asset_name = "t5.base.encoder.output.pt"
+        model_input = torch.tensor([[1, 2, 3, 4, 5, 6], [7, 8, 9, 0, 0, 0]])


Just something to note, when we implement the transform for the model, we probably want to update the test to pass in an input string to the _t5_model method. The helper function will be responsible for applying the transform on the input string to get the tensor that can be passed into the model (code pointer). The T5Bundle class will also need to be updated to store the model transform as a member variable (code pointer).

sounds good, will keep this in mind for the next task!

parmeet

Overall LGTM! Thanks for adding the integration test.

test/prototype/models/test_models.py

* test bundler api * upload reference results to verify model correctness * test for model correctness against reference results * Revert "test for model correctness against reference results" This reverts commit 9837f4a. * Revert "upload reference results to verify model correctness" This reverts commit bea35bc. * Revert "test bundler api" This reverts commit d8fe63e. * test bundler api * test bundler api * upload reference results to verify model correctness * test for model correctness against reference results * nit correction * test bundler when model is encoder-only * correcting typo * remove redundant test for encoder-only

pmabbo13 added 8 commits July 19, 2022 15:14

test bundler api

d8fe63e

upload reference results to verify model correctness

bea35bc

test for model correctness against reference results

9837f4a

Revert "test for model correctness against reference results"

da193c9

This reverts commit 9837f4a.

Revert "upload reference results to verify model correctness"

a6ee730

This reverts commit bea35bc.

Revert "test bundler api"

4712437

This reverts commit d8fe63e.

test bundler api

9961fbe

test bundler api

dc4cf1e

facebook-github-bot added the cla signed label Jul 19, 2022

pmabbo13 added 5 commits July 19, 2022 15:55

Merge branch 'feature/t5-integration-tests' of https://github.com/pma…

d28fc65

…bbo13/text into feature/t5-integration-tests

upload reference results to verify model correctness

5f914ba

test for model correctness against reference results

aa13fa2

nit correction

38a5d24

test bundler when model is encoder-only

b9df22f

pmabbo13 marked this pull request as ready for review July 19, 2022 21:08

pmabbo13 requested review from Nayef211, abhinavarora and parmeet July 19, 2022 21:09

correcting typo

e32c99e

Nayef211 approved these changes Jul 20, 2022

View reviewed changes

parmeet approved these changes Jul 20, 2022

View reviewed changes

test/prototype/models/test_models.py Show resolved Hide resolved

remove redundant test for encoder-only

d1a23f0

pmabbo13 merged commit ed69973 into pytorch:main Jul 21, 2022

pmabbo13 deleted the feature/t5-integration-tests branch July 21, 2022 15:39

pmabbo13 mentioned this pull request Jul 21, 2022

Add T5 Model and Demo on Text Summarization using CNNDM Dataset #1800

Closed

25 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Testing T5Model #1848

Testing T5Model #1848

Uh oh!

pmabbo13 commented Jul 19, 2022 •

edited

Loading

Uh oh!

Nayef211 left a comment

Uh oh!

Nayef211 Jul 20, 2022

Uh oh!

pmabbo13 Jul 20, 2022

Uh oh!

parmeet left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Testing T5Model #1848

Testing T5Model #1848

Uh oh!

Conversation

pmabbo13 commented Jul 19, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Process

Testing

Follow-Up

Uh oh!

Nayef211 left a comment

Choose a reason for hiding this comment

Uh oh!

Nayef211 Jul 20, 2022

Choose a reason for hiding this comment

Uh oh!

pmabbo13 Jul 20, 2022

Choose a reason for hiding this comment

Uh oh!

parmeet left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pmabbo13 commented Jul 19, 2022 •

edited

Loading