Add support for all datasets of the GLUE benchmark

## 🚀 Feature

Add support for all 8 remaining datasets (SST-2 is already supported) of the [GLUE benchmark](https://gluebenchmark.com/): CoLA, MRPC, QQP, STS-B, MNLI, QNLI, RTE, WNLI.

**Motivation**

In itself adding support for all GLUE datasets has a lot of value: GLUE is one of the most widely used benchmark in the NLP community. Furthermore, our planned effort to develop a cohesive API for Multi-Task Learning requires us to enhance our suite our datasets, starting with GLUE.

**Additional context**

We have already created a streamlined dataset API, with a consistent use of DataPipes for dataset download and load operations, as well as a testing methodology relying on mock data (see #1493). This feature will only require to add support for these 8 datasets following that methodology.

## Datasets
 - [x] CoLA #1711
 - [x] MRPC #1712
 - [x] QQP #1713
 - [x] STS-B #1714
 - [x] MNLI #1715
 - [x] QNLI #1717
 - [x] RTE #1721
 - [x] WNLI #1724


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for all datasets of the GLUE benchmark #1710

🚀 Feature

Datasets

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Add support for all datasets of the GLUE benchmark #1710

Description

🚀 Feature

Datasets

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions