You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Sep 10, 2025. It is now read-only.
As mentioned in #664, we are working on a new dataset abstraction. The new datasets will be more compatible with pytorch core library and capable of out-of-box libraries (like SentencePiece BPE).
We have landed several datasets in torchtext.experimental.datasets folder to test the new abstraction. Now, we want some help from open-source community. Please sign up here and contribute PRs to re-write those datasets in torchtext. Those datasets in torchtext/experimental/datasets/text_classification should be some good examples to follow. Some datasets but not limited include:
Word language modeling datasets. Already in the experimental folder but need some update to the latest abstraction.