This repository was archived by the owner on Sep 10, 2025. It is now read-only.
Closed
751 commits
0f7f859
Cache extraction for AmazonReviewPolarity (#1527)
parmeet Jan 20, 2022
eb39945
Migrating PennTreebank to datapipes (#1511)
Nayef211 Jan 20, 2022
1231709
Migrating EnWik9 to datapipes #1511 (#1512)
Nayef211 Jan 20, 2022
83aebf4
add double caching for yelp polarity to speed up extracted reading. (…
erip Jan 20, 2022
03afb7e
Migrate IMDB to datapipes (#1531)
kevinchn Jan 20, 2022
e1d66cf
add max_tokens kwarg to vocab factory. (#1525)
erip Jan 21, 2022
ff78e99
add double caching for yahoo to speed up extracted reading. (#1528)
erip Jan 21, 2022
437eea8
Migrate WikiText2 to datapipes (#1519)
abhinavarora Jan 21, 2022
d19a77e
add double caching for yelp full to speed up extracted reading. (#1529)
erip Jan 21, 2022
042f12f
Migrate WikiText103 to datapipes (#1518)
abhinavarora Jan 21, 2022
f685c55
add initial pass at migrating UDPOS to datapipes. (#1535)
erip Jan 22, 2022
627c71f
migrate Multi30k to datapipes. (#1536)
erip Jan 24, 2022
ce1ce99
Add AmazonReviewPolarity Mocked Unit Test (#1532)
Nayef211 Jan 24, 2022
d72124c
Migrate SST2 from experimental to datasets folder (#1538)
Nayef211 Jan 26, 2022
e0c5528
Rename AmazonReviewPolarity test file (#1540)
Nayef211 Jan 27, 2022
91dde7e
Removing unused param args constant (#1544)
Nayef211 Jan 27, 2022
7f839b6
Add SST2 Mocked Unit Test (#1542)
Nayef211 Jan 28, 2022
169924b
Convert _get_mock_dataset fn to be private (#1543)
Nayef211 Jan 28, 2022
fe09343
Updated test to be consistent with SST2 test (#1548)
Nayef211 Jan 28, 2022
1b2f12e
fix yelp dataset (#1550)
parmeet Jan 28, 2022
5056218
fix yahoo dataset (#1551)
parmeet Jan 28, 2022
9561cde
fix penn dataset (#1552)
parmeet Jan 28, 2022
15c4222
mock up AG NEWS test for faster testing. (#1553)
erip Jan 30, 2022
c10d7ef
migrate IWSLT2016 to datapipes. (#1545)
erip Jan 30, 2022
f27047f
remove extra print (#1557)
erip Jan 31, 2022
2372682
fix flake. (#1558)
erip Jan 31, 2022
1f17c0a
migrate IWSLT2017 to datapipes. (#1547)
erip Jan 31, 2022
448a791
Implement ClipTokenizer that builds on top of GPT2BPETokenizer (#1541)
abhinavarora Feb 1, 2022
3ba62ca
mock up IWSLT2016 test for faster testing. (#1563)
erip Feb 2, 2022
69825a1
Multi30k mocked testing (#1554)
parmeet Feb 2, 2022
582da0f
fix typo in torchtext/vocab/vocab_factory.py (#1565)
A03ki Feb 3, 2022
3f2c593
Add AmazonReviewFull Mocked Unit Test (#1561)
vcm2114 Feb 3, 2022
a79035f
Add EnWik9 Mocked Unit Test (#1560)
vcm2114 Feb 3, 2022
339804f
[Bug-Fix] fix hash for datasets testing (#1572)
parmeet Feb 3, 2022
14fef0f
Add YelpReviewFull Mocked Unit Test (#1568)
vcm2114 Feb 3, 2022
fdcbad2
Add UDPOS Mocked Unit Test (#1569)
vcm2114 Feb 3, 2022
7f3ed4b
Fix handling of end of file while reading vocab from file (#1573)
abhinavarora Feb 4, 2022
012ab2a
Added SST2 dataset properties to raw_datasets json file (#1580)
Nayef211 Feb 4, 2022
c434fcc
Add SQuAD1 Mocked Unit Test (#1574)
Nayef211 Feb 4, 2022
08c774d
Added sogounews mocked test (#1576)
Nayef211 Feb 4, 2022
aa78b86
add CC100 (#1562)
erip Feb 4, 2022
bef9304
Add PennTreebank Mocked Unit Test (#1578)
Nayef211 Feb 4, 2022
10fe0f8
Add YahooAnswers Mocked Unit Test #1574 (#1577)
Nayef211 Feb 4, 2022
fd50fb2
Add DBpedia Mocked Unit Test (#1566)
vcm2114 Feb 4, 2022
0b38890
Renamed variable name to base_dir (#1582)
Nayef211 Feb 5, 2022
7bd6b72
Fix flake issue (#1584)
parmeet Feb 5, 2022
d8f9559
Remove real dataset caching and testing (#1587)
parmeet Feb 7, 2022
223584b
add CC100 mocking test (#1583)
parmeet Feb 7, 2022
d8a0df4
Add SQuAD2 Mocked Unit Test (#1575)
Nayef211 Feb 7, 2022
3a5c2d3
non-distributed training example for SST2 dataset using XLM-Roberta m…
parmeet Feb 8, 2022
fed25fe
Updating Conll2000Chunking dataset to be consistent with other datase…
Nayef211 Feb 8, 2022
99eb1f8
Added caching to extracted files in debpedia (#1571)
Nayef211 Feb 8, 2022
84b719e
Merge YelpReviewPolarity and YelpReviewFull Mocked Unit Tests (#1567)
vcm2114 Feb 8, 2022
da34de2
strips newlines and fixes test. (#1593)
erip Feb 8, 2022
efaf7ce
Add CoNLL2000Checking Mocked Unit Test (#1570)
vcm2114 Feb 8, 2022
acd33bd
Add WikiText103 and WikiText2 Mocked Unit Tests (#1592)
vcm2114 Feb 8, 2022
31434ff
Add IMDB Mocked Unit Test (#1579)
Nayef211 Feb 8, 2022
ccf021f
Adding secondary caching to datasets (#1594)
Nayef211 Feb 8, 2022
8c65757
[BUG-FIX] Fixing Conll2000Chunking Test (#1595)
Nayef211 Feb 9, 2022
18b61fa
[BC-breaking] remove unnecessary split argument from datasets (#1591)
parmeet Feb 9, 2022
7b7a90d
IWSLT testing to start from compressed file (#1596)
parmeet Feb 10, 2022
c3f59a5
Add Mock test for IWSLT2017 dataset (#1598)
parmeet Feb 10, 2022
08f49f9
Parameterize tests for similar datasets (#1600)
Nayef211 Feb 10, 2022
9686e0d
Updated formatting for datasets (#1602)
Nayef211 Feb 11, 2022
e710e3a
[FORMATTING] Update formatting for dataset tests (#1601)
Nayef211 Feb 11, 2022
eb61b3f
Update docs for Machine Translation, Sequence Tagging, Question Answe…
parmeet Feb 11, 2022
f6830c5
[DOCS] Added docs for text classification and language modeling datas…
Nayef211 Feb 11, 2022
d40c375
remove private functions whose responsibilities have been subsumed by…
erip Feb 12, 2022
2e93d94
generate unicode strings to test utf-8 handling for all non-IWSLT dat…
erip Feb 12, 2022
7209421
Updated split arg. Added docs for SST2 and CC100 (#1604)
Nayef211 Feb 12, 2022
7937703
Add missing quotation marks to to CLIPTokenizer docs (#1610)
ProGamerGov Feb 14, 2022
8808e7e
Remove _add_docstring_header decorator from amazon review polarity (#…
Nayef211 Feb 14, 2022
c31a400
prepare repo for auto-formatters (#1546)
pmeier Feb 17, 2022
ebabe82
Fix flake8 issues introduced as a result of #1546 (#1617)
abhinavarora Feb 17, 2022
16acc71
Revert "Attempting to fix version conflict in CI (#1520)" (#1619)
parmeet Feb 17, 2022
f2da1b8
prepare datasets for new encoding kwarg. (#1616)
erip Feb 19, 2022
81212ba
Modify CLIPTokenizer to either infer number of merges from encoder js…
abhinavarora Feb 22, 2022
7dcdbc9
Fix for windows builds with python 3.10 , getting rid of ssize_t (#1627)
atalman Feb 23, 2022
ea16b6a
Build and test py3.10 (#1625)
atalman Feb 23, 2022
935008e
bump version to 0.13 (#1633)
parmeet Mar 1, 2022
e6ba477
update doc conf (#1634)
parmeet Mar 2, 2022
69f67f3
[BC-breaking] rename Roberta Bundle (#1635)
parmeet Mar 2, 2022
ec364a2
adding data pipelines for Roberta pre-processing (#1637)
parmeet Mar 9, 2022
2dd5e25
Add unicode generation to IWSLT tests (followup to #1608) (#1642)
Nayef211 Mar 9, 2022
4b821f4
remove models from experimental (#1643)
parmeet Mar 10, 2022
62915ad
fix roberta bundle example doc (#1648)
parmeet Mar 10, 2022
fcbe057
remove install instructions (#1641)
parmeet Mar 10, 2022
9abe9d0
remove experimental functional (#1645)
parmeet Mar 10, 2022
2ab3b29
remove experimental datasets (#1646)
parmeet Mar 10, 2022
02d0fc6
clean-up examples folder (#1647)
parmeet Mar 10, 2022
1ce1fab
update README (#1652)
parmeet Mar 11, 2022
8b08b87
clean-up stale code (#1654)
parmeet Mar 13, 2022
67e0e96
fix logger issue (#1656)
parmeet Mar 14, 2022
142b1ef
udpate download hooks (#1657)
parmeet Mar 14, 2022
e7cb2f4
replace git+git with git+https in requirements.txt (#1658)
ptrblck Mar 15, 2022
f8ce695
Add more test coverage (#1653)
vcm2114 Mar 22, 2022
77f3363
Install torchdata from nightly release in CI (#1664)
NivekT Mar 28, 2022
c821b89
Update README around installing LTS version (#1665)
Nayef211 Mar 28, 2022
d4656e2
Updating TorchData DataPipe API usages (#1663)
NivekT Mar 29, 2022
edc03bd
Resolve issues in #1653 + sanitize test names generated by nested_par…
vcm2114 Mar 30, 2022
9fc9077
Pin Jinja2 version to fix broken doc build (#1669)
Nayef211 Apr 1, 2022
c8ac763
Fixed formatting for all files using pre-commit (#1670)
Nayef211 Apr 1, 2022
5142463
[CMake 1/3] Update C++ includes to use imports relative to root direc…
Nayef211 Apr 4, 2022
b710c88
Adding MacOS unit tests on CircleCI (#1672)
Nayef211 Apr 4, 2022
55b7413
update requirements (#1675)
parmeet Apr 5, 2022
e791102
remove caching artifacts for datasets and fix it for vectors (#1674)
parmeet Apr 5, 2022
fae8e8c
torchx integration (#1679)
parmeet Apr 11, 2022
e1d4948
[CMake 2/3] Add CMake Build to torchtext to create single `_torchtext…
Nayef211 Apr 19, 2022
38f520c
Add pad transform, string to int transform (#1683)
ebsmothers Apr 21, 2022
5b8a25f
fix pad transform test (#1688)
parmeet Apr 26, 2022
3e25279
Fix smoke tests for linux (#1687)
atalman Apr 27, 2022
13fa5a5
[CMake 3/3] Split source files with Python dependency to separate lib…
Nayef211 Apr 27, 2022
1f30933
[fbsync] BetterTransformer support for torchtext (#1690) (#1694)
parmeet May 2, 2022
ed44dc8
[fbsync] Kill to_better by having native load_from_state_dict and ini…
parmeet May 2, 2022
88b251f
Provide option to return splited tokens (#1698)
parmeet May 3, 2022
dccb84f
[fbsync] Remove unneeded modules after using nn.Module for BetterTran…
parmeet May 3, 2022
7bc0071
Fix non-deterministic test failures for IWSLT (#1699)
Nayef211 May 4, 2022
8889f9c
[fbsync] Replace TransformerEncoder in torchtext with better transfor…
parmeet May 6, 2022
27ccd7e
Fix minor mistake (#1706)
MohamedAliRashad May 9, 2022
b685f32
Model benchmark (#1697)
parmeet May 10, 2022
4b4d50b
Updating dataset code to avoid creating multiple iterators from a Dat…
NivekT May 11, 2022
dfb53af
Replacing lambda functions with regular functions in all datasets (#1…
NivekT May 11, 2022
6689502
Enable model testing in FBCode (#1720)
parmeet May 12, 2022
ab76a04
Updating sst2 tutorial to replace lambda usage (#1722)
NivekT May 13, 2022
322cf2b
For Datasets, refactor local functions to be global so that they can …
NivekT May 16, 2022
03b8a05
Remove MACOSX_DEPLOYMENT_TARGET (#1728)
parmeet May 17, 2022
88086d9
fix docs build (#1730)
parmeet May 18, 2022
2a712f4
Add Shuffle and sharding datapipes to datasets (#1729)
parmeet May 18, 2022
ec20f88
Add support for CoLA dataset with unit tests (#1711)
vcm2114 May 18, 2022
bb41e4f
Add support for MRPC dataset with unit tests (#1712)
vcm2114 May 18, 2022
bd0f765
Add support for QQP dataset with unit tests (#1713)
vcm2114 May 18, 2022
e631624
Add support for MNLI dataset with unit tests (#1715)
vcm2114 May 18, 2022
c5f08db
Add support for STS-B dataset with unit tests (#1714)
vcm2114 May 19, 2022
996d9f8
Delete prints in test_qqp.py (#1734)
vcm2114 May 19, 2022
caaa8e3
Remove redundant dataname in test_shuffle_shard_wrapper (#1733)
parmeet May 20, 2022
e548d3f
Adding parameterized dataset pickling tests (#1732)
Nayef211 May 23, 2022
8bbb6ac
Fix STSB and WikiTexts tests (#1737)
parmeet May 23, 2022
da509e1
Add support for Scriptable BERT tokenizer (#1707)
parmeet May 25, 2022
59f4bee
Fix docstring for Tokenizers (#1739)
parmeet May 25, 2022
70fc104
Change root directory for datasets (#1740)
parmeet May 25, 2022
f2fdae2
Take TORCH_HOME env variable into account while setting the cache dir…
parmeet May 26, 2022
fe57328
Fix doc js initialization (#1736)
mthrok May 26, 2022
d963bdb
Pin setuptools to 58.0.4 on Windows (#1746)
Nayef211 May 26, 2022
65f774b
Fix Mock tests due to change in datasets directory (#1749)
parmeet May 27, 2022
b7b99bd
add test for shuffle before shard (#1738)
parmeet May 31, 2022
9411d21
Use _get_torch_home standard utility from torch hub (#1752)
parmeet May 31, 2022
235c9f7
Removed ticks (``) from the url under is_module_available (#1753)
suvigyajain0101 May 31, 2022
8722c8d
add header info for BERT tokenizer (#1754)
parmeet Jun 1, 2022
dc7325a
Add support for QNLI dataset with unit tests (#1717)
vcm2114 Jun 1, 2022
932d776
Add support for RTE dataset with unit tests (#1721)
vcm2114 Jun 1, 2022
73bf4fa
Add support for WNLI dataset with unit tests (#1724)
vcm2114 Jun 1, 2022
60d9d51
Adding support for batch input in BERT Tokenizer with perf benchmark …
parmeet Jun 1, 2022
814aa7e
Added post install script for pywin32 (#1748)
Nayef211 Jun 1, 2022
cb8475e
Add contributing guidelines for third party and custom C++ operators …
Nayef211 Jun 1, 2022
1c1e823
Fix test utils (#1757)
parmeet Jun 2, 2022
2978507
Add recommendations regarding use of datapipes for multi-processing, …
NicolasHug Jun 2, 2022
6aef9c9
Fix bug in RTE and WNLI testing (#1759)
parmeet Jun 3, 2022
ab33a61
Fix bug in QNLI dataset and corresponding test (#1760)
Nayef211 Jun 3, 2022
d6e3550
Fix windows utils test (#1761)
parmeet Jun 3, 2022
cfd1ae2
fix missed mask arg in torchtext transformer (#1758)
erichan1 Jun 6, 2022
d774047
Update nightly version to 0.14 (#1769)
Nayef211 Jun 7, 2022
6e775d1
Migrate RegexTokenizer from experimental/transforms.py to transforms.…
pmabbo13 Jun 7, 2022
1e9d731
Add test to compare encoder inference on input with and without paddi…
erichan1 Jun 7, 2022
87b397a
Pinning Utf8proc version (#1771)
parmeet Jun 8, 2022
04fea7a
Refactor TorchText version handing and adding first version of M1 bui…
atalman Jun 8, 2022
e2fa8d8
Add m1 tagged build for torchtext (#1776)
atalman Jun 10, 2022
d616f07
Add missing type hints (#1782)
ProGamerGov Jun 13, 2022
4f53eb8
Fix typo in nightly branch ref (#1783)
atalman Jun 13, 2022
c4d379e
sharing -> sharding (#1787)
Jun 14, 2022
19dc51c
Migrate MaskTransform from internal to experimental/transforms.py (#1…
pmabbo13 Jun 14, 2022
5b04e74
Making sure we build correctly against release branch (#1790)
atalman Jun 15, 2022
8b35599
Fix push on release reference name (#1792)
atalman Jun 15, 2022
a54be1f
Adding the conda builds for m1 (#1794)
atalman Jun 17, 2022
5f48259
Add benchmark for roberta prepoc pipelines (#1684)
parmeet Jun 17, 2022
a937288
remove padding mask for input embeddings (#1799)
parmeet Jun 21, 2022
a6eb3b7
Add CNN-DM dataset to torchtext (#1789)
pmabbo13 Jun 22, 2022
de07050
Adding Benchmark for TA ops (#1801)
parmeet Jun 23, 2022
6d6a14a
Automatically initialize submodule (#1805)
mthrok Jun 23, 2022
e023ce1
Rename build_tools dir to tools dir (#1804)
mthrok Jun 23, 2022
31eabf8
converting experimental to prototype (#1803)
parmeet Jun 23, 2022
238c414
Add Datasets contribution guidelines (#1798)
parmeet Jun 26, 2022
7cc1807
Adding benchmarks for add tokens operator (#1807)
Nayef211 Jun 27, 2022
583c5b2
Fixing incorrect inputs to add eos and bos operators (#1810)
Nayef211 Jun 27, 2022
d37bb52
Update compatibility matrix for 0.13 release (#1802)
Nayef211 Jun 28, 2022
81c1d75
[Docs] Updating usage docs for Regex Tokenizer (#1815)
Nayef211 Jul 6, 2022
c7e5a07
Fix Multi30k dataset urls (#1816)
Nayef211 Jul 6, 2022
a4a81d2
Update CircleCI Xcode image (#1818)
mthrok Jul 7, 2022
e1c7bc6
Define TORCHTEXT_API macro for visibility control (#1806)
mthrok Jul 7, 2022
cf94d30
Add libtorchtext cpp example (#1817)
Nayef211 Jul 8, 2022
53664ff
Fixing build when CUDA enabled torch is installed (#1814)
Nayef211 Jul 8, 2022
67bb7fc
Use TA functional for adding tokens to the beginning and end of input…
parmeet Jul 11, 2022
1c71947
Add TA Tensor creation operation to the benchmark (#1836)
parmeet Jul 14, 2022
b794794
Remove sphinx_rtd_theme from requirements.txt (#1837)
Nayef211 Jul 14, 2022
5910ea7
Set MACOSX_DEPLOYMENT_TARGET=10.9 for binary job (#1835)
mthrok Jul 14, 2022
3549a50
Remove future dep from windows (#1838)
Nayef211 Jul 14, 2022
5ce9c42
Cache CNNDM extraction and optimize reading in filenames (#1809)
pmabbo13 Jul 14, 2022
e964051
Modify get_local_asset_path to take overwrite option and use it in BE…
parmeet Jul 14, 2022
bb58f6e
fix OBO error for vocab files with empty lines (#1841)
parmeet Jul 18, 2022
8d56ed2
Add T5 Model to TorchText (#1845)
pmabbo13 Jul 18, 2022
ca2e5a4
Bundler API for TorchText T5 Model (#1846)
pmabbo13 Jul 19, 2022
ed69973
Testing T5Model (#1848)
pmabbo13 Jul 21, 2022
4fb43aa
Torcharrow based training using RoBERTa model and SST2 classification…
parmeet Jul 21, 2022
af03d71
Convert TA transform module to prepoc function (#1854)
parmeet Jul 22, 2022
e114e98
T5Transform text pre-processing for t5 model (#1852)
pmabbo13 Jul 22, 2022
4a5f11c
Add text transform to T5 bundler (#1856)
pmabbo13 Jul 25, 2022
f450271
Prepare T5 Model for Language Generation (#1862)
pmabbo13 Jul 28, 2022
ef0808f
Correct typo in SST-2 tutorial (#1865)
joshsmith2 Aug 1, 2022
103551f
ci: Use self hosted runners for build (#1851)
seemethere Aug 1, 2022
2755797
Fix docstring type (#1867)
ProGamerGov Aug 2, 2022
466f2e2
Tutorial on using T5 model for text summarization (#1864)
pmabbo13 Aug 2, 2022
255f4f7
Add missing None type hint to tests (#1868)
ProGamerGov Aug 2, 2022
8eb0561
Make BERT benchmark code more robust (#1871)
parmeet Aug 3, 2022
2bb2562
Updating T5 demo to use beam search for generator (#1869)
pmabbo13 Aug 3, 2022
eaed744
Add torchdata to testing requirements in requirements.txt (#1874)
ronakmal Aug 8, 2022
e1b6984
Demo T5 model on sentiment classification and translation (#1872)
pmabbo13 Aug 8, 2022
e7bcf3c
Make T5 model torchscriptable (#1876)
pmabbo13 Aug 11, 2022
5a351b4
[WIP] wrapper class for end-to-end t5 model (#1880)
pmabbo13 Aug 15, 2022
2fd12f3
Update xcode version to 14.0 in CI (#1881)
Nayef211 Aug 15, 2022
3a0d0a3
Graduate MaskTransform from prototype (#1882)
joecummings Aug 17, 2022
72966f0
Add small, large, 3b, 11b pre-trained weights for t5 (#1879)
pmabbo13 Aug 17, 2022
0225abe
Allow CNNDM to be imported from torchtext.datasets (#1884)
pmabbo13 Aug 19, 2022
6a43bd5
Remove dependency on the torch::jit::script::Module for mobile builds…
joecummings Aug 31, 2022
5087134
Create pytest fixture to auto delete model checkpoints within integra…
Nayef211 Sep 1, 2022
d259856
Make comment paths dynamic (#1894)
joecummings Sep 1, 2022
1b5edec
Move Spacy from Pip dependencies to Conda dependencies (#1890)
joecummings Sep 3, 2022
72ecc0c
Usage setup-minicoda action for m1 build (#1897)
atalman Sep 13, 2022
dfac1ee
Disable `test_vocab_from_raw_text_file` on Linux (#1901)
Nayef211 Sep 15, 2022
6c1708f
Turn off mask checking for torchtext which is known to have a legal m…
Nayef211 Sep 15, 2022
94c53cb
Update doc theme to the latest (#1899)
Nayef211 Sep 16, 2022
67d2692
Fix test execution in torchtext (#1889)
Nayef211 Sep 16, 2022
b0df58b
Add never_split feature to BERTTokenizer (#1898)
reachsumit Sep 19, 2022
befea6e
Add missing Cmake file for in tokenizer dir (#1908)
Nayef211 Sep 20, 2022
9b06d56
Fix Sphinx-gallery display and pin sphinx-related packages (#1907)
Nayef211 Sep 20, 2022
766cf9d
Resolve and remove TODO comments (#1912)
Nayef211 Sep 27, 2022
5c48f4a
Avoid looping through the whole counter in bleu_score method (#1913)
Asugawara Sep 27, 2022
52436c8
Resolve inconsistency in IMDB label output (#1914)
Nayef211 Sep 27, 2022
258a356
Add decoding capability to GPT2BPE tokenizer (#1919)
reachsumit Oct 3, 2022
3f9c349
[Feature] Added capability to add special tokens in GPT2BPEEncoder an…
reachsumit Oct 4, 2022
de54db6
[Feature] Add ability to load HF checkpoints into T5 model (#1918)
joecummings Oct 5, 2022
ff1fdfc
Updating usage of torch.utils.data.graph.traverse in test case (#1927)
NivekT Oct 5, 2022
0026773
[CI] Fix upload channel (#1932)
atalman Oct 6, 2022
6ffe7be
Avoid using std::regex and fix lint errors (#1930)
reachsumit Oct 10, 2022
c776dc1
Update dataset RTE information (#1934)
Geaming2002 Oct 11, 2022
4d88d4e
Revert "[CI] Fix upload channel (#1932)" (#1939)
atalman Oct 11, 2022
fb51653
Fixed on_disk_cache issues (#1942) (#1945)
joecummings Oct 14, 2022
238b342
Update decoding logic to handle special tokens (#1925)
reachsumit Oct 17, 2022
4570a56
Fix device mismatch bug in T5 implementation (#1944)
joecummings Oct 17, 2022
5eb33ce
Add Character Level BPE Tokenizer (#1936) (#1946)
rshraga Oct 18, 2022
db987ed
Merge branch 'main' into merge_main_to_fbsync
Nayef211 Oct 18, 2022
1aaaf3e
Merge branch 'fbsync' into merge_main_to_fbsync
Nayef211 Oct 19, 2022

These merge commits were added into this branch cleanly.

There are no new changes to show.