Which datasets should torchaudio have? #550

Which new datasets should we offer and prioritize in torchaudio?

I want to follow up on #31 and a few of the recent PRs. Instead of aiming for an exhaustive list of datasets, we should focus on a few important, common, and representative datasets that can serve as templates, so that users can easily implement datasets of their own choosing. All datasets should already be free, publicly accessible online, and in common use, with a license that permits linking to them.
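To make the "template" idea concrete, here is a rough sketch of what a user-written dataset modeled on such a template might look like. This is not torchaudio code: `FolderAudioDataset`, `write_demo_wav`, the `<label>_<id>.wav` file naming, and the `(samples, sample_rate, label)` item layout are all hypothetical choices for illustration. It also assumes 16-bit mono PCM files and uses only the standard library so it runs without torch installed; a real implementation would more likely subclass `torch.utils.data.Dataset` and return a tensor instead of a list.

```python
import os
import struct
import wave
from typing import List, Tuple


class FolderAudioDataset:
    """Hypothetical map-style dataset: one .wav file per item.

    Assumes 16-bit mono PCM files named "<label>_<id>.wav" in a single folder.
    """

    def __init__(self, root: str) -> None:
        self._root = root
        # Sort the file names so that indices are stable across runs.
        self._files: List[str] = sorted(
            f for f in os.listdir(root) if f.endswith(".wav")
        )

    def __len__(self) -> int:
        return len(self._files)

    def __getitem__(self, n: int) -> Tuple[List[int], int, str]:
        name = self._files[n]
        with wave.open(os.path.join(self._root, name), "rb") as f:
            sample_rate = f.getframerate()
            raw = f.readframes(f.getnframes())
        # Decode little-endian signed 16-bit samples (mono assumed).
        samples = list(struct.unpack("<%dh" % (len(raw) // 2), raw))
        # The label is the part of the file name before the first underscore.
        label = name[: -len(".wav")].split("_")[0]
        return samples, sample_rate, label


def write_demo_wav(path: str, samples: List[int], sample_rate: int = 16000) -> None:
    """Hypothetical helper that writes a tiny 16-bit mono file for trying the class."""
    with wave.open(path, "wb") as f:
        f.setnchannels(1)
        f.setsampwidth(2)
        f.setframerate(sample_rate)
        f.writeframes(struct.pack("<%dh" % len(samples), *samples))
```

The only contract the sketch relies on is `__len__` plus integer-indexed `__getitem__`, which is the map-style protocol that `torch.utils.data.DataLoader` consumes. Adapting it to another corpus would mostly mean changing how files are discovered and how the label is parsed.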

torchaudio currently has:
1. commonvoice
1. librispeech
1. ljspeech
1. speechcommands
1. vctk
1. yesno

Current open proposals:
* Free Universal Sound Separation (FUSS) #534
* CMU_ARCTIC #512
