Convert CommonVoice tartar training file from mp3 to wav (8kHz), and remove unused file #772

engineerchuan · 2020-07-11T13:10:02Z

Used 'sox -i <mp3> <wav>' with default 14.4.2 options

soxi output for the wave file I created:

Input File     : 'assets/CommonVoice/cv-corpus-4-2019-12-10/tt/clips/common_voice_tt_00000000.wav'
Channels       : 1
Sample Rate    : 44100
Precision      : 16-bit
Duration       : 00:00:05.00 = 220500 samples = 375 CDDA sectors
File Size      : 441k
Bit Rate       : 706k
Sample Encoding: 16-bit Signed Integer PCM
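For readers without sox installed, the header fields in the soxi output above can be cross-checked with Python's standard `wave` module. This is only a minimal sketch: the helper name `describe_wav` is made up for illustration and is not part of torchaudio or sox. It also shows where the numbers come from: 220500 samples at 44100 Hz is 5 seconds, and 44100 samples/s at 16 bits on one channel is 705600 bit/s, which soxi rounds to 706k.

```python
import wave


def describe_wav(path):
    """Return basic header fields of a PCM wav file (hypothetical helper)."""
    with wave.open(path, "rb") as wav:
        channels = wav.getnchannels()
        sample_rate = wav.getframerate()
        bits = wav.getsampwidth() * 8
        frames = wav.getnframes()
    return {
        "channels": channels,
        "sample_rate": sample_rate,
        "precision_bits": bits,
        "frames": frames,
        # Duration in seconds is frame count divided by frames per second.
        "duration_s": frames / sample_rate,
        # Uncompressed PCM bit rate: rate * bits per sample * channels.
        "bit_rate": sample_rate * bits * channels,
    }
```

Calling `describe_wav` on the converted clip should report the same channel count, sample rate, and precision as the soxi listing, assuming the file is plain PCM.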

I wasn't sure about whether we need to change Line 111 of torchaudio/datasets/commonvoice.py.

class COMMONVOICE(Dataset):
    """
    Create a Dataset for CommonVoice. Each item is a tuple of the form:
    (waveform, sample_rate, dictionary)
    where dictionary is a dictionary built from the tsv file with the following keys:
    client_id, path, sentence, up_votes, down_votes, age, gender, accent.
    """

    _ext_txt = ".txt"
    _ext_audio = ".mp3"
    _folder_audio = "clips"

Used 'sox -i <mp3> <wav>' with default 14.4.2 options

Issue also mentioned test.wav accidentally checked in but this does not seem to be there.

codecov · 2020-07-11T13:32:14Z

Codecov Report

Merging #772 into master will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master     #772   +/-   ##
=======================================
  Coverage   89.53%   89.53%           
=======================================
  Files          32       32           
  Lines        2617     2617           
=======================================
  Hits         2343     2343           
  Misses        274      274

Last update c375490...9d0dc78.

mthrok · 2020-07-11T13:47:47Z

Hi @engineerchuan

Thanks for working on this.

Looks like waves_yesno/0_1_0_1_0_1_1_0.wav and one more file (not sure which one, but the one used for test_gtzan) are actually used by the tests, which I overlooked. Could you put them back?

> I wasn't sure about whether we need to change Line 111 of torchaudio/datasets/commonvoice.py.

We should only be changing the tests, so torchaudio/datasets/commonvoice.py should stay the same. Did you run into an issue doing it that way?

mthrok · 2020-07-11T13:52:26Z

Also, can you try reducing the sampling rate of the new wav file (assets/CommonVoice/cv-corpus-4-2019-12-10/tt/clips/common_voice_tt_00000000.wav) to 8000 Hz? The quality of the audio should not matter to the tests, and we would like to keep the test assets as small as possible.
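To illustrate the size saving being asked for, here is a rough standard-library sketch of downsampling a 16-bit mono wav to 8000 Hz. It is not the method used in this PR (the commits mention sox), the function name `downsample_wav` is hypothetical, and it simply picks the nearest earlier sample with no anti-aliasing filter, which is only acceptable because audio quality does not matter for a test asset. For the 5 second clip, 220500 samples at 44100 Hz become 40000 samples at 8000 Hz, so the audio data shrinks from 441000 bytes to 80000 bytes.

```python
import array
import sys
import wave


def downsample_wav(src_path, dst_path, target_rate=8000):
    """Naively downsample a 16-bit mono PCM wav (hypothetical helper)."""
    with wave.open(src_path, "rb") as src:
        if src.getnchannels() != 1 or src.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit mono PCM")
        src_rate = src.getframerate()
        samples = array.array("h")
        samples.frombytes(src.readframes(src.getnframes()))

    # wav data is little-endian; array uses the machine's native byte order.
    if sys.byteorder == "big":
        samples.byteswap()

    # Number of output samples that covers the same duration.
    n_out = len(samples) * target_rate // src_rate
    # Pick the source sample at or just before each output time step.
    # No low-pass filter is applied, so aliasing is possible.
    out = array.array(
        "h", (samples[i * src_rate // target_rate] for i in range(n_out))
    )

    if sys.byteorder == "big":
        out.byteswap()

    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(1)
        dst.setsampwidth(2)
        dst.setframerate(target_rate)
        dst.writeframes(out.tobytes())
    return n_out
```

A real conversion would normally go through a proper resampler, such as the one in sox, which filters before reducing the rate.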

mthrok

Looks good.

Thanks!

mthrok · 2020-07-12T01:58:22Z

Followup: Remove https://github.com/pytorch/audio/blob/master/test/test_datasets.py#L51 and https://github.com/pytorch/audio/blob/master/test/test_datasets.py#L49 to see if this test works on Windows.

@engineerchuan If you have time, would you like to try ^ ?

engineerchuan · 2020-07-12T12:57:21Z

> Followup: Remove https://github.com/pytorch/audio/blob/master/test/test_datasets.py#L51 and https://github.com/pytorch/audio/blob/master/test/test_datasets.py#L49 to see if this test works on Windows.
>
> @engineerchuan If you have time, would you like to try ^ ?

Yes I will try.

engineerchuan added 2 commits July 11, 2020 09:06

Convert CommonVoice tartar training file from mp3 to wav (44.1kHz)

f8582ee

Used 'sox -i <mp3> <wav>' with default 14.4.2 options

Issue 764, remove 3 unused audio files

ef32a9e

Issue also mentioned test.wav accidentally checked in but this does not seem to be there.

engineerchuan changed the title ~~Convert CommonVoice tartar training file from mp3 to wav (44.1kHz)~~ Convert CommonVoice tartar training file from mp3 to wav (8kHz), and remove unused file Jul 11, 2020

engineerchuan added 2 commits July 11, 2020 12:44

add back two used files

96552f0

converted CommonVoice tartar mp3 to wav using rate 8000 Hz

9d0dc78

mthrok approved these changes Jul 12, 2020


mthrok merged commit 26941fa into pytorch:master Jul 12, 2020

engineerchuan deleted the issue_764_remove_insignificant_test_assets_try_2 branch July 12, 2020 12:56

This was referenced Jul 12, 2020

Follow on issue 764, remove sox backend from TestCommonVoice #774

Closed

Remove sox backend for one test, followup from Issue 764 #775

Merged
