Dithering constant #371

Why do `torchaudio.compliance.kaldi.fbank` and `torchaudio.compliance.kaldi.spectrogram` have such a large default for the `dither` parameter (=1.0)? Very often it just drowns the whole output in noise.

It's common to use a dither value close to 0, e.g. 0.00001 in QuartzNet and Jasper, which are near-SOTA ASR models (https://github.com/NVIDIA/NeMo/blob/master/examples/asr/configs/quartznet15x5.yaml).
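To make the scale problem concrete, here is a minimal sketch in plain Python. It assumes dithering adds zero-mean Gaussian noise with standard deviation equal to `dither` to every sample (as Kaldi does), and that the waveform is normalized to [-1, 1], which is what `torchaudio.load` returns by default. The `snr_db` helper and the 440 Hz test tone are hypothetical, made up for illustration; this is not torchaudio's actual implementation.

```python
import math
import random


def snr_db(signal, dither, seed=0):
    """Signal-to-noise ratio in dB of `signal` against Gaussian dither noise.

    Assumption: the dither noise is N(0, 1) scaled by `dither`, per sample.
    """
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) * dither for _ in signal]
    signal_power = sum(s * s for s in signal) / len(signal)
    noise_power = sum(n * n for n in noise) / len(noise)
    return 10.0 * math.log10(signal_power / noise_power)


# Hypothetical test signal: one second of a 440 Hz sine at 16 kHz, peak 0.5.
sample_rate = 16000
normalized = [
    0.5 * math.sin(2 * math.pi * 440 * t / sample_rate)
    for t in range(sample_rate)
]
# The same tone on the 16-bit integer scale that Kaldi works with.
int16_scaled = [s * 32768 for s in normalized]

snr_normalized_large = snr_db(normalized, dither=1.0)
snr_normalized_small = snr_db(normalized, dither=1e-5)
snr_int16_large = snr_db(int16_scaled, dither=1.0)

print(f"normalized, dither=1.0:  {snr_normalized_large:.1f} dB")
print(f"normalized, dither=1e-5: {snr_normalized_small:.1f} dB")
print(f"int16 scale, dither=1.0: {snr_int16_large:.1f} dB")
```

Under these assumptions, `dither=1.0` on a normalized waveform gives a negative SNR (the noise is louder than the tone), while the same value on int16-scaled samples, or `dither=1e-5` on normalized samples, leaves the signal essentially untouched. That would explain why a default that is harmless in Kaldi is destructive here.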

Note that even the torchaudio tutorial uses dither = 0.0: https://pytorch.org/tutorials/beginner/audio_preprocessing_tutorial.html.

Also see this issue and how it was resolved: https://github.com/pytorch/audio/issues/157

