Skip to content

DDSP components in TorchAudio #2835

Open
@mthrok

Description

@mthrok

Differential Digital Signal Processing (DDSP) is a technique proposed by the Google Magenta team. [repo, papers].

It provides powerful generative methods that take advantage of the structure of sound based on Fourier analysis.
The paper has over 200 citations and is being applied to different tasks like speech synthesis and source separation.

TorchAudio is looking to support DDSP-based modeling methods by adding basic components.
We would like to hear and get help from the community on this work stream.

Initial Goal

To support DDSP-based generative modeling.

New Components / APIs

  1. Harmonic Synthesizer / Additive Synthesis
  2. Subtractive Synthesis

Components for DDSP Modeling from [1]

  1. Multi-scale mel spectrogram loss
    Used to train models. Compare the melspectrogram of the original waveform and synthesized waveform at multiple scales. [reference impl]
  2. exp_sigmoid
    Modified sigmoid function used to ensure that model output is in the value range expected by synthesizers [reference impl]
  3. [Nice to have] Dataset
  4. [Stretch] DDSP Model from [1]
    • AutoEncoder model for Timbre transfer
  5. [Stretch] Training Script and pre-trained model
    Script to train models for
    • LJSpeech
    • Solo violin

Existing Components / Workstream

Existing Tutorials

References

  1. [2001.04643] DDSP: Differentiable Digital Signal Processing

and some random papers moto found interesting

  1. [2010.15084] Speech Synthesis and Control Using Differentiable DSP
  2. [2202.00200] Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds
  3. [2210.14476] Sinusoidal Frequency Estimation by Gradient Descent

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions