Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

extract_loudness: Fixing #31 #32

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

nielsrolf
Copy link

Here is a version of extract_loudness that is almost a 1:1 copy of the fixed version of the magenta/ddsp compute_loudness. The changes are mostly:

  • take mean in amplitude scale, then convert back to db
  • do everything in PyTorch instead of librosa/numpy, thereby making it differentiable
    In contrast to magenta/ddsp, this version does not have dynamic range and ref_db, but I don't understand the concepts behind those and doubt that they are necessary.
    There seem to be small numerical errors that prevent the curve from being 100% smooth, but this is something one can live with I hope:
    image

…ting and make the function differentiable
assert (block_size is None) != (frame_rate is None), "Specify exactly one of block_size or frame_rate"

if frame_rate is not None:
block_size = sample_rate // frame_rate

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

minor thingy: should be sampling_rate here instead?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants