-
Couldn't load subscription status.
- Fork 203
Open
Description
I noticed when training causal models with RAVE v2 that the validation audio sounds pretty bad. If I'm understanding correctly, it's because V2 crops to the valid (as in convolution) portion of the signal, so the part of the reconstruction which is affected by zero padding (~2/3 of it with v2 defaults) is not trained at all. But validation_step doesn't do the same cropping, so the validation curve looks very noisy and the audio sounds bad in tensorboard.
Would it make sense to include the same cropping in validation_step?
Metadata
Metadata
Assignees
Labels
No labels