Downsample and Upsample ratio for T dimension #27

54wb · 2024-02-04T13:50:25Z

Hi, thanks for your great work. When I train on the MMNIST dataset with downsampling factors of 4, 8, 8, the input and output both have 10 frames in the T dimension. The encoder reduces the T dimension to 2, so the decoder cannot restore it to 10. What should I do when the dataset's T dimension is not a power of 2?

songweige · 2024-02-04T22:19:41Z

Hey @54wb, thank you for your question! I normally used videos with 16 frames, where downsampling by 4 in the temporal dimension gives 4 frames in the latent. For MMNIST, you could probably try 2 8 8, which gives a similar number of frames (5) in the latent.
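To make the arithmetic above concrete, here is a minimal sketch that checks which temporal factors survive an encode/decode round trip for a given clip length. It is not code from the repository: the helper names are hypothetical, and it assumes the encoder halves T with floor rounding once per factor of 2 and the decoder multiplies T back by the full factor, which matches the 10 -> 2 behaviour reported in the question.

```python
def latent_length(frames: int, factor: int) -> int:
    """Temporal length after downsampling by `factor`.

    Assumption: the encoder applies one floor-halving step per factor of 2.
    """
    if factor < 1 or factor & (factor - 1):
        raise ValueError("factor must be a power of 2")
    while factor > 1:
        frames //= 2
        factor //= 2
    return frames


def round_trip_length(frames: int, factor: int) -> int:
    """Temporal length after encoding, then upsampling by the same factor."""
    return latent_length(frames, factor) * factor


def valid_temporal_factors(frames: int, candidates=(1, 2, 4, 8, 16)) -> list:
    """Factors for which the decoder output has the original frame count."""
    return [f for f in candidates if round_trip_length(frames, f) == frames]


if __name__ == "__main__":
    for frames in (10, 16):
        for factor in (2, 4):
            print(
                f"T={frames}, factor={factor}: "
                f"latent={latent_length(frames, factor)}, "
                f"decoded={round_trip_length(frames, factor)}"
            )
        print(f"T={frames}: usable factors {valid_temporal_factors(frames)}")
```

Under these assumptions, a 10-frame clip with a temporal factor of 4 comes back with 8 frames, while a factor of 2 gives 5 latent frames and decodes to 10 again. The requirement is that T be divisible by the temporal factor, not that T itself be a power of 2.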
