Hello,
Thank you for sharing your work!
I'm thinking of preparing a training script based on #27.
Regarding data, I have access to DL3DV (with the COLMAP files as well). Here are some questions I have:
- Did you convert DL3DV to WebDataset format?
- Did you use a particular partition of the DL3DV 10K scenes for train/test? I see that Section 4.1 mentions DL3DV140 for evaluation.
- Did you directly use the COLMAP poses?
- The paper states that the cameras were normalized to [-2,2]. Are the cameras of the complete scene (all frames) normalized first and the images sampled afterwards, or are the images sampled first and only the cameras of those images normalized?
- The paper says: "We find it important to jointly sample I with a smaller sub-sampling stride to ensure sufficient temporal granularity and avoid missing critical transitions with a small probability (0.2 is used in practice)". Could you please clarify what this stride means when sampling the images?
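
To make the normalization and stride questions concrete, here is a minimal sketch of the alternatives I have in mind. Everything in it is my own assumption and not taken from the paper or the repository: the function names are hypothetical, the normalization centers on the bounding-box midpoint and scales isotropically, and the stride sampler is only one possible reading of the quoted sentence (with probability 0.2, use a smaller stride between sampled frames).

```python
import random


def normalize_centers(centers, bound=2.0):
    """Shift and scale camera centers so every coordinate lies in [-bound, bound].

    Assumption: center on the bounding-box midpoint and scale isotropically.
    The paper may use a different scheme.
    """
    dims = range(3)
    lo = [min(c[d] for c in centers) for d in dims]
    hi = [max(c[d] for c in centers) for d in dims]
    mid = [(lo[d] + hi[d]) / 2 for d in dims]
    half = max((hi[d] - lo[d]) / 2 for d in dims)
    scale = bound / half if half > 0 else 1.0
    return [tuple((c[d] - mid[d]) * scale for d in dims) for c in centers]


def normalize_then_sample(centers, idx):
    """Ordering A: normalize all cameras of the scene, then pick the sampled frames."""
    normed = normalize_centers(centers)
    return [normed[i] for i in idx]


def sample_then_normalize(centers, idx):
    """Ordering B: pick the sampled frames, then normalize only their cameras."""
    return normalize_centers([centers[i] for i in idx])


def sample_frame_indices(num_frames, num_views, stride, small_stride,
                         p_small=0.2, rng=random):
    """One possible reading of the stride sentence (unconfirmed):
    with probability p_small, space the sampled frames by small_stride
    instead of the regular stride.
    """
    s = small_stride if rng.random() < p_small else stride
    span = (num_views - 1) * s
    start = rng.randrange(max(1, num_frames - span))
    # Clamp in case the scene is shorter than the requested span.
    return [min(start + k * s, num_frames - 1) for k in range(num_views)]


if __name__ == "__main__":
    # Toy scene: 11 cameras on a line along the x axis.
    centers = [(float(x), 0.0, 0.0) for x in range(11)]
    idx = [0, 1, 2]
    print("normalize, then sample:", normalize_then_sample(centers, idx))
    print("sample, then normalize:", sample_then_normalize(centers, idx))
    print("frame indices:", sample_frame_indices(100, 4, stride=8, small_stride=2))
```

On this toy scene the two orderings give clearly different inputs: the first three cameras end up at x ≈ -2.0, -1.6, -1.2 with ordering A, but at -2.0, 0.0, 2.0 with ordering B. That difference is why I would like to know which one was used.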
Best regards