Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] Could You Provide More Details About Pixel Controller? #8

Open
BlingHe opened this issue Jan 12, 2024 · 2 comments
Open

Comments

@BlingHe
Copy link

BlingHe commented Jan 12, 2024

Hi, the authers, thanks for sharing this great work!

I have some questions about the Pixel Controller.

As you mentioned in the paper, the pixel controller is responsible for integrating the fine-grained object appearance, the image latent of the reference image (named "ip_img" in your code) is introduced by concatenating it with other view images along the "batch" dimension. To the best of my knowledge, the new added "batch item" still need "noise" to achieve mse loss. Could you give more explanation about how to train this pixel controller module?

I will be very grateful for your. Looking forward to your reply!

@haodong2000
Copy link

I am curious about it, also. Unlike Wonder3D and MVDream that leverages multi-view attention without an explicit reference image latent, this pixel controller does need an appropriate process about the predicted noise over ref img latent during training.

@pengwangucla
Copy link

It is very simple, We may just ignored the loss over the last frame.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants