We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请问下OFA是用的那个版本的VQAGAN模型?可否上传下checpont和config.yaml文件或者提供下链接?
The text was updated successfully, but these errors were encountered:
我用的你给的checkpoint zipfile image_gen_large_best.zip中的vqgan/last.ckpt和vqgan/model.yaml,但是这样对256x256编码成token sequence的长度是32x32=1024而不是文中说的16x16=256。 请问是哪里的问题?
image_gen_large_best.zip
Sorry, something went wrong.
或者请问这里的code sequence(长度1024)对应的图片的resolution是多少?256吗?
@PhoebusSi 直接对256x256编码那确实是1024长度。预训练时做的是image infilling,即还原图像中间部分的code,图像中部(128x128分辨率)编码出来的长度才是256
No branches or pull requests
请问下OFA是用的那个版本的VQAGAN模型?可否上传下checpont和config.yaml文件或者提供下链接?
The text was updated successfully, but these errors were encountered: