You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am trying to train on custom dataset (where I have already prepared 1 - 1 image pairs and my seeds.json looks like this [["0000000", ["0"]], ["0000001", ["1"]], ... ) with 3x NVIDIA TITAN RTX 24GB. Initialization of all the models works fine but during validation sanity check I am getting this error:
...
[rank0]: File "/code/instruct-pix2pix/./stable_diffusion/ldm/models/diffusion/ddpm_edit.py", line 892, in forward
[rank0]: return self.p_losses(x, c, t, *args, **kwargs)
[rank0]: File "/code/instruct-pix2pix/./stable_diffusion/ldm/models/diffusion/ddpm_edit.py", line 1043, in p_losses
[rank0]: logvar_t = self.logvar[t].to(self.device)
[rank0]: RuntimeError: indices should be either on cpu or on the same device as the indexed tensor (cpu)
Do you know where it can come from? I did not change anything in source code. Just prepared the data and updated paths in train config.
Thanks in advance!
The text was updated successfully, but these errors were encountered:
Actually, I may know the reason, I think this error may not happen in specified cuda env, such as cu113. I reproduced ip2p last two month based on cu113, there is no such error, but recently I use H100 to train the data, and must use cu118+ in H100 gpus, and the error comes.
Hello,
I am trying to train on custom dataset (where I have already prepared 1 - 1 image pairs and my
seeds.json
looks like this[["0000000", ["0"]], ["0000001", ["1"]], ...
) with 3x NVIDIA TITAN RTX 24GB. Initialization of all the models works fine but during validation sanity check I am getting this error:Do you know where it can come from? I did not change anything in source code. Just prepared the data and updated paths in train config.
Thanks in advance!
The text was updated successfully, but these errors were encountered: