questions about stage 1 training #6

iyuner · 2023-07-27T14:58:06Z

Hi,

Thank you for sharing your code. As mentioned in your paper, there are 2 stages of training: the first trains the denoising diffusion model, and the second focuses on the leapfrog initializer. It seems the repo provides only the code for stage 2 training, which loads the pretrained checkpoint of the denoising diffusion model directly. Could you also provide the code for stage 1 training? Do you use the leapfrog initializer in the first stage? If so, what initial values did you use for the estimated mean, variance, and sample prediction? Thanks!

Frank-Star-fn · 2023-11-07T03:10:32Z

I also have the same requirement and hope to obtain the code for the first stage of training.

kkk00714 · 2024-01-03T13:15:10Z

I have reimplemented the stage 1 training. If you are still interested in it, please contact me.

fangzl123 · 2024-01-09T12:56:18Z

Hi, have you successfully trained stage 1? I've reimplemented it, but when I train the model, the noise estimation loss always gets stuck around 1.0. Thanks for any insights.
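
For reference, a loss near 1.0 is what a network that outputs zero everywhere would score, assuming the loss is the usual mean squared error between sampled unit-variance Gaussian noise and the predicted noise. The sketch below illustrates only that baseline. NumPy arrays stand in for tensors, the shapes are illustrative, and none of this is taken from the repository's code.

```python
import numpy as np

rng = np.random.default_rng(0)

# Target noise for a batch of 250 trajectories, 30 steps, (x, y) each.
# These shapes are illustrative assumptions, not values read from the repo.
eps = rng.standard_normal((250, 30, 2))

# A model that has learned nothing and predicts zero everywhere.
pred_zero = np.zeros_like(eps)

# Mean squared error between the true noise and the predicted noise.
loss_zero = float(np.mean((eps - pred_zero) ** 2))
print(f"zero-predictor loss: {loss_zero:.3f}")
```

A training loss that stays at this level suggests the model's noise prediction carries little or no information about the sampled noise.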

kkk00714 · 2024-01-09T13:22:42Z

Try changing the fut_traj size to (b,1,2) for training, save the model, and then train again with size (b,T,2) and batch size 250. The loss will settle around 0.12, which is close to the 0.06 of the pretrained model.

ShaokangHi · 2024-01-10T11:59:42Z

Hi, do you mean that implementing stage 1 (the pre-trained model) only requires changing the shape? @woyoudian2gou

kkk00714 · 2024-01-10T13:09:10Z

Yes, if you follow the steps I described above, you will get a model that is close to the pre-trained model. The time step T and the batch size are both factors that affect training.

ShaokangHi · 2024-01-12T12:09:38Z

@woyoudian2gou Thank you for your reply. So the config (cfg) would change to:

past_frames : 29
future_frames: 1
min_past_frames: 29
min_future_frames: 1

Should I also change the related parameters in ./trainer/train_led_trajectory_augment_input.py?

Looking forward to your reply! Or could you share your related code via Google Drive or another cloud service? Thanks in advance!

kkk00714 · 2024-01-12T12:16:01Z

No, you should change the shape of fut_traj in the loss call, like Loss_NE(past_traj, fut_traj[:,0,:].unsqueeze(1), traj_mask). Once you have trained this model, change the batch size to 250 and use the original shape of fut_traj to continue training.
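
The reshaping described here can be sketched as follows. NumPy arrays stand in for PyTorch tensors, `first_frame_only` is a hypothetical helper name, and the sizes are illustrative. On a PyTorch tensor the same slice is the `fut_traj[:,0,:].unsqueeze(1)` expression quoted above.

```python
import numpy as np

def first_frame_only(fut_traj: np.ndarray) -> np.ndarray:
    """Keep only the first future frame: (b, T, 2) -> (b, 1, 2)."""
    # Slicing with 0:1 keeps the time axis, so the result stays 3-D.
    return fut_traj[:, 0:1, :]

b, T = 4, 5  # illustrative sizes, not the repository's settings
fut_traj = np.arange(b * T * 2, dtype=np.float32).reshape(b, T, 2)

phase1_target = first_frame_only(fut_traj)  # target for the first training run
phase2_target = fut_traj                    # target when training continues

print(phase1_target.shape, phase2_target.shape)
```

Keeping the time axis at length 1, instead of dropping it, means the loss function still receives a 3-D target in both runs.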

packer-c · 2024-01-14T11:45:16Z

@woyoudian2gou Hello, what does Loss_NE() mean? Where can I find this code? Thanks.

13629281511 · 2024-01-17T03:09:34Z

Hello, I am confused about how to reimplement the stage 1 training. Would you please leave me your contact information?

percybuttons · 2024-05-17T05:21:31Z

Hi,

Thank you very much for sharing your unique training method. I find it very interesting! However, I have a few questions for clarification:

1. You mentioned using fut_traj[:,0,:] during the first training of the diffusion module. Does this mean that only the first frame of fut_traj is used?
2. If so, how many epochs are required for the first training of the diffusion model?
3. For the second training of the diffusion model, when the full fut_traj is used, how many epochs are required?
4. Is the learning rate set the same as mentioned in the paper?

I am looking forward to your response and appreciate your help. @kkk00714
Best regards

kkk00714 · 2024-05-17T12:47:07Z

1. Yes.
2, 3 & 4. Use the same hyperparameters as mentioned in the paper and change the batch size to 250 in stage 1.

percybuttons · 2024-05-17T13:55:38Z

Thank you very much for your response! It resolved my issue and provided immense help. Your suggestion to adjust the batch size from 10 to 250, processing 250*11 agents at a time, is indeed a sensible configuration for a diffusion model. However, my hardware might not support running such a large volume of data simultaneously. I will attempt to use a slightly smaller batch size. Once again, I appreciate your reply! @kkk00714

kkk00714 · 2024-05-17T14:08:12Z

I hope you can successfully replicate the stage 1 training process. The idea of changing the size of future_traj from (b, T, 2) to (b, 1, 2) came from my observation that the noise value of the same sample was almost exactly the same at all 30 time steps. I hope this will help you with your subsequent adjustments.
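
One way to check an observation like this, as a sketch only: given an array of per-step noise values of shape (b, T, 2), measure how much each sample varies along the time axis. The arrays below are synthetic, and `time_axis_spread` is a hypothetical helper, not code from the repository.

```python
import numpy as np

def time_axis_spread(noise: np.ndarray) -> np.ndarray:
    """Per-sample standard deviation across time steps, averaged over x/y.

    noise has shape (b, T, 2); the result has shape (b,).
    """
    return noise.std(axis=1).mean(axis=-1)

rng = np.random.default_rng(0)
b, T = 8, 30  # illustrative sizes

# Synthetic case 1: one noise vector per sample, repeated at every time step.
shared = np.repeat(rng.standard_normal((b, 1, 2)), T, axis=1)
# Synthetic case 2: independent noise at every time step.
independent = rng.standard_normal((b, T, 2))

print(time_axis_spread(shared).max())       # spread of the repeated noise
print(time_axis_spread(independent).min())  # spread of the independent noise
```

A spread near zero on real model outputs would match the behaviour described above, while a spread near 1 would indicate that the noise differs from step to step.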

percybuttons · 2024-05-17T14:16:50Z

Thank you for your kind words and positive outlook! It truly is an intriguing finding, and your ability to implement it effectively showcases your talent. This discovery may not be coincidental at all; it's possible that this approach could be universally applied across diffusion models to yield even better performing ones. Once again, I appreciate your response and insight—it's invaluable for further advancements.

VanHelen · 2024-07-09T14:36:30Z

Hello, I would like to know how you implemented the first stage of denoising training. Did you use the LED module in the first stage of training? Thank you very much!

kkk00714 · 2024-07-09T18:22:49Z

As the paper describes, the LED module is not used in the first training stage. You just need to use the loss_ne function that comes with the author's code to train on fut_traj after changing its shape as I described earlier.
