Epoch counter does not resume when resuming from start checkpoint. #26

dillfrescott · 2024-05-05T19:42:23Z

It seems to reset to 0 every time

ZFTurbo · 2024-05-05T20:37:57Z

Yes, it's not saved in the model data anywhere. Actually, the config could be saved inside too... I need to think about what to save.

It's not actually an error, just missing functionality.

dillfrescott · 2024-05-05T20:39:48Z

Gotcha. I've been manually adjusting the "for epoch in range" values in train.py on every resume, which works, I guess.
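For anyone using the same workaround, here is a minimal sketch of passing the starting epoch on the command line instead of editing the loop by hand. The `--start_epoch` and `--num_epochs` flags and the `epoch_range` helper are hypothetical names for illustration; they are not options of the actual train.py.

```python
import argparse


def parse_args(argv=None):
    """Parse the two hypothetical flags used by this sketch."""
    parser = argparse.ArgumentParser()
    # Hypothetical flag: the epoch index to resume from (0 = fresh run).
    parser.add_argument("--start_epoch", type=int, default=0)
    # Hypothetical flag: the total number of epochs for the whole run.
    parser.add_argument("--num_epochs", type=int, default=100)
    return parser.parse_args(argv)


def epoch_range(start_epoch, num_epochs):
    """Return the epochs still to run, rejecting out-of-range start values."""
    if not 0 <= start_epoch <= num_epochs:
        raise ValueError("start_epoch must be between 0 and num_epochs")
    return range(start_epoch, num_epochs)


if __name__ == "__main__":
    args = parse_args()
    for epoch in epoch_range(args.start_epoch, args.num_epochs):
        print(f"Train epoch: {epoch}")  # the training step would go here
```

This only fixes the counter. The optimizer and scheduler still start from scratch unless their states are restored separately.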

jarredou · 2024-05-06T00:06:03Z

A while ago I started working on a more "resume-friendly" fork with a --resume CLI argument. It saves the optimizer and scheduler states, plus the epoch, best_sdr and last training loss values, within the "last_xxx.ckpt" saved model (wandb logging is included too).
main...jarredou:Music-Source-Separation-Training:wandb+resume

The code is not bulletproof.
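To illustrate the idea described above (not the fork's actual code), here is a rough sketch of bundling the resume state into a single checkpoint dict and restoring it. The function names and dict keys are assumptions made for this example. It only relies on objects exposing `state_dict()` and `load_state_dict()`, as PyTorch models, optimizers and schedulers do.

```python
def build_checkpoint(model, optimizer, scheduler, epoch, best_sdr, last_loss):
    """Collect everything needed to resume into one dict (hypothetical layout)."""
    return {
        "model": model.state_dict(),
        "optimizer": optimizer.state_dict(),
        "scheduler": scheduler.state_dict(),
        "epoch": epoch,          # index of the last completed epoch
        "best_sdr": best_sdr,    # best validation SDR seen so far
        "last_loss": last_loss,  # last training loss value
    }


def restore_checkpoint(ckpt, model, optimizer=None, scheduler=None):
    """Load saved states and return (start_epoch, best_sdr) for the next run."""
    model.load_state_dict(ckpt["model"])
    # Optimizer and scheduler states are optional so older files still load.
    if optimizer is not None and "optimizer" in ckpt:
        optimizer.load_state_dict(ckpt["optimizer"])
    if scheduler is not None and "scheduler" in ckpt:
        scheduler.load_state_dict(ckpt["scheduler"])
    # Resume at the epoch after the last completed one; 0 if none was saved.
    start_epoch = ckpt.get("epoch", -1) + 1
    best_sdr = ckpt.get("best_sdr", float("-inf"))
    return start_epoch, best_sdr
```

In a PyTorch script the dict would typically be written with `torch.save` and read back with `torch.load`, and the training loop would then start from the returned `start_epoch`.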

dillfrescott · 2024-05-06T00:10:24Z

Ah, thank you! @jarredou

dillfrescott · 2024-05-06T02:00:06Z

You should do a PR for that

jarredou · 2024-05-07T17:48:14Z

It would require more work for a PR. Like I said, it's not bulletproof in its current state and can lead to some errors, but for the past few months I haven't had free time to spend on this, unfortunately.

dillfrescott · 2024-05-07T21:21:27Z

Ah, gotcha
