Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sudden stop while training #53

Open
kawaiitakagi opened this issue May 8, 2024 · 4 comments
Open

Sudden stop while training #53

kawaiitakagi opened this issue May 8, 2024 · 4 comments

Comments

@kawaiitakagi
Copy link

When I was training the three-view net, after some epoches( not a specific number, encountered at 7, 20, 90, or other numbers) the train process stopped, without any warnings and errors. The process just stopped and through tqdm model, I could see that the trainer stopped to read data from the dataset( tqdm displayed 0 it/s). I was using a SSH client, with an NVIDIA 4060 GPU.

@layumi
Copy link
Owner

layumi commented May 8, 2024

Hi @kawaiitakagi

Could you provide me more details? or try to use another machine?

@kawaiitakagi
Copy link
Author

Hi @kawaiitakagi

Could you provide me more details? or try to use another machine?

Thank you for your reply. I found that this issue was caused by several classes lacking Google images.

@layumi
Copy link
Owner

layumi commented May 14, 2024

Thank you @kawaiitakagi
I think I have fixed it by https://github.com/layumi/University1652-Baseline/blob/master/folder.py#L102
The issue still exists? Do you find any new solution?

@fangqi99
Copy link

Thank you @kawaiitakagi I think I have fixed it by https://github.com/layumi/University1652-Baseline/blob/master/folder.py#L102 The issue still exists? Do you find any new solution?

I also had a pause at a certain epoch when I ran train_no_street.py, here is my command line python train_no_street.py --name pretrained_two_view_my_model --share --views 2 --droprate 0.75 --stride 1 --h 256 --w 256 --gpu_ids 4

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants