Why is yolov5 training slow? #10254

daigang896 · 2022-11-22T07:56:29Z

Search before asking

I have searched the YOLOv5 issues and discussions and found no similar questions.

Question

Why is yolov5 training slow? Use the yolov5m6 pretraining model. Does anyone have the same problem?

Additional

No response

glenn-jocher · 2022-11-22T18:53:51Z

👋 Hello! Thanks for asking about training speed issues. YOLOv5 🚀 can be trained on CPU (slowest), single-GPU, or multi-GPU (fastest). If you would like to increase your training speed some options are:

Increase --batch-size
Reduce --img-size
Reduce model size, i.e. from YOLOv5x -> YOLOv5l -> YOLOv5m -> YOLOv5s
Train with multi-GPU DDP at larger --batch-size
Train on cached data: python train.py --cache (RAM caching) or --cache disk (disk caching)
Train on faster GPUs, i.e.: P100 -> V100 -> A100
Train on free GPU backends with up to 16GB of CUDA memory:

Good luck 🍀 and let us know if you have any other questions!

daigang896 · 2022-11-23T03:19:01Z

Hello.
Train on a NVIDIA RTX A6000 48G card. When the batchsize is increased, the speed of each iteration becomes slow. But the GPU memory is sufficient, but the speed can not be improved. What is the bottleneck?

Laughing-q · 2022-11-23T07:30:09Z

@daigang896 It seems the bottleneck is data-loading as you've increased batch-size, maybe using more workers will help you.

yolov5/train.py

Line 457 in 7398d2d

    
           parser.add_argument('--workers', type=int, default=8, help='max dataloader workers (per RANK in DDP mode)')

glenn-jocher · 2022-11-23T13:10:03Z

@daigang896 also try --cache ram or --cache disk to reduce dataloading bottlenecks.

daigang896 · 2022-11-24T03:31:08Z

Thanks, I'll try it.

daigang896 · 2022-11-24T07:06:19Z

@glenn-jocher @Laughing-q
Hello，
More --workers values did not work. The set --workers==--batchsize=16 did not find that the training speed of each iteration was faster. The CPU utilization is low. I don't know what the problem is.

daigang896 · 2022-11-24T07:19:22Z

@glenn-jocher
Try -- cache ram found insufficient memory, try -- cache disk found no improvement in training speed.

David-19940718 · 2022-11-25T01:45:08Z

@glenn-jocher Try -- cache ram found insufficient memory, try -- cache disk found no improvement in training speed.

For the most situation, follow the instructions by the author advise will be tackled.

In your case, I think that it may be caused by your machine, you can try another machine and repeate once time if supported.

Note that, it is very important to load the data into the memory, so, don't forget to add this line -- cache ram.

daigang896 · 2022-11-28T01:54:59Z

Hello.
At present, the training speed has been significantly improved and the problem has been solved by updating the card driver, cuda, cudnn and pytorch, and using yolov5 6.2 code.

glenn-jocher · 2022-11-30T03:09:20Z

@daigang896 great!!

github-actions · 2022-12-31T00:20:29Z

👋 Hello, this issue has been automatically marked as stale because it has not had recent activity. Please note it will be closed if no further activity occurs.

Access additional YOLOv5 🚀 resources:

Wiki – https://github.com/ultralytics/yolov5/wiki
Tutorials – https://docs.ultralytics.com/yolov5
Docs – https://docs.ultralytics.com

Access additional Ultralytics ⚡ resources:

Ultralytics HUB – https://ultralytics.com/hub
Vision API – https://ultralytics.com/yolov5
About Us – https://ultralytics.com/about
Join Our Team – https://ultralytics.com/work
Contact Us – https://ultralytics.com/contact

Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!

Thank you for your contributions to YOLOv5 🚀 and Vision AI ⭐!

Robotatron · 2023-01-08T23:02:22Z

I saw no difference in training time with --cache or without when using a SSD, interesting.
Also using a smaller image size (e.g. 160 with batch size of 960) was training SLOWER then using a bigger image size (e.g. 240 with bs of 221)

bartlomiejgadzicki-digica · 2023-02-02T03:43:01Z

Hi there @glenn-jocher, do symlinks affect training speed? I keep multiple versions of my datasets and this way I can avoid storing the same images multiple times. Do you think it can be harmful in any way?

github-actions · 2023-03-05T00:26:17Z

👋 Hello, this issue has been automatically marked as stale because it has not had recent activity. Please note it will be closed if no further activity occurs.

Access additional YOLOv5 🚀 resources:

Wiki – https://github.com/ultralytics/yolov5/wiki
Tutorials – https://docs.ultralytics.com/yolov5
Docs – https://docs.ultralytics.com

Access additional Ultralytics ⚡ resources:

Ultralytics HUB – https://ultralytics.com/hub
Vision API – https://ultralytics.com/yolov5
About Us – https://ultralytics.com/about
Join Our Team – https://ultralytics.com/work
Contact Us – https://ultralytics.com/contact

Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!

Thank you for your contributions to YOLOv5 🚀 and Vision AI ⭐!

glenn-jocher · 2023-11-15T08:09:02Z

@bartlomiejgadzicki-digica symlinks generally do not affect training speed significantly, as they are simply pointers to the original data. However, they can introduce a slight overhead during data loading, so their impact on training speed might be negligible. Maintaining multiple dataset versions through symlinking is a smart storage solution. As long as your data loading and training procedures are not impacted, feel free to continue using symlinks to efficiently manage your datasets.

daigang896 added the question Further information is requested label Nov 22, 2022

github-actions bot added the Stale Stale and schedule for closing soon label Dec 31, 2022

github-actions bot removed the Stale Stale and schedule for closing soon label Jan 9, 2023

jerome-white mentioned this issue Mar 4, 2023

Path resolution in the dataloader #11115

Closed

1 task

github-actions bot added the Stale Stale and schedule for closing soon label Mar 5, 2023

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Mar 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why is yolov5 training slow? #10254

Why is yolov5 training slow? #10254

daigang896 commented Nov 22, 2022

glenn-jocher commented Nov 22, 2022 •

edited by UltralyticsAssistant

Loading

daigang896 commented Nov 23, 2022

Laughing-q commented Nov 23, 2022

glenn-jocher commented Nov 23, 2022

daigang896 commented Nov 24, 2022

daigang896 commented Nov 24, 2022

daigang896 commented Nov 24, 2022

David-19940718 commented Nov 25, 2022 •

edited

Loading

daigang896 commented Nov 28, 2022

glenn-jocher commented Nov 30, 2022

github-actions bot commented Dec 31, 2022 •

edited by glenn-jocher

Loading

Robotatron commented Jan 8, 2023

bartlomiejgadzicki-digica commented Feb 2, 2023

github-actions bot commented Mar 5, 2023 •

edited by glenn-jocher

Loading

glenn-jocher commented Nov 15, 2023

Why is yolov5 training slow? #10254

Why is yolov5 training slow? #10254

Comments

daigang896 commented Nov 22, 2022

Search before asking

Question

Additional

glenn-jocher commented Nov 22, 2022 • edited by UltralyticsAssistant Loading

daigang896 commented Nov 23, 2022

Laughing-q commented Nov 23, 2022

glenn-jocher commented Nov 23, 2022

daigang896 commented Nov 24, 2022

daigang896 commented Nov 24, 2022

daigang896 commented Nov 24, 2022

David-19940718 commented Nov 25, 2022 • edited Loading

daigang896 commented Nov 28, 2022

glenn-jocher commented Nov 30, 2022

github-actions bot commented Dec 31, 2022 • edited by glenn-jocher Loading

Robotatron commented Jan 8, 2023

bartlomiejgadzicki-digica commented Feb 2, 2023

github-actions bot commented Mar 5, 2023 • edited by glenn-jocher Loading

glenn-jocher commented Nov 15, 2023

glenn-jocher commented Nov 22, 2022 •

edited by UltralyticsAssistant

Loading

David-19940718 commented Nov 25, 2022 •

edited

Loading

github-actions bot commented Dec 31, 2022 •

edited by glenn-jocher

Loading

github-actions bot commented Mar 5, 2023 •

edited by glenn-jocher

Loading