Description
Search before asking
- I have searched the HUB issues and discussions and found no similar questions.
Question
I'm looking for the best way to train a YOLO model for mandarin detection with my own dataset.
First, I tried with a pre-trained model (model=YOLO('yolo11s.pt')). I got fairly good results but there were a lot of peaks and troughs in the graphs (see image 1). The execution time was too long.
Second, I tried with the same pre-trained model but freezing the 24 layers of the YOLO model. I got better results (better mAP50-95 and better convergence) but there were still peaks and troughs (see image 2).
Finally, I trained a model from scratch (model=YOLO('yolo11s.yaml')) and got the best result so far (see image 3).
My question is: If you look at the vertical axis of the train/box_loss, val/box_loss, train/cls_loss, val/cls_loss, train/dfl_loss and val/dfl_loss graphs, they vary a lot between the pre-trained model and the new model. Does anyone know why these ranges are so different, one has values between 1 and 0, the other has values between 5 and 0?
(imagen 1)
(imagen 2)
(imagen 3)
Additional
No response