Search before asking
Question
Hi,
I have two related questions about the built-in Ultralytics trainer:
1. Training uses all images, not just annotated ones
I have a dataset of ~600 images, of which 115 are annotated (visible in the Data tab as "115 Labels in Total" and marked as done in the bottom-right corner). However, after training I noticed that all 600 images were included in the internal dataset generated under xanylabeling_data/trainer/ultralytics/datasets/detect/.
Is there a parameter, option, or workaround to make the trainer only include annotated images? My goal is to train a custom model on the 115 annotated images and use it for auto-labeling the remaining ones.
2. What should the data.yaml contain / what is actually used?
In the Config tab, the Data field requires a data.yaml file. I exported my dataset using "Export YOLO-HBB Annotations" with "Skip empty labels" enabled, and created a data.yaml pointing to the exported folder. However, the trainer ignores the path, train, and val fields and generates its own internal dataset instead.
What is the data.yaml actually used for in this context? Is only nc and names relevant? And what is the recommended workflow here?
Thanks!
Additional
No response
Search before asking
Question
Hi,
I have two related questions about the built-in Ultralytics trainer:
1. Training uses all images, not just annotated ones
I have a dataset of ~600 images, of which 115 are annotated (visible in the Data tab as "115 Labels in Total" and marked as done in the bottom-right corner). However, after training I noticed that all 600 images were included in the internal dataset generated under
xanylabeling_data/trainer/ultralytics/datasets/detect/.Is there a parameter, option, or workaround to make the trainer only include annotated images? My goal is to train a custom model on the 115 annotated images and use it for auto-labeling the remaining ones.
2. What should the data.yaml contain / what is actually used?
In the Config tab, the
Datafield requires adata.yamlfile. I exported my dataset using "Export YOLO-HBB Annotations" with "Skip empty labels" enabled, and created adata.yamlpointing to the exported folder. However, the trainer ignores thepath,train, andvalfields and generates its own internal dataset instead.What is the
data.yamlactually used for in this context? Is onlyncandnamesrelevant? And what is the recommended workflow here?Thanks!
Additional
No response