Sub-dataset selection method #2

BreziTasbi · 2024-05-16T18:02:24Z

To train this contrast classifier, I have access to a vast dataset sourced from NeuroPoly servers and OpenNeuro. To maximize the utility of this data, I aim to create a balanced and diverse dataset to develop a robust model. I particularly want the model to learn the relationship between image content and contrast, rather than the specific characteristics of my sub-dataset and the contrast (such as resolution, orientation, framing).

- Balance among contrasts will be ensured by assigning weights relative to their representation in the dataset (upsampling).

- Data augmentation will simulate variations in framing, orientation, and resolution through random crops, rotations, and downscalings.

- I will estimate the dataset's bias based on different characteristics by evaluating the performance of basic classifiers trained exclusively with these data. The worse these classifiers, the better the dataset.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sub-dataset selection method #2

Sub-dataset selection method #2

BreziTasbi commented May 16, 2024

Sub-dataset selection method #2

Sub-dataset selection method #2

Comments

BreziTasbi commented May 16, 2024